A company is collecting a large amount of data from a fleet of IoT devices. Data is stored as Optimized Row Columnar (ORC) files in the Hadoop Distributed File System (HDFS) on a persistent Amazon EMR cluster. The company's data analytics team queries the data by using SQL in Apache Presto deployed on the same EMR cluster. Queries scan large amounts of data, always run for less than 15 minutes, and run only between 5 PM and 10 PM.
The company is concerned about the high cost associated with the current solution. A solutions architect must propose the most cost-effective solution that will allow SQL data queries.
Which solution will meet these requirements?
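A quick way to see where the cost concern comes from is to work out how much of each day the cluster does no query work. The sketch below uses only the 5 PM to 10 PM window stated in the question. It assumes the persistent cluster runs around the clock, and the function names are illustrative, not part of any AWS tooling.

```python
from datetime import time

# Query window taken from the question: 5 PM to 10 PM, every day.
WINDOW_START = time(17, 0)
WINDOW_END = time(22, 0)


def active_hours(start: time, end: time) -> float:
    """Hours between two same-day clock times."""
    return (end.hour + end.minute / 60) - (start.hour + start.minute / 60)


def idle_fraction(start: time, end: time) -> float:
    """Share of a 24-hour day that falls outside the query window.

    Assumption: the cluster is billed for all 24 hours because it is persistent.
    """
    return 1 - active_hours(start, end) / 24


if __name__ == "__main__":
    hours = active_hours(WINDOW_START, WINDOW_END)
    idle = idle_fraction(WINDOW_START, WINDOW_END)
    print(f"Active hours per day: {hours:.0f}")
    print(f"Idle share of the day: {idle:.1%}")
```

Under that assumption, queries occupy 5 of every 24 hours, so the cluster sits idle for 19 hours a day, which is the usage pattern any cost-effective answer has to address.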