exam questions

Exam AWS Certified Solutions Architect - Professional All Questions

View all questions & answers for the AWS Certified Solutions Architect - Professional exam

Exam AWS Certified Solutions Architect - Professional topic 1 question 902 discussion

A company is collecting a large amount of data from a fleet of IoT devices. Data is stored as Optimized Row Columnar (ORC) files in the Hadoop Distributed File
System (HDFS) on a persistent Amazon EMR cluster. The company's data analytics team queries the data by using SQL in Apache Presto deployed on the same
EMR cluster. Queries scan large amounts of data, always run for less than 15 minutes, and run only between 5 PM and 10 PM.
The company is concerned about the high cost associated with the current solution. A solutions architect must propose the most cost-effective solution that will allow SQL data queries.
Which solution will meet these requirements?

  • A. Store data in Amazon S3. Use Amazon Redshift Spectrum to query data.
  • B. Store data in Amazon S3. Use the AWS Glue Data Catalog and Amazon Athena to query data.
  • C. Store data in EMR File System (EMRFS). Use Presto in Amazon EMR to query data.
  • D. Store data in Amazon Redshift. Use Amazon Redshift to query data.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
ArreRaja
Highly Voted 2 years, 11 months ago
It is B. You need Redshift cluster to run Redshift spectrum which is expensive. Cost per query is same in Redshift spectrum vs Athena though Athena is not for complex or parallel queries with large data set
upvoted 5 times
...
ggrodskiy
Most Recent 2 years, 1 month ago
Correct b
upvoted 1 times
...
janvandermerwer
2 years, 9 months ago
Selected Answer: B
Process of elimination -> Athena is usually cheaper than redshift for smaller jobs. EMR File system sounds expensive
upvoted 1 times
...
athenaguo
2 years, 10 months ago
Selected Answer: D
https://aws.amazon.com/cn/premiumsupport/knowledge-center/configure-emr-s3-hadoop-storage/
upvoted 1 times
...
JohnPi
2 years, 10 months ago
Selected Answer: B
Athena uses Presto
upvoted 1 times
...
cale
2 years, 11 months ago
Selected Answer: B
It is B
upvoted 2 times
...
azureaspirant
2 years, 11 months ago
Redshift Spectrum is for scanning large amount of data
upvoted 1 times
...
Rocketeer
2 years, 11 months ago
Should be B
upvoted 2 times
...
Johannes1509
2 years, 11 months ago
Selected Answer: B
The answer should be #B. Athena is less expensive as Redshift Spectrum (https://stackoverflow.com/questions/50250114/athena-vs-redshift-spectrum)
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...