exam questions

Exam AWS Certified Solutions Architect - Associate SAA-C02 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C02 exam

Exam AWS Certified Solutions Architect - Associate SAA-C02 topic 1 question 761 discussion

A company wants to measure the effectiveness of its recent marketing campaigns. The company performs batch processing on .csv files of sales data and stores the results in an Amazon S3 bucket once every hour. The S3 bucket contains petabytes of objects. The company runs one-time queries in Amazon Athena to determine which products are most popular on a particular date for a particular region. Queries sometimes fail or take longer than expected to finish running.
Which actions should a solutions architect take to improve the query performance and reliability? (Choose two.)

  • A. Reduce the S3 object sizes to less than 128 MB.
  • B. Partition the data by date and region in Amazon S3.
  • C. Store the files as large, single objects in Amazon S3.
  • D. Use Amazon Kinesis Data Analytics to run the queries as part of the batch processing operation.
  • E. Use an AWS Glue extract, transform, and load (ETL) process to convert the .csv files into Apache Parquet format.
Show Suggested Answer Hide Answer
Suggested Answer: AB 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
attila9778
Highly Voted 2 years, 8 months ago
B, E As per: https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena/ we can use: 1. Partition your data, and 5. Optimize columnar data store generation A is not an optimal choice, detailed in 4. Optimize file sizes: "However, if your files are too small (generally less than 128 MB), the execution engine might be spending additional time with the overhead of opening S3 files, listing directories, getting object metadata, setting up data transfer, reading file headers, reading compression dictionaries, and so on." C also not the right choice: "On the other hand, if your file is not splittable and the files are too large, the query processing waits until a single reader has completed reading the entire file. That can reduce parallelism."
upvoted 8 times
...
BECAUSE
Most Recent 2 years ago
Selected Answer: BE
B and E are the answers
upvoted 2 times
...
ruqui
2 years ago
Selected Answer: BC
According to https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena, answer should be B, C
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...