exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 147 discussion

A reseller that has thousands of AWS accounts receives AWS Cost and Usage Reports in an Amazon S3 bucket. The reports are delivered to the S3 bucket in the following format:
<example-report-prefix>/<example-report-name>/yyyymmdd-yyyymmdd/<example-report-name>.parquet
An AWS Glue crawler crawls the S3 bucket and populates an AWS Glue Data Catalog with a table. Business analysts use Amazon Athena to query the table and create monthly summary reports for the AWS accounts. The business analysts are experiencing slow queries because of the accumulation of reports from the last
5 years. The business analysts want the operations team to make changes to improve query performance.
Which action should the operations team take to meet these requirements?

  • A. Change the file format to .csv.zip
  • B. Partition the data by date and account ID
  • C. Partition the data by month and account ID
  • D. Partition the data by account ID, year, and month
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Ramshizzle
Highly Voted 2 years, 10 months ago
Selected Answer: D
Should be D. We want to create monthly reports for each account. So we want to query the data by account-id and month. Only month is not enough, we have to add year, otherwise we query the previous year's months as well.
upvoted 8 times
...
f4bi4n
Highly Voted 2 years, 11 months ago
Selected Answer: D
should be D, by date is too precise and by account helps as well
upvoted 6 times
...
pk349
Most Recent 2 years ago
D: I passed the test
upvoted 1 times
...
mawsman
2 years, 1 month ago
Selected Answer: D
Date/account id partitioning would create a partition for each day thus the analysts would need to ingest a different date range of partitions for each account int other analysis each month. Account ID/year/month would more accurately represent the query pattern and avoid the need for analysts to specify a data range. hence D
upvoted 1 times
...
rocky48
2 years, 9 months ago
Selected Answer: D
Selected Answer: D
upvoted 2 times
...
Richie1217
2 years, 9 months ago
Selected Answer: D
Per account and monthly
upvoted 1 times
...
Bik000
2 years, 11 months ago
Selected Answer: B
Answer is B
upvoted 1 times
...
[Removed]
3 years ago
Selected Answer: D
monthly summary reports for the AWS accounts.
upvoted 2 times
CHRIS12722222
3 years ago
I concur
upvoted 1 times
...
...
CHRIS12722222
3 years ago
B looks good to me
upvoted 3 times
CHRIS12722222
3 years ago
chnge to D
upvoted 4 times
...
...
rb39
3 years ago
Selected Answer: B
Partition by date is the good practice here
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago