Exam AWS Certified Data Analytics - Specialty topic 1 question 146 discussion

Exam question from Amazon's AWS Certified Data Analytics - Specialty

Question #: 146
Topic #: 1

A data engineer is using AWS Glue ETL jobs to process data at frequent intervals. The processed data is then copied into Amazon S3. The ETL jobs run every 15 minutes. The AWS Glue Data Catalog partitions need to be updated automatically after the completion of each job.
Which solution will meet these requirements MOST cost-effectively?

A. Use the AWS Glue Data Catalog to manage the data catalog. Define an AWS Glue workflow for the ETL process. Define a trigger within the workflow that can start the crawler when an ETL job run is complete.
B. Use the AWS Glue Data Catalog to manage the data catalog. Use AWS Glue Studio to manage ETL jobs. Use the AWS Glue Studio feature that supports updates to the AWS Glue Data Catalog during job runs.
C. Use an Apache Hive metastore to manage the data catalog. Update the AWS Glue ETL code to include the enableUpdateCatalog and partitionKeys arguments.
D. Use the AWS Glue Data Catalog to manage the data catalog. Update the AWS Glue ETL code to include the enableUpdateCatalog and partitionKeys arguments.

Suggested Answer: D
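
For readers who have not seen the arguments named in option D, here is a minimal sketch of how they are typically passed when a Glue ETL job writes to a table that already exists in the Data Catalog. The helper names (`build_catalog_update_options`, `write_and_update_catalog`) and the database, table, and partition column values are hypothetical; the sketch assumes the job writes through `write_dynamic_frame_from_catalog` on a `GlueContext`, which may not match how a given job is structured.

```python
def build_catalog_update_options(partition_keys):
    """Build the sink options that ask Glue to update the Data Catalog.

    enableUpdateCatalog and partitionKeys are the two arguments named in
    option D; the helper itself is illustrative, not part of any AWS library.
    """
    return {
        "enableUpdateCatalog": True,
        "partitionKeys": list(partition_keys),
    }


def write_and_update_catalog(glue_context, frame, database, table_name, partition_keys):
    """Write `frame` to a catalog table and register new partitions in the same job run.

    `glue_context` is expected to behave like awsglue.context.GlueContext.
    It is passed in as a parameter so this sketch has no import-time
    dependency on the Glue runtime.
    """
    return glue_context.write_dynamic_frame_from_catalog(
        frame=frame,
        database=database,
        table_name=table_name,
        transformation_ctx="write_sink",
        additional_options=build_catalog_update_options(partition_keys),
    )
```

Inside a real job, `glue_context` would be the job's `GlueContext` and `frame` the `DynamicFrame` produced by the last transform. Because the partitions are registered as part of the write, no separate crawler run is needed after each 15-minute job.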

by astalavista1 at April 23, 2022, 10:24 p.m.

Comments

lovelazur

1 year, 3 months ago

Selected Answer: D

D is the best option because it takes the least effort.

upvoted 2 times

pk349

2 years, 2 months ago

D: I passed the test

upvoted 4 times

silvaa360

2 years, 6 months ago

Selected Answer: D

Although A is a workable solution, and even makes it easier to see what is happening, creating a workflow with a trigger that starts a crawler will cost more than putting these options in the ETL code. So, D for sure.

upvoted 4 times
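
To put a rough number on the cost argument, the sketch below estimates what a crawler triggered after every job run would add per day. All inputs are assumptions to be replaced with real values: the DPU count, the per-DPU-hour rate, and the minimum billed duration per crawler run should be checked against the current AWS Glue pricing page. Only the 15-minute interval comes from the question.

```python
def runs_per_day(interval_minutes):
    """Number of job (and therefore crawler) runs in 24 hours."""
    return (24 * 60) // interval_minutes


def daily_crawler_cost(interval_minutes, crawl_minutes, dpus,
                       rate_per_dpu_hour, minimum_billed_minutes):
    """Estimate the daily cost of running a crawler after every job run.

    Every argument except interval_minutes is a placeholder assumption;
    none of them is taken from the question or the discussion.
    """
    # Each run is billed for at least the assumed minimum duration.
    billed_minutes = max(crawl_minutes, minimum_billed_minutes)
    dpu_hours = runs_per_day(interval_minutes) * billed_minutes / 60 * dpus
    return dpu_hours * rate_per_dpu_hour
```

With a 15-minute interval, `runs_per_day(15)` gives 96 crawler runs per day. Option D avoids all of them, since the partition update happens inside a job run that is being paid for anyway.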

lkarwot

2 years, 8 months ago

Selected Answer: B

"Business analysts use Amazon Athena to query the table and create monthly summary reports for the AWS accounts." Given the above, the data should be partitioned by date first in order to calculate a summary report for all accounts for a particular month. The correct answer is B.

upvoted 2 times
