Exam AWS Certified Data Analytics - Specialty topic 1 question 146 discussion

A data engineer is using AWS Glue ETL jobs to process data at frequent intervals. The processed data is then copied into Amazon S3. The ETL jobs run every 15 minutes. The AWS Glue Data Catalog partitions need to be updated automatically after the completion of each job.
Which solution will meet these requirements MOST cost-effectively?

  • A. Use the AWS Glue Data Catalog to manage the data catalog. Define an AWS Glue workflow for the ETL process. Define a trigger within the workflow that can start the crawler when an ETL job run is complete.
  • B. Use the AWS Glue Data Catalog to manage the data catalog. Use AWS Glue Studio to manage ETL jobs. Use the AWS Glue Studio feature that supports updates to the AWS Glue Data Catalog during job runs.
  • C. Use an Apache Hive metastore to manage the data catalog. Update the AWS Glue ETL code to include the enableUpdateCatalog and partitionKeys arguments.
  • D. Use the AWS Glue Data Catalog to manage the data catalog. Update the AWS Glue ETL code to include the enableUpdateCatalog and partitionKeys arguments.
Suggested Answer: D
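
For readers unfamiliar with option D, here is a minimal sketch of what passing `enableUpdateCatalog` and `partitionKeys` in the ETL code might look like, following the pattern on the AWS documentation page linked in the comments below. The helper name `write_and_update_catalog`, the S3 path, the database and table names, the partition columns, and the JSON output format are all illustrative assumptions, not part of the question.

```python
def write_and_update_catalog(glue_context, frame, s3_path, database, table, partition_keys):
    """Write a DynamicFrame to S3 and let the job itself register new partitions.

    `glue_context` is expected to be an awsglue GlueContext and `frame` a
    DynamicFrame; every concrete name passed in is hypothetical.
    """
    sink = glue_context.getSink(
        connection_type="s3",
        path=s3_path,
        enableUpdateCatalog=True,            # update the Data Catalog during the job run
        partitionKeys=list(partition_keys),  # columns that define the partitions
    )
    sink.setFormat("json")  # assumed output format for this sketch
    sink.setCatalogInfo(catalogDatabase=database, catalogTableName=table)
    sink.writeFrame(frame)
    return sink


# Inside a Glue job script this might be called roughly as follows (assumed names):
#
#   from awsglue.context import GlueContext
#   from pyspark.context import SparkContext
#
#   glue_context = GlueContext(SparkContext.getOrCreate())
#   write_and_update_catalog(
#       glue_context, transformed_frame,
#       "s3://example-bucket/processed/", "example_db", "example_table",
#       ["year", "month", "day"],
#   )
```

Because the partitions are registered as part of the write, no separate crawler run is needed after each 15-minute job, which is the cost argument most commenters make for D.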

Comments

lovelazur
1 year, 1 month ago
Selected Answer: D
D is the best option, and it takes the least effort.
upvoted 2 times
...
pk349
2 years ago
D: I passed the test
upvoted 4 times
...
silvaa360
2 years, 4 months ago
Selected Answer: D
Although A can be a very good solution, since it is feasible and makes what is happening more visible, creating a workflow and a trigger that runs a crawler will of course cost more than putting these options in the ETL code. So, D for sure.
upvoted 4 times
...
lkarwot
2 years, 5 months ago
Selected Answer: B
"Business analysts use Amazon Athena to query the table and create monthly summary reports for the AWS accounts" Given the above, data should be partitioned by date first in order to calculate summary report for all accounts for a particular month. Correct answer is B
upvoted 2 times
...
MultiCloudIronMan
2 years, 6 months ago
I choose D as well
upvoted 1 times
...
rocky48
2 years, 9 months ago
Selected Answer: D
I agree with D
upvoted 1 times
...
arboles
2 years, 9 months ago
Selected Answer: D
D is cost effective
upvoted 2 times
...
dushmantha
2 years, 10 months ago
Selected Answer: D
I agree with D
upvoted 1 times
...
Teraxs
3 years ago
Selected Answer: D
D - most cost effective as not rerunning the crawler https://docs.aws.amazon.com/glue/latest/dg/update-from-job.html
upvoted 3 times
...
wata9821
3 years ago
answer: D https://docs.aws.amazon.com/glue/latest/dg/update-from-job.html
upvoted 2 times
...
siju13
3 years ago
D seems correct
upvoted 2 times
...
[Removed]
3 years ago
Selected Answer: D
as per the document
upvoted 1 times
...
astalavista1
3 years ago
D - According to doc- https://docs.aws.amazon.com/glue/latest/dg/update-from-job.html
upvoted 2 times
...