exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 117 discussion

A large telecommunications company is planning to set up a data catalog and metadata management for multiple data sources running on AWS. The catalog will be used to maintain the metadata of all the objects stored in the data stores. The data stores are composed of structured sources like Amazon RDS and Amazon
Redshift, and semistructured sources like JSON and XML files stored in Amazon S3. The catalog must be updated on a regular basis, be able to detect the changes to object metadata, and require the least possible administration.
Which solution meets these requirements?

  • A. Use Amazon Aurora as the data catalog. Create AWS Lambda functions that will connect and gather the metadata information from multiple sources and update the data catalog in Aurora. Schedule the Lambda functions periodically.
  • B. Use the AWS Glue Data Catalog as the central metadata repository. Use AWS Glue crawlers to connect to multiple data stores and update the Data Catalog with metadata changes. Schedule the crawlers periodically to update the metadata catalog.
  • C. Use Amazon DynamoDB as the data catalog. Create AWS Lambda functions that will connect and gather the metadata information from multiple sources and update the DynamoDB catalog. Schedule the Lambda functions periodically.
  • D. Use the AWS Glue Data Catalog as the central metadata repository. Extract the schema for RDS and Amazon Redshift sources and build the Data Catalog. Use AWS crawlers for data stored in Amazon S3 to infer the schema and automatically update the Data Catalog.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
srinivasa
Highly Voted 3 years, 8 months ago
Answer: B https://docs.aws.amazon.com/glue/latest/dg/crawler-data-stores.html
upvoted 17 times
...
cloudlearnerhere
Highly Voted 2 years, 7 months ago
Correct answer is B as AWS Glue Data Catalog can act as the central metadata repository with Glue Crawlers which can connect to multiple data stores and update the Data Catalog with metadata changes. Options A & C are wrong they would increase the administration work. Option D is wrong as Glue Crawlers can connect to all the mentioned datastores. https://docs.aws.amazon.com/glue/latest/dg/crawler-data-stores.html
upvoted 7 times
cloudlearnerhere
2 years, 7 months ago
Option D Is wrong and extracting schema is a manual work
upvoted 2 times
...
...
pk349
Most Recent 2 years, 1 month ago
B: I passed the test
upvoted 2 times
...
rocky48
2 years, 10 months ago
Selected Answer: B
Selected Answer: B
upvoted 3 times
...
Bik000
3 years ago
Selected Answer: B
My Answer is B
upvoted 2 times
...
moon2351
3 years, 3 months ago
Selected Answer: B
Answer is B
upvoted 2 times
...
ali98
3 years, 7 months ago
Answer B
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...