Exam AWS Certified Data Analytics - Specialty topic 1 question 117 discussion

Exam question from Amazon's AWS Certified Data Analytics - Specialty

Question #: 117
Topic #: 1

[All AWS Certified Data Analytics - Specialty Questions]

A large telecommunications company is planning to set up a data catalog and metadata management for multiple data sources running on AWS. The catalog will be used to maintain the metadata of all the objects stored in the data stores. The data stores are composed of structured sources like Amazon RDS and Amazon
Redshift, and semistructured sources like JSON and XML files stored in Amazon S3. The catalog must be updated on a regular basis, be able to detect the changes to object metadata, and require the least possible administration.
Which solution meets these requirements?

A. Use Amazon Aurora as the data catalog. Create AWS Lambda functions that will connect and gather the metadata information from multiple sources and update the data catalog in Aurora. Schedule the Lambda functions periodically.
B. Use the AWS Glue Data Catalog as the central metadata repository. Use AWS Glue crawlers to connect to multiple data stores and update the Data Catalog with metadata changes. Schedule the crawlers periodically to update the metadata catalog.
C. Use Amazon DynamoDB as the data catalog. Create AWS Lambda functions that will connect and gather the metadata information from multiple sources and update the DynamoDB catalog. Schedule the Lambda functions periodically.
D. Use the AWS Glue Data Catalog as the central metadata repository. Extract the schema for RDS and Amazon Redshift sources and build the Data Catalog. Use AWS crawlers for data stored in Amazon S3 to infer the schema and automatically update the Data Catalog.

Show Suggested Answer

Suggested Answer: B 🗳️

by srinivasa at Oct. 25, 2021, 4:38 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

srinivasa

Highly Voted 3 years, 8 months ago

Answer: B https://docs.aws.amazon.com/glue/latest/dg/crawler-data-stores.html

upvoted 17 times

...

cloudlearnerhere

Highly Voted 2 years, 7 months ago

Correct answer is B as AWS Glue Data Catalog can act as the central metadata repository with Glue Crawlers which can connect to multiple data stores and update the Data Catalog with metadata changes. Options A & C are wrong they would increase the administration work. Option D is wrong as Glue Crawlers can connect to all the mentioned datastores. https://docs.aws.amazon.com/glue/latest/dg/crawler-data-stores.html

upvoted 7 times