exam questions

Exam AWS Certified Big Data - Specialty All Questions

View all questions & answers for the AWS Certified Big Data - Specialty exam

Exam AWS Certified Big Data - Specialty topic 1 question 1 discussion

Exam question from Amazon's AWS Certified Big Data - Specialty
Question #: 1
Topic #: 1
[All AWS Certified Big Data - Specialty Questions]

A data engineer in a manufacturing company is designing a data processing platform that receives a large volume of unstructured data. The data engineer must populate a well-structured star schema in Amazon
Redshift.
What is the most efficient architecture strategy for this purpose?

  • A. Transform the unstructured data using Amazon EMR and generate CSV data. COPY the CSV data into the analysis schema within Redshift.
  • B. Load the unstructured data into Redshift, and use string parsing functions to extract structured data for inserting into the analysis schema.
  • C. When the data is saved to Amazon S3, use S3 Event Notifications and AWS Lambda to transform the file contents. Insert the data into the analysis schema on Redshift.
  • D. Normalize the data using an AWS Marketplace ETL tool, persist the results to Amazon S3, and use AWS Lambda to INSERT the data into Redshift.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Corram
Highly Voted 3 years, 7 months ago
A is correct. Not B - never load unstructured data to Redshift Not C - s3 event + lambda would be more suitable for incremental, continuous S3-Redshift integration. Here, we have one large bulk load, so event notifications don't make sense and lambda may not be able to handle all transformation in one call due to service limits. Not D - Normalization is the act of adjusting values on a scale, usually subtracting mean and dividing by standard deviation. That doesn't make sense here.
upvoted 9 times
...
TomHanks
Most Recent 3 years, 7 months ago
A is correct
upvoted 1 times
...
NikkyDicky
3 years, 7 months ago
A for sure
upvoted 1 times
...
yuvaraj228
3 years, 7 months ago
A is right
upvoted 1 times
...
san2020
3 years, 7 months ago
Selected A.
upvoted 1 times
...
shouvanik
3 years, 8 months ago
option a is correct. Using EMR, we can process un-structured data, and put schema on top of it before saving it to s3
upvoted 3 times
...
kalpanareddy
3 years, 8 months ago
Answer is A
upvoted 2 times
...
M2
3 years, 9 months ago
Answer is A.
upvoted 2 times
...
exams
3 years, 9 months ago
answer a is correct
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...