exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 229 discussion

A company wants to ingest streaming data into an Amazon Redshift data warehouse from an Amazon Managed Streaming for Apache Kafka (Amazon MSK) cluster. A data engineer needs to develop a solution that provides low data access time and that optimizes storage costs.

Which solution will meet these requirements with the LEAST operational overhead?

  • A. Create an external schema that maps to the MSK cluster. Create a materialized view that references the external schema to consume the streaming data from the MSK topic.
  • B. Develop an AWS Glue streaming extract, transform, and load (ETL) job to process the incoming data from Amazon MSK. Load the data into Amazon S3. Use Amazon Redshift Spectrum to read the data from Amazon S3.
  • C. Create an external schema that maps to the streaming data source. Create a new Amazon Redshift table that references the external schema.
  • D. Create an Amazon S3 bucket. Ingest the data from Amazon MSK. Create an event-driven AWS Lambda function to load the data from the S3 bucket to a new Amazon Redshift table.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
rdiaz
1 month, 2 weeks ago
Selected Answer: B
AWS Glue streaming ETL supports reading from Amazon MSK, applying transformations, and writing the result to Amazon S3 in near-real time. • Amazon Redshift Spectrum allows querying data directly from S3 without loading it into Redshift storage, which helps optimize storage costs. • This solution: • Provides low-latency data access via Spectrum. • Avoids duplicating or materializing unnecessary data in Redshift. • Has low operational overhead because Glue streaming jobs are managed, scalable, and require minimal infrastructure maintenance.
upvoted 1 times
...
XP_2600
1 month, 2 weeks ago
Selected Answer: B
Seems B
upvoted 1 times
...
Tani0908
1 month, 3 weeks ago
Selected Answer: A
Streaming data can be directly ingested from Amazon MSK to Redshift using external schemas and materialized views, there is no need for etl process or s3 in between
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...