Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 29 discussion

Exam question from Amazon's AWS Certified Data Engineer - Associate DEA-C01

Question #: 29
Topic #: 1

[All AWS Certified Data Engineer - Associate DEA-C01 Questions]

A data engineer needs to use AWS Step Functions to design an orchestration workflow. The workflow must parallel process a large collection of data files and apply a specific transformation to each file.
Which Step Functions state should the data engineer use to meet these requirements?

A. Parallel state
B. Choice state
C. Map state
D. Wait state

Show Suggested Answer

Suggested Answer: C 🗳️

by rralucard_ at Feb. 4, 2024, 1:46 p.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

GabrielSGoncalves

11 months, 3 weeks ago

Selected Answer: C

Clearly is mapping state

upvoted 1 times

...

pypelyncar

1 year, 2 months ago

Selected Answer: C

The Map state allows you to define a single execution path for processing a collection of data items in parallel. This aligns perfectly with the data engineer's requirement of parallel processing a large collection of data files

upvoted 3 times

...

FunkyFresco

1 year, 2 months ago

Selected Answer: C

to execute in parallel

upvoted 1 times

...

sveni1502

1 year, 2 months ago

Selected Answer: C

C is Correct To meet the requirement of parallel processing a large collection of data files and applying a specific transformation to each file, the data engineer should use the Map state in AWS Step Functions. The Map state is specifically designed to run a set of tasks in parallel for each element in a collection or array. Each element (in this case, each data file) is processed independently and in parallel, allowing the workflow to take advantage of parallel processing.

upvoted 3 times

...

lucas_rfsb

1 year, 4 months ago

Selected Answer: C

C, Map state is correct

upvoted 1 times

...

Aesthet

1 year, 6 months ago

With Step Functions, you can orchestrate large-scale parallel workloads to perform tasks, such as on-demand processing of semi-structured data. These parallel workloads let you concurrently process large-scale data sources stored in Amazon S3. For example, you might process a single JSON or CSV file that contains large amounts of data. Or you might process a large set of Amazon S3 objects. To set up a large-scale parallel workload in your workflows, include a Map state in Distributed mode.

upvoted 1 times

...

Aesthet

1 year, 6 months ago

C is correct. Map state is designed precisely for the requirement described. It allows you to iterate over a collection of items, processing each item individually. The Map state can automatically manage the iteration and execute the specified transformation on each item in parallel, making it the perfect choice for parallel processing of a large collection of data files.

upvoted 1 times

...

rralucard_

1 year, 6 months ago

Selected Answer: C

The Map state is specifically designed for processing a collection of items (like data files) in parallel. It allows you to apply a transformation or a set of steps to each item in the input array independently. The Map state automatically iterates over each item in the array and performs the defined steps. This makes it ideal for scenarios where you need to process a large number of files in a similar manner, as in your requirement.

upvoted 2 times

...