exam questions

Exam AWS Certified Data Engineer - Associate DEA-C01 All Questions

View all questions & answers for the AWS Certified Data Engineer - Associate DEA-C01 exam

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 29 discussion

A data engineer needs to use AWS Step Functions to design an orchestration workflow. The workflow must parallel process a large collection of data files and apply a specific transformation to each file.
Which Step Functions state should the data engineer use to meet these requirements?

  • A. Parallel state
  • B. Choice state
  • C. Map state
  • D. Wait state
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
GabrielSGoncalves
9 months, 1 week ago
Selected Answer: C
Clearly is mapping state
upvoted 1 times
...
pypelyncar
12 months ago
Selected Answer: C
The Map state allows you to define a single execution path for processing a collection of data items in parallel. This aligns perfectly with the data engineer's requirement of parallel processing a large collection of data files
upvoted 3 times
...
FunkyFresco
1 year ago
Selected Answer: C
to execute in parallel
upvoted 1 times
...
sveni1502
1 year ago
Selected Answer: C
C is Correct To meet the requirement of parallel processing a large collection of data files and applying a specific transformation to each file, the data engineer should use the Map state in AWS Step Functions. The Map state is specifically designed to run a set of tasks in parallel for each element in a collection or array. Each element (in this case, each data file) is processed independently and in parallel, allowing the workflow to take advantage of parallel processing.
upvoted 3 times
...
lucas_rfsb
1 year, 2 months ago
Selected Answer: C
C, Map state is correct
upvoted 1 times
...
Aesthet
1 year, 4 months ago
With Step Functions, you can orchestrate large-scale parallel workloads to perform tasks, such as on-demand processing of semi-structured data. These parallel workloads let you concurrently process large-scale data sources stored in Amazon S3. For example, you might process a single JSON or CSV file that contains large amounts of data. Or you might process a large set of Amazon S3 objects. To set up a large-scale parallel workload in your workflows, include a Map state in Distributed mode.
upvoted 1 times
...
Aesthet
1 year, 4 months ago
C is correct. Map state is designed precisely for the requirement described. It allows you to iterate over a collection of items, processing each item individually. The Map state can automatically manage the iteration and execute the specified transformation on each item in parallel, making it the perfect choice for parallel processing of a large collection of data files.
upvoted 1 times
...
rralucard_
1 year, 4 months ago
Selected Answer: C
The Map state is specifically designed for processing a collection of items (like data files) in parallel. It allows you to apply a transformation or a set of steps to each item in the input array independently. The Map state automatically iterates over each item in the array and performs the defined steps. This makes it ideal for scenarios where you need to process a large number of files in a similar manner, as in your requirement.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...