Exam Professional Data Engineer topic 1 question 185 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 185
Topic #: 1

You issue a new batch job to Dataflow. The job starts successfully, processes a few elements, and then suddenly fails and shuts down. You navigate to the
Dataflow monitoring interface where you find errors related to a particular DoFn in your pipeline. What is the most likely cause of the errors?

  • A. Job validation
  • B. Exceptions in worker code
  • C. Graph or pipeline construction
  • D. Insufficient permissions
Suggested Answer: B

Comments

AWSandeep
Highly Voted 2 years, 2 months ago
Selected Answer: B
B. Exceptions in worker code While your job is running, you might encounter errors or exceptions in your worker code. These errors generally mean that the DoFns in your pipeline code have generated unhandled exceptions, which result in failed tasks in your Dataflow job. Exceptions in user code (for example, your DoFn instances) are reported in the Dataflow monitoring interface. Reference (Lists all answer choices and when to pick each one): https://cloud.google.com/dataflow/docs/guides/troubleshooting-your-pipeline#Causes
upvoted 13 times
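
The failure mode described above can be reproduced without Dataflow. The sketch below is plain Python, not Beam code: `parse_amount` is a hypothetical per-element function playing the role of a DoFn's `process` method, and `run_batch` is a hypothetical loop standing in for a worker. The first two records parse fine, then a malformed one raises an unhandled exception and the run stops, which mirrors a job that starts, processes a few elements, and then fails.

```python
def parse_amount(record):
    """Hypothetical per-element logic, standing in for a DoFn's process()."""
    # Raises KeyError if "amount" is missing and ValueError if it is not numeric.
    return float(record["amount"])


def run_batch(records):
    """Hypothetical worker loop: applies parse_amount to each record in order."""
    results = []
    for record in records:
        # No try/except here, so the first bad record aborts the whole run.
        results.append(parse_amount(record))
    return results


records = [{"amount": "10.5"}, {"amount": "3"}, {"amount": "n/a"}]

failure = None
try:
    run_batch(records)
except ValueError as err:
    # The third record raised after the first two had been processed.
    failure = err
```

Nothing here depends on validation, graph construction, or permissions; the error only appears once a specific element reaches the user code, which is why option B fits the symptoms.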
zellck
Highly Voted 1 year, 11 months ago
Selected Answer: B
B is the answer. https://cloud.google.com/dataflow/docs/guides/troubleshooting-your-pipeline#detect_an_exception_in_worker_code While your job is running, you might encounter errors or exceptions in your worker code. These errors generally mean that the DoFns in your pipeline code have generated unhandled exceptions, which result in failed tasks in your Dataflow job. Exceptions in user code (for example, your DoFn instances) are reported in the Dataflow monitoring interface.
upvoted 6 times
MaxNRG
Most Recent 10 months, 2 weeks ago
Selected Answer: B
The most likely cause is B. Exceptions in worker code, especially because the errors are tied to a particular DoFn (the user-defined per-element function that an Apache Beam ParDo transform runs on the workers). When a Dataflow job processes a few elements successfully before failing, the job setup, permissions, and pipeline graph are likely correct, since the job was able to start and process data. A failure during execution with errors attributed to a specific DoFn points to the code that runs on the workers. Possible causes include:
1. Runtime exceptions in the DoFn's logic.
2. Data elements the DoFn does not handle correctly (e.g., unexpected data formats or null values).
3. Resource constraints or timeouts if the DoFn performs resource-intensive or long-running operations.
upvoted 2 times
MaxNRG
10 months, 2 weeks ago
To resolve these issues, you should:
1. Inspect the stack traces and error messages in the Dataflow monitoring interface for details on the exception.
2. Test the DoFn with a variety of data inputs, especially edge cases, to ensure robust error handling.
3. Review the resource usage and performance characteristics of the DoFn if the issue is related to resource constraints.
upvoted 2 times
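
One way to act on the error-handling and edge-case advice above is to catch exceptions per element and route failing inputs to a separate collection, often called a dead-letter pattern, so that a single bad record does not fail the whole job. This is a minimal plain-Python sketch, not Beam code and not something the commenter proposed; `safe_parse` and `process_with_dead_letter` are hypothetical names. In a Beam pipeline the same idea is usually expressed with a DoFn that emits failures to a separate tagged output.

```python
def safe_parse(record):
    """Return ("ok", value) or ("error", (record, message)) instead of raising."""
    try:
        return "ok", float(record["amount"])
    except (KeyError, TypeError, ValueError) as err:
        # Keep the offending input and the error text for later inspection.
        return "error", (record, repr(err))


def process_with_dead_letter(records):
    """Split records into parsed values and a dead-letter list of failures."""
    parsed, dead_letter = [], []
    for record in records:
        tag, payload = safe_parse(record)
        if tag == "ok":
            parsed.append(payload)
        else:
            dead_letter.append(payload)
    return parsed, dead_letter


# Edge cases: a valid value, a null value, a missing key, a non-numeric string.
edge_cases = [{"amount": "10.5"}, {"amount": None}, {}, {"amount": "n/a"}]
parsed, dead_letter = process_with_dead_letter(edge_cases)
```

Running the function over a small list of edge cases like this, before submitting the job, is a cheap way to find inputs that would otherwise surface as worker exceptions.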
vaga1
1 year, 5 months ago
Selected Answer: B
A. Job validation - the job started successfully, so it must have passed validation.
B. Exceptions in worker code - possible.
C. Graph or pipeline construction - same reasoning as A.
D. Insufficient permissions - nothing in the question suggests this, and it should have caused validation to fail.
upvoted 3 times
Atnafu
1 year, 11 months ago
C Code error
upvoted 1 times
pluiedust
2 years, 1 month ago
Selected Answer: B
B is correct
upvoted 2 times
ducc
2 years, 2 months ago
Selected Answer: B
B is correct
upvoted 1 times