Exam Professional Data Engineer topic 1 question 147 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 147
Topic #: 1

You are implementing several batch jobs that must be executed on a schedule. These jobs have many interdependent steps that must be executed in a specific order. Portions of the jobs involve executing shell scripts, running Hadoop jobs, and running queries in BigQuery. The jobs are expected to run for many minutes up to several hours. If the steps fail, they must be retried a fixed number of times. Which service should you use to manage the execution of these jobs?

  • A. Cloud Scheduler
  • B. Cloud Dataflow
  • C. Cloud Functions
  • D. Cloud Composer
Suggested Answer: D

Comments

mario_ordinola
Highly Voted 3 years, 9 months ago
If someone is not sure that D is the answer, I suggest they not take the exam.
upvoted 42 times
...
madhu1171
Highly Voted 4 years, 9 months ago
D should be the answer
upvoted 23 times
...
patitonav
Most Recent 11 months, 3 weeks ago
Selected Answer: D
No doubt.
upvoted 2 times
...
barnac1es
1 year, 2 months ago
Selected Answer: D
  • Workflow Orchestration: Cloud Composer is a fully managed workflow orchestration service based on Apache Airflow. It allows you to define, schedule, and manage complex workflows with multiple steps, including shell scripts, Hadoop jobs, and BigQuery queries.
  • Dependency Management: You can define dependencies between different steps in your workflow to ensure they are executed in a specific order.
  • Retry Mechanism: Cloud Composer provides built-in retry mechanisms, so if any step fails, it can be retried a fixed number of times according to your configuration.
  • Scheduled Execution: Cloud Composer allows you to schedule the execution of your workflows on a regular basis, meeting the requirement for executing the jobs on a schedule.
upvoted 1 times
...
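The dependency ordering and fixed retry count described in the comment above can be illustrated with a small toy model. This is only a sketch in plain Python, assuming a linear chain of three steps; `run_workflow` and the step functions are hypothetical names invented for illustration and are not part of Cloud Composer or Airflow. In Cloud Composer itself the same idea would be expressed as an Airflow DAG, with the retry count set through the task-level `retries` argument.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def run_workflow(steps, dependencies, max_retries=3):
    """Run steps in dependency order, retrying each failed step.

    steps:        dict mapping step name -> zero-argument callable
    dependencies: dict mapping step name -> set of steps that must finish first
    Returns a dict mapping step name -> number of attempts it needed.
    """
    attempts = {}
    # static_order() yields a step only after all of its prerequisites.
    for name in TopologicalSorter(dependencies).static_order():
        # One initial try plus up to max_retries retries.
        for attempt in range(1, max_retries + 2):
            try:
                steps[name]()
                attempts[name] = attempt
                break
            except Exception as exc:
                if attempt == max_retries + 1:
                    raise RuntimeError(
                        f"step {name!r} failed after {attempt} attempts"
                    ) from exc
    return attempts


# Hypothetical stand-ins for the shell script, Hadoop job, and BigQuery query.
log = []
hadoop_calls = {"count": 0}


def run_shell_script():
    log.append("shell_script")


def run_hadoop_job():
    # Simulate a transient failure on the first two attempts.
    hadoop_calls["count"] += 1
    if hadoop_calls["count"] < 3:
        raise RuntimeError("transient failure")
    log.append("hadoop_job")


def run_bigquery_query():
    log.append("bigquery_query")


steps = {
    "shell_script": run_shell_script,
    "hadoop_job": run_hadoop_job,
    "bigquery_query": run_bigquery_query,
}
dependencies = {
    "shell_script": set(),
    "hadoop_job": {"shell_script"},
    "bigquery_query": {"hadoop_job"},
}

attempts = run_workflow(steps, dependencies, max_retries=3)
print(log)
print(attempts)
```

The point of the sketch is that ordering and retries belong to the orchestrator rather than to the individual steps, which is the role the question assigns to the managed service.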
AzureDP900
1 year, 11 months ago
D is right
upvoted 3 times
...
zellck
2 years ago
Selected Answer: D
D is the answer. https://cloud.google.com/composer/docs/concepts/overview Cloud Composer is a fully managed workflow orchestration service, enabling you to create, schedule, monitor, and manage workflows that span across clouds and on-premises data centers.
upvoted 4 times
...
DataEngineer_WideOps
2 years, 5 months ago
Selected Answer: A
Cloud Composer for sure.
upvoted 1 times
[Removed]
2 years, 2 months ago
Composer is D
upvoted 2 times
...
...
nadavw
2 years, 6 months ago
D. Per the documentation, Cloud Scheduler is aimed at a single service and Composer at ETL workflows. In addition, the question does not even specify that all jobs run in the cloud, so only Composer can handle it.
upvoted 1 times
nadavw
2 years, 6 months ago
https://cloud.google.com/blog/topics/developers-practitioners/choosing-right-orchestrator-google-cloud
upvoted 1 times
...
...
medeis_jar
2 years, 11 months ago
Selected Answer: D
Cloud Composer
upvoted 2 times
...
JG123
3 years ago
Why are there so many wrong answers? Examtopics.com, are you enjoying paid subscriptions while giving random answers from people? Ans: D
upvoted 2 times
...
daghayeghi
3 years, 10 months ago
D: the main point is that Cloud Composer should be used when there are interdependencies between jobs, e.g. we need the output of one job to start another as soon as the first finishes, and the second job uses dependencies coming from the first.
upvoted 4 times
...
ashuchip
3 years, 12 months ago
D seems to be quite relevant, because with Composer you can do everything that is being asked, and Composer even has a retry property.
upvoted 3 times
...
Alasmindas
4 years, 1 month ago
The correct answer is Option A: Cloud Scheduler. At first I thought it should be Cloud Composer, but after reading the question a few times I concluded that Option A fits. Cloud Scheduler has built-in retry handling, so you can set a fixed number of retries, and it doesn't have time limits for requests. Its functionality is much simpler than Cloud Composer's. Cloud Composer is managed Apache Airflow that "helps you create, schedule, monitor and manage workflows". For automating scheduled jobs, the most preferred method would be Scheduler; Composer would typically be used when we want to orchestrate many managed services and automate the workflow.
upvoted 5 times
baubaumiaomiao
2 years, 12 months ago
You forgot the "These jobs have many interdependent steps" part, which can be handled only through Composer.
upvoted 1 times
...
kavs
4 years, 1 month ago
A seems to be right
upvoted 1 times
mumukshu
4 years ago
I think D. How can Scheduler handle this part: "The jobs are expected to run for many minutes up to several hours"?
upvoted 3 times
...
...
...
Abby1356
4 years, 1 month ago
should be A
upvoted 1 times
...
arghya13
4 years, 2 months ago
The answer should be A, Cloud Scheduler. Cloud Composer is a workflow manager; it can't run Unix or BigQuery jobs.
upvoted 2 times
...
Tanmoyk
4 years, 3 months ago
D should be the best option
upvoted 3 times
...
haroldbenites
4 years, 3 months ago
D is correct
upvoted 3 times
...