Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam Professional Data Engineer topic 1 question 295 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 295
Topic #: 1
[All Professional Data Engineer Questions]

You are designing the architecture to process your data from Cloud Storage to BigQuery by using Dataflow. The network team provided you with the Shared VPC network and subnetwork to be used by your pipelines. You need to enable the deployment of the pipeline on the Shared VPC network. What should you do?

  • A. Assign the compute.networkUser role to the Dataflow service agent.
  • B. Assign the compute.networkUser role to the service account that executes the Dataflow pipeline.
  • C. Assign the dataflow.admin role to the Dataflow service agent.
  • D. Assign the dataflow.admin role to the service account that executes the Dataflow pipeline.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
raaad
Highly Voted 4 months, 1 week ago
Selected Answer: A
- Dataflow service agent is the one responsible for setting up and managing the network resources that Dataflow requires. - By granting the compute.networkUser role to this service agent, we are enabling it to provision the necessary network resources within the Shared VPC for your Dataflow job.
upvoted 6 times
...
chrissamharris
Most Recent 2 weeks, 3 days ago
Selected Answer: B
I believe the answer is B. All authentication documentation points to Service Accounts. https://cloud.google.com/dataflow/docs/concepts/authentication#on-gcp Dataflow service agent typically manages general interactions with the Dataflow service but does not execute the actual jobs.
upvoted 1 times
...
saschak94
3 months, 1 week ago
Selected Answer: A
All projects that have used the resource Dataflow Job have a Dataflow Service Account, also known as the Dataflow service agent. Make sure the Shared VPC subnetwork is shared with the Dataflow service account and has the Compute Network User role assigned on the specified subnet.
upvoted 3 times
...
Matt_108
4 months ago
Selected Answer: A
Option A, I do agree with Raaad, it's the dataflow service agent that needs the networkUser role, because it's the one that provisions the network resources https://cloud.google.com/dataflow/docs/guides/specifying-networks#shared
upvoted 3 times
tibuenoc
3 months, 1 week ago
But your link it's explain that "Network User role must be assigned to the Dataflow service account" Make sure the Shared VPC subnetwork is shared with the Dataflow service account and has the Compute Network User role assigned on the specified subnet. The Compute Network User role must be assigned to the Dataflow service account in the host project.
upvoted 1 times
ML6
3 months ago
All projects that have used the resource Dataflow Job have a Dataflow Service Account, also known as the Dataflow service agent. Source: https://cloud.google.com/dataflow/docs/concepts/security-and-permissions#df-service-account
upvoted 1 times
...
...
...
task_7
4 months, 1 week ago
Selected Answer: B
compute.networkUser to the service account that executes the Dataflow pipeline.
upvoted 3 times
...
BIGQUERY_ALT_ALT
4 months, 1 week ago
Selected Answer: B
Option B is Correct. Explanation: You need to give compute networkuser role to service account that is processing the pipeline as it will need to deploy nessesary worker nodes on the shared vpc project. Option A is incorrect as Dataflow Service Agent is Google MGS service account that will not responsible for running or deoplying workers in shared vpc. Option C and D is incorrect as dataflow.admin is elevated privlages to create and manage all of dataflow components not deploying resources in shared vpc.
upvoted 1 times
...
GCP001
4 months, 1 week ago
B. Assign the compute.networkUser role to the service account that executes the Dataflow pipeline. See the ref - https://cloud.google.com/dataflow/docs/guides/specifying-networks
upvoted 1 times
raaad
4 months, 1 week ago
Option A makes more sense: - Assigning the compute.networkUser role to the pipeline's service account grants it unnecessary and possibly excessive permissions outside its core responsibility of data processing. The question focused specifically on the deployment aspect (i.e., provisioning of network resources like VMs) rather than what the pipeline accesses or processes once it's running.
upvoted 1 times
GCP001
4 months ago
Yes , I agree, it should be A. Dataflow service account should be the one having this permission instaed of worker
upvoted 1 times
...
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...