exam questions

Exam Associate Data Practitioner All Questions

View all questions & answers for the Associate Data Practitioner exam

Exam Associate Data Practitioner topic 1 question 58 discussion

Actual exam question from Google's Associate Data Practitioner
Question #: 58
Topic #: 1
[All Associate Data Practitioner Questions]

Your company’s ecommerce website collects product reviews from customers. The reviews are loaded as CSV files daily to a Cloud Storage bucket. The reviews are in multiple languages and need to be translated to Spanish. You need to configure a pipeline that is serverless, efficient, and requires minimal maintenance. What should you do?

  • A. Load the data into BigQuery using Dataproc. Use Apache Spark to translate the reviews by invoking the Cloud Translation API. Set BigQuery as the sink.
  • B. Use a Dataflow templates pipeline to translate the reviews using the Cloud Translation API. Set BigQuery as the sink.
  • C. Load the data into BigQuery using a Cloud Run function. Use the BigQuery ML create model statement to train a translation model. Use the model to translate the product reviews within BigQuery.
  • D. Load the data into BigQuery using a Cloud Run function. Create a BigQuery remote function that invokes the Cloud Translation API. Use a scheduled query to translate new reviews.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
n2183712847
1 month, 4 weeks ago
Selected Answer: B
The best option is B. Dataflow template with Cloud Translation API. Option B is best because Dataflow templates are serverless, managed, and efficient for data pipelines like translation. Option A (Dataproc/Spark) is incorrect because Dataproc is not serverless and adds maintenance. Option C (Cloud Run/BigQuery ML) is incorrect because training a BigQuery ML model for translation is overly complex for this. Option D (Cloud Run/Remote Function) is incorrect because it adds unnecessary complexity with remote functions and scheduled queries. Therefore, Option B, Dataflow template, is the most streamlined and best-fit serverless solution.
upvoted 1 times
...
SaquibHerman
2 months, 1 week ago
Selected Answer: B
Dataflow is a fully managed
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago