exam questions

Exam Associate Data Practitioner All Questions

View all questions & answers for the Associate Data Practitioner exam

Exam Associate Data Practitioner topic 1 question 31 discussion

Actual exam question from Google's Associate Data Practitioner
Question #: 31
Topic #: 1
[All Associate Data Practitioner Questions]

You are a data analyst at your organization. You have been given a BigQuery dataset that includes customer information. The dataset contains inconsistencies and errors, such as missing values, duplicates, and formatting issues. You need to effectively and quickly clean the data. What should you do?

  • A. Develop a Dataflow pipeline to read the data from BigQuery, perform data quality rules and transformations, and write the cleaned data back to BigQuery.
  • B. Use Cloud Data Fusion to create a data pipeline to read the data from BigQuery, perform data quality transformations, and write the clean data back to BigQuery.
  • C. Export the data from BigQuery to CSV files. Resolve the errors using a spreadsheet editor, and re-import the cleaned data into BigQuery.
  • D. Use BigQuery's built-in functions to perform data quality transformations.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
n2183712847
1 month, 3 weeks ago
Selected Answer: D
it's already in bigquery, so just preform the transformation in the dataset
upvoted 2 times
...
n2183712847
2 months ago
Selected Answer: D
The best solution for effective and quick data cleaning is D. Use BigQuery's built-in functions. This is the most efficient and quickest approach as it leverages the power of BigQuery SQL for data transformations directly within the BigQuery environment. Option B (Cloud Data Fusion) is a good visual alternative but slower to set up than direct SQL. Option A (Dataflow) is powerful but more complex and time-consuming for initial cleaning. Option C (Spreadsheet Editor) is manual, inefficient, and not scalable for millions of records. Therefore, Option D offers the optimal balance of effectiveness and speed for cleaning data within BigQuery.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago