Exam DP-200 topic 2 question 4 discussion

Actual exam question from Microsoft's DP-200
Question #: 4
Topic #: 2

DRAG DROP
You develop data engineering solutions for a company.
A project requires analysis of real-time Twitter feeds. Posts that contain specific keywords must be stored and processed on Microsoft Azure and then displayed by using Microsoft Power BI. You need to implement the solution.
Which five actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
Select and Place:

Suggested Answer:
Step 1: Create an HDInsight cluster with the Spark cluster type
Step 2: Create a Jupyter Notebook
Step 3: Create a table
The Jupyter Notebook that you created in the previous step includes code to create an hvac table.
Step 4: Run a job that uses the Spark Streaming API to ingest data from Twitter
Step 5: Load the hvac table into Power BI Desktop
You use Power BI to create visualizations, reports, and dashboards from the Spark cluster data.
References:
https://acadgild.com/blog/streaming-twitter-data-using-spark
https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-use-with-data-lake-store
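
The question asks for posts that contain specific keywords to be stored and processed. As a rough illustration of that filtering step only, here is a minimal plain-Python sketch. It is not the Spark Streaming job from the suggested answer: the keyword list, the sample posts, and the helper names `matches_keywords` and `filter_posts` are all assumptions made up for this example, and a real solution would apply the same idea to each micro-batch inside the cluster.

```python
from typing import Dict, Iterable, Iterator, Set

# Hypothetical keyword list; the exam question does not name the keywords.
KEYWORDS: Set[str] = {"azure", "spark"}

# Punctuation stripped from the ends of each word before comparison.
_PUNCTUATION = ".,!?#@:;\"'()"


def matches_keywords(text: str, keywords: Set[str]) -> bool:
    """Return True if any keyword appears as a whole word in the text, ignoring case."""
    words = {word.strip(_PUNCTUATION).lower() for word in text.split()}
    return not keywords.isdisjoint(words)


def filter_posts(posts: Iterable[Dict], keywords: Set[str]) -> Iterator[Dict]:
    """Yield only the posts whose 'text' field mentions at least one keyword."""
    for post in posts:
        if matches_keywords(post["text"], keywords):
            yield post


# Made-up sample batch standing in for one micro-batch of incoming posts.
sample_batch = [
    {"id": 1, "text": "Trying out #Azure HDInsight today"},
    {"id": 2, "text": "Lunch was great"},
    {"id": 3, "text": "Spark Streaming is neat!"},
]

kept = list(filter_posts(sample_batch, KEYWORDS))
print([post["id"] for post in kept])  # prints [1, 3]
```

Matching on whole words avoids false hits such as "sparkling" for the keyword "spark"; whether that is the right rule depends on how the keywords are defined in the actual project.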

Comments

cadio30
Highly Voted 4 years, 1 month ago
The proposed solution is correct. A table cannot be created before the notebook exists; the scenario assumes the table lives in the HDInsight Spark cluster.
upvoted 6 times
cadio30
4 years, 1 month ago
Reference: https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-load-data-run-query
upvoted 2 times
Aragorn_2021
Most Recent 4 years, 1 month ago
Agree with Pairon. It would be ideal to create the target table before the Jupyter notebook.
upvoted 1 time
Pairon
4 years, 1 month ago
I agree with the answer, but maybe we can swap the second and third steps?
upvoted 1 time
tucho
4 years, 2 months ago
With the proposed solution, "who" runs the Jupyter notebook? :-(
upvoted 1 time
JohnCrawford
4 years, 2 months ago
The answer appears correct. https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-use-bi-tools
upvoted 3 times