exam questions

Exam DP-201 All Questions

View all questions & answers for the DP-201 exam

Exam DP-201 topic 2 question 35 discussion

Actual exam question from Microsoft's DP-201
Question #: 35
Topic #: 2
[All DP-201 Questions]

You have a large amount of sensor data stored in an Azure Data Lake Storage Gen2 account. The files are in the Parquet file format.
New sensor data will be published to Azure Event Hubs.
You need to recommend a solution to add the new sensor data to the existing sensor data in real-time. The solution must support the interactive querying of the entire dataset.
Which type of server should you include in the recommendation?

  • A. Azure SQL Database
  • B. Azure Cosmos DB
  • C. Azure Stream Analytics
  • D. Azure Databricks
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️
Azure Stream Analytics is a fully managed PaaS offering that enables real-time analytics and complex event processing on fast moving data streams.
By outputting data in parquet format into a blob store or a data lake, you can take advantage of Azure Stream Analytics to power large scale streaming extract, transfer, and load (ETL), to run batch processing, to train machine learning algorithms, or to run interactive queries on your historical data.
Reference:
https://azure.microsoft.com/en-us/blog/new-capabilities-in-stream-analytics-reduce-development-time-for-big-data-apps/

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
sdas1
Highly Voted 4 years, 2 months ago
As per below link the answer is correct. https://azure.microsoft.com/en-in/blog/new-capabilities-in-stream-analytics-reduce-development-time-for-big-data-apps/
upvoted 5 times
...
cadio30
Highly Voted 4 years ago
Both Azure Databricks and Azure Stream analytics can output data to parquet format and have interactive queries as well. For simplicity, I'll choose Azure Stream Analytics
upvoted 5 times
cadio30
3 years, 12 months ago
Opt to choose Azure Databricks instead of Azure Streaming Analytics due to the keywork 'large dataset' Reference: https://techcommunity.microsoft.com/t5/analytics-on-azure/azure-stream-analytics-real-time-analytics-for-big-data-made/ba-p/549621
upvoted 2 times
...
BobFar
4 years ago
ASA doesn't support Parquet format.!
upvoted 2 times
BobFar
4 years ago
I was wrong, it supports now https://azure.microsoft.com/en-us/updates/stream-analytics-offers-native-support-for-parquet-format/#:~:text=Azure%20Stream%20Analytics%20now%20offers,in%20the%20Big%20Data%20ecosystems.
upvoted 1 times
...
...
mbravo
4 years ago
One of the requirements is to be able to interactively query the whole (possibly very large) dataset according to the scenario. This requirement alone is a perfect fit for Spark. I highly doubt there is a sensible way to achieve this with ASA. Therefore I vote for Databricks.
upvoted 1 times
...
...
daradev
Most Recent 3 years, 10 months ago
By outputting data in parquet format into a blob store or a data lake, you can take advantage of Azure Stream Analytics to power large scale streaming extract, transfer, and load (ETL), to run batch processing, to train machine learning algorithms, or to run interactive queries on your historical data. Soure: https://azure.microsoft.com/en-in/blog/new-capabilities-in-stream-analytics-reduce-development-time-for-big-data-apps/
upvoted 1 times
hello_there_
3 years, 10 months ago
What this quote says is that ASA can output parquet format to blob storage, so that another tool can then run interactive queries on the data. ASA itself can't do interactive queries on parquet in blob storage, which is what is required here. I'd go with databricks.
upvoted 1 times
...
...
VG2007
4 years, 1 month ago
Native support for egress in Apache parquet format into Azure Blob Storage is now generally available. Parquet is a columnar format enabling efficient big data processing. By outputting data in parquet format into a blob store or a data lake, you can take advantage of Azure Stream Analytics to power large scale streaming extract, transfer, and load (ETL), to run batch processing, to train machine learning algorithms, or to run interactive queries on your historical data. We are now announcing general availability of this feature for egress to Azure Blob Storage.
upvoted 3 times
...
jms309
4 years, 2 months ago
I think that Databrick is a good answer. I'm not sure if Azure Stream Analytics is another right answer but maybe there are two possibilities
upvoted 2 times
anamaster
4 years, 1 month ago
interactive querying eliminates ASA
upvoted 3 times
...
niwe
4 years, 1 month ago
Azure Stream Analytics does not support Parquet data format. https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-stream-analytics-query-patterns
upvoted 2 times
saifone
4 years, 1 month ago
It does as of July 2019 https://azure.microsoft.com/en-us/updates/stream-analytics-offers-native-support-for-parquet-format/
upvoted 2 times
...
...
...
sdas1
4 years, 2 months ago
As per below link the answer is correct. new-capabilities-in-stream-analytics-reduce-development-time-for-big-data-app
upvoted 1 times
...
YOMYOM
4 years, 2 months ago
is C really the correct answer pls?
upvoted 2 times
H_S
4 years, 2 months ago
i think it's D because the interactive querying of the entire dataset. entire dataset/interative isn't possible with A.stream
upvoted 7 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...