exam questions

Exam DP-203 All Questions

View all questions & answers for the DP-203 exam

Exam DP-203 topic 2 question 13 discussion

Actual exam question from Microsoft's DP-203
Question #: 13
Topic #: 2
[All DP-203 Questions]

You are designing a statistical analysis solution that will use custom proprietary Python functions on near real-time data from Azure Event Hubs.
You need to recommend which Azure service to use to perform the statistical analysis. The solution must minimize latency.
What should you recommend?

  • A. Azure Synapse Analytics
  • B. Azure Databricks
  • C. Azure Stream Analytics
  • D. Azure SQL Database
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
kolakone
Highly Voted 3 years, 8 months ago
My answer will be B Stream Analytics supports "extending SQL language with JavaScript and C# user-defined functions (UDFs)". There is no mention of Python support; hence Stream Analytics is not correct. https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-introduction Azure Databricks supports near real-time data from Azure Event Hubs. And includes support for R, SQL, Python, Scala, and Java. So I will go for option B.
upvoted 87 times
anto69
3 years, 4 months ago
But Python runs on Event Hubs why the other service does should support Python too?
upvoted 2 times
Aditya0891
2 years, 11 months ago
It's mentioned that "python runs on real time data from event hubs not on event hubs". Also event hub is to gather that data and after that it is analyzed by either databricks stream analytics. And since stream analytics doesn't support python so the answer is databricks
upvoted 2 times
RoyP654
1 year, 12 months ago
therefore i agree wih ASA
upvoted 2 times
RoyP654
1 year, 12 months ago
python can run Event Hubs libraries real time, it doesn't have to be supported by ASA, it just needs to send data to analytics service
upvoted 1 times
ExamDestroyer69
1 year, 5 months ago
@RoyP654, the question asks which service to perform the statistical analysis (e.g. execute the python) suggesting that the python has not/will not be ran in events hubs
upvoted 1 times
...
...
...
...
...
...
anto69
Highly Voted 3 years, 4 months ago
I'm sure it's Stream Analytics cause Event Hubs already supports Python (https://docs.microsoft.com/en-us/azure/event-hubs/event-hubs-python-get-started-send). We don't need the other service to support it. We just need to lower costs. Hence ASA is the correct solution
upvoted 16 times
RoyP654
1 year, 12 months ago
the question does not ask which service can run Python, it's asking where to send the data for analytics since Python can run with Event Hubs libraries
upvoted 1 times
...
...
fbc11c2
Most Recent 9 months ago
Selected Answer: B
Should be B. See the link below. Under the heading "When to use other technologies", it mentions "Azure Stream Analytics supports user-defined functions (UDF) or user-defined aggregates (UDA) in JavaScript for cloud jobs and C# for IoT Edge jobs. C# user-defined deserializers are also supported. If you want to implement a deserializer, a UDF, or a UDA in other languages, such as Java or Python, you can use Spark Structured Streaming. You can also run the Event Hubs EventProcessorHost on your own virtual machines to do arbitrary streaming processing." As the question mentions user-defined-function (UDF) in Python, ASA seems not support UDF in Python. Should use Spark Structured Streaming, which in this case here is Azure Databricks. https://learn.microsoft.com/en-us/azure/stream-analytics/streaming-technologies
upvoted 1 times
...
evangelist
10 months, 4 weeks ago
C is wrong, stream analytics is a SQL based analytics
upvoted 1 times
...
e56bb91
11 months ago
Selected Answer: B
ChatGPT 4o Azure Databricks is well-suited for real-time data processing and analytics. It provides a collaborative environment for working with Apache Spark, which is ideal for performing complex statistical analyses and machine learning tasks in real-time.
upvoted 1 times
...
Selected Answer: B
It says we are using python after it is sent to event hubs: "custom proprietary Python functions on near real-time data FROM Azure Event Hubs". Yes, we can send events to Event Hub with python but it says that we are running statistical analysis AFTER we send it to Event Hub. Therefore, my answer is Databricks
upvoted 1 times
...
eb36a01
1 year ago
Azure Stream Analytics: Azure Stream Analytics is designed for real-time data processing and can directly ingest data from Azure Event Hubs. However, it has limited support for custom Python functions. It is more suitable for simple real-time analytics and transformations rather than complex statistical analysis with custom code. correct answer: Azure Databricks, we have custom python function
upvoted 2 times
...
Dusica
1 year, 1 month ago
B is correct
upvoted 1 times
...
poesklap
1 year, 2 months ago
Selected Answer: B
Azure Databricks provides a fast and scalable Apache Spark-based analytics platform that supports Python, among other programming languages. It allows you to perform near real-time data processing and analysis efficiently, making it ideal for scenarios where low latency is a priority. Additionally, it offers seamless integration with Azure Event Hubs, enabling you to ingest data in real-time and apply custom Python functions for statistical analysis.
upvoted 1 times
...
Elanche
1 year, 3 months ago
B. Azure Databricks Azure Databricks provides a fully managed Apache Spark-based analytics platform that is well-suited for processing and analyzing real-time streaming data. It offers native integration with Azure Event Hubs, allowing you to ingest data in real-time and apply custom Python functions for statistical analysis with minimal latency. Additionally, Databricks provides scalable compute resources, optimized processing capabilities, and support for various programming languages, making it an ideal choice for near real-time data analysis scenarios.
upvoted 1 times
...
moneytime
1 year, 3 months ago
C is correct. At near realtime ,the window functions in azure stream analytics can be employed in compute some statical values (e.g count,maximum,min,avg. etc) of the data streaming from the even hub.
upvoted 1 times
...
Azure_2023
1 year, 4 months ago
Selected Answer: B
Corrected!!! FROM Azure Event Hubs, not ON Azure Databricks
upvoted 1 times
...
Azure_2023
1 year, 4 months ago
Selected Answer: D
FROM Azure Event Hubs, not ON
upvoted 1 times
...
prshntdxt7
1 year, 4 months ago
Selected Answer: C
chatGPT explains - Azure Stream Analytics is designed for real-time data stream processing and analytics. It can ingest data from various sources, including Azure Event Hubs, and allows you to run near real-time analytics using a SQL-like language. With Stream Analytics, you can easily apply custom Python functions using user-defined functions (UDFs) and achieve low-latency processing. Azure Synapse Analytics and Azure Databricks are powerful analytics services, but they are more suitable for complex analytics and big data processing rather than near real-time, low-latency scenarios. Azure SQL Database is a relational database service and is not specifically designed for real-time stream processing. Therefore, in this case, Azure Stream Analytics is the recommended choice for minimizing latency in statistical analysis on near real-time data from Azure Event Hubs.
upvoted 3 times
...
sdg2844
1 year, 4 months ago
Selected Answer: C
Simply, they always want the Stream Analytics answer. It's the most straightforward.
upvoted 3 times
...
maxCarter
1 year, 5 months ago
Azure Databricks
upvoted 1 times
...
HSZ
1 year, 9 months ago
Selected Answer: C
From ChatGPT, To minimize latency for statistical analysis on near real-time data from Azure Event Hubs, I recommend using Azure Stream Analytics (Option C). Azure Stream Analytics is designed for real-time data processing and can ingest and analyze data from Event Hubs with low latency, making it a suitable choice for this scenario.
upvoted 3 times
mav2000
1 year, 5 months ago
Also from ChatGPT (GPT4) lol: For processing near real-time data with custom proprietary Python functions and minimizing latency, the best service would be: B. Azure Databricks Here’s why: Azure Databricks is an Apache Spark-based analytics service that integrates smoothly with Azure services such as Azure Event Hubs. It supports real-time streaming data processing and can execute custom Python code, which is necessary for your custom statistical analysis functions. Databricks is designed to handle large-scale data processing and analytics with low latency, making it suitable for near real-time scenarios. The other services have their uses but may not be the optimal choice for this particular scenario
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...