exam questions

Exam DP-201 All Questions

View all questions & answers for the DP-201 exam

Exam DP-201 topic 1 question 10 discussion

Actual exam question from Microsoft's DP-201
Question #: 10
Topic #: 1
[All DP-201 Questions]

You are designing a data processing solution that will implement the lambda architecture pattern. The solution will use Spark running on HDInsight for data processing.
You need to recommend a data storage technology for the solution.
Which two technologies should you recommend? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.

  • A. Azure Cosmos DB
  • B. Azure Service Bus
  • C. Azure Storage Queue
  • D. Apache Cassandra
  • E. Kafka HDInsight
Show Suggested Answer Hide Answer
Suggested Answer: AE 🗳️
To implement a lambda architecture on Azure, you can combine the following technologies to accelerate real-time big data analytics:
✑ Azure Cosmos DB, the industry's first globally distributed, multi-model database service.
✑ Apache Spark for Azure HDInsight, a processing framework that runs large-scale data analytics applications
Azure Cosmos DB change feed, which streams new data to the batch layer for HDInsight to process

✑ The Spark to Azure Cosmos DB Connector
E: You can use Apache Spark to stream data into or out of Apache Kafka on HDInsight using DStreams.
Reference:
https://docs.microsoft.com/en-us/azure/cosmos-db/lambda-architecture

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Abhilvs
Highly Voted 4 years, 10 months ago
for batch processing - cosmos DB , for Stream processing - Kafka HDinsight
upvoted 26 times
...
mclawson1966
Highly Voted 5 years, 1 month ago
Is Kafka considered a data storage solution? I thought it was a streaming technology.
upvoted 11 times
JamesCho
4 years, 11 months ago
https://www.confluent.io/blog/okay-store-data-apache-kafka/ [ it states something like this - "It is much closer in architecture to a distributed filesystem or database then to traditional message queue." ]
upvoted 2 times
...
...
Wendy_DK
Most Recent 4 years ago
Question here is :You need to recommend a data storage technology for the solution. Answer: cosmos DB and Blob blob. Yet Azure Kafka is for stream processing
upvoted 1 times
...
sjain91
4 years ago
for batch: Cosmos DB for stream: Kafka HD insight
upvoted 2 times
...
davita8
4 years ago
A. Azure Cosmos DB D. Apache Cassandra
upvoted 3 times
...
Deepu1987
4 years, 2 months ago
Given solution is right & pls go through this link https://www.bluegranite.com/blog/exploring-the-lambda-architecture-in-azure Kafka hdsight is for ingestion Cosmos DB for processing
upvoted 3 times
...
syu31svc
4 years, 4 months ago
https://www.bluegranite.com/blog/exploring-the-lambda-architecture-in-azure Kafka for ingestion As for processing, Cosmos DB would be it
upvoted 1 times
...
Tombarc
5 years ago
Lambda architecture is usually built with Cassandra as a storage solution and Kafka as a Data stream technology, so Cosmos DB is the correct answer. There is no such thing as Apache Cassandra.
upvoted 8 times
chaoxes
4 years, 4 months ago
What do you mean? There is Apache Cassandra - a distributed, wide column storage on Apache license. However, Cosmos DB & HDI Kafka are the answers for this question.
upvoted 2 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago