Exam DP-200 topic 2 question 10 discussion

Actual exam question from Microsoft's DP-200

Question #: 10
Topic #: 2

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to create an Azure Databricks workspace that has a tiered structure. The workspace will contain the following three workloads:
✑ A workload for data engineers who will use Python and SQL
✑ A workload for jobs that will run notebooks that use Python, Scala, and SQL
✑ A workload that data scientists will use to perform ad hoc analysis in Scala and R
The enterprise architecture team at your company identifies the following standards for Databricks environments:
✑ The data engineers must share a cluster.
✑ The job cluster will be managed by using a request process whereby data scientists and data engineers provide packaged notebooks for deployment to the cluster.
✑ All the data scientists must be assigned their own cluster that terminates automatically after 120 minutes of inactivity. Currently, there are three data scientists.
You need to create the Databricks clusters for the workloads.
Solution: You create a High Concurrency cluster for each data scientist, a High Concurrency cluster for the data engineers, and a Standard cluster for the jobs.
Does this meet the goal?

A. Yes
B. No

Show Suggested Answer

Suggested Answer: B 🗳️
No need for a High Concurrency cluster for each data scientist.
Standard clusters are recommended for a single user. Standard can run workloads developed in any language: Python, R, Scala, and SQL.
A high concurrency cluster is a managed cloud resource. The key benefits of high concurrency clusters are that they provide Apache Spark-native fine-grained sharing for maximum resource utilization and minimum query latencies.
References:
https://docs.azuredatabricks.net/clusters/configure.html

by M0e at Oct. 1, 2020, 10:44 a.m.

Comments

Submit Cancel

ACSC

Highly Voted 4 years, 5 months ago

A workload that data scientists will use to perform ad hoc analysis in Scala and R. High Concurrency clusters don't support Scala. Answer is "No".

upvoted 9 times

...

cadio30

Most Recent 4 years, 1 month ago

Definitely the answer is NO

upvoted 1 times

...

Hassan_Mazhar_Khan

4 years, 1 month ago

A workload that data scientists will use to perform ad hoc analysis in Scala and R. High Concurrency clusters don't support Scala. Answer is "No".

upvoted 2 times

...

watata

4 years, 3 months ago

Answers should be "yes"...

upvoted 1 times

watata

4 years, 3 months ago

sorry, its "no", because for data Scientist should be Standard cluster

upvoted 1 times

...

karishura

4 years, 3 months ago

No - Right answer

upvoted 2 times

...

M0e

4 years, 8 months ago

"There is no need for a High concurrency cluster" does not mean, it does not fulfil the goal. High concurrency clusters can be used for data scientists workload. It is just more expensive. There is no requirement regarding keeping the costs low, mentioned in the question. So the correct answer to this question is also "A - Yes."

upvoted 3 times

...