You create an Azure Databricks cluster and specify an additional library to install. When you attempt to load the library in a notebook, the library is not found. You need to identify the cause of the issue. What should you review?
I would say cluster event logs:
Azure Databricks provides three kinds of logging of cluster-related activity:
Cluster event logs, which capture cluster lifecycle events, like creation, termination, configuration edits, and so on.
Apache Spark driver and worker logs, which you can use for debugging.
Cluster init-script logs, valuable for debugging init scripts.
https://docs.microsoft.com/en-us/azure/databricks/clusters/clusters-manage#event-log
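The same event log can also be pulled programmatically rather than through the UI. A minimal sketch against the Clusters REST API, assuming the standard /api/2.0/clusters/events endpoint; the host, token, and cluster ID are placeholders:

    # List recent events for a cluster (lifecycle, library, and init-script activity).
    # DATABRICKS_HOST, DATABRICKS_TOKEN and the cluster ID are placeholders.
    curl -s -X POST "$DATABRICKS_HOST/api/2.0/clusters/events" \
      -H "Authorization: Bearer $DATABRICKS_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"cluster_id": "1234-567890-abcde123", "limit": 50}'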
B. cluster event logs.
Explanation:
Cluster event logs provide information about the cluster's lifecycle events, including the initialization process.
When you specify an additional library to install on the Databricks cluster, the installation process is part of the cluster initialization.
Reviewing the cluster event logs can help you determine whether the library installation process encountered any errors or issues that prevented the library from being installed successfully.
Any errors or warnings during the library installation process would likely be logged in the cluster event logs, providing insights into the cause of the issue.
The correct answer is B
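As a quick cross-check, the per-library installation status on the cluster can also be queried directly. A hedged sketch using the Libraries API cluster-status endpoint, with placeholder host, token, and cluster ID:

    # Report the status of every library attached to the cluster.
    # A failed install typically surfaces as a FAILED status with messages in the response.
    curl -s "$DATABRICKS_HOST/api/2.0/libraries/cluster-status?cluster_id=1234-567890-abcde123" \
      -H "Authorization: Bearer $DATABRICKS_TOKEN"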
Cluster event logs capture two init script events: INIT_SCRIPTS_STARTED and INIT_SCRIPTS_FINISHED, indicating which scripts are scheduled for execution and which have completed successfully. INIT_SCRIPTS_FINISHED also captures execution duration.
https://docs.databricks.com/en/init-scripts/logs.html
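Those two event types can be requested explicitly when querying the event log. A sketch, assuming the Clusters API event_types filter accepts these names; all identifiers are placeholders:

    # Return only init-script events for the cluster.
    curl -s -X POST "$DATABRICKS_HOST/api/2.0/clusters/events" \
      -H "Authorization: Bearer $DATABRICKS_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"cluster_id": "1234-567890-abcde123",
           "event_types": ["INIT_SCRIPTS_STARTED", "INIT_SCRIPTS_FINISHED"]}'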
ChatGPT:
If the library was installed through:
- Standard Databricks library installation methods: Check the cluster event logs (B).
- A global init script: Check the global init scripts logs (C).
Without additional context or explicit mention of an init script being used, option B is typically the more standard choice for initial troubleshooting.
Legacy global init scripts and cluster-named init scripts are deprecated and cannot be used in new workspaces starting February 21, 2023. On September 1st, 2023, Azure Databricks will disable legacy global init scripts for all workspaces.
Cluster event logs in Azure Databricks provide detailed information about the cluster's lifecycle events, including the installation and initialization of libraries. By reviewing the cluster event logs, you can examine the events related to library installation and determine if any errors or issues occurred during the process.
Cluster event logs do not log init script events for each cluster node; only one node is selected to represent them all.
https://learn.microsoft.com/en-us/azure/databricks/clusters/init-scripts
That's incorrect. Library installation is part of what init scripts do.
Some examples of tasks performed by init scripts include:
Set system properties and environment variables used by the JVM.
Modify Spark configuration parameters.
Modify the JVM system classpath in special cases.
Install packages and libraries not included in Databricks Runtime. To install Python packages, use the Azure Databricks pip binary located at /databricks/python/bin/pip to ensure that Python packages install into the Azure Databricks Python virtual environment rather than the system Python environment. For example, /databricks/python/bin/pip install <package-name> (a minimal sketch follows below).
https://learn.microsoft.com/en-us/azure/databricks/init-scripts/
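A minimal cluster-scoped init script along those lines could look like this; the package name is only an illustration, not something from the question:

    #!/bin/bash
    # Install an extra Python package into the Databricks Python virtual environment,
    # not the system Python. "some-package" is a placeholder.
    set -e
    /databricks/python/bin/pip install some-package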
Additional libraries can be installed in global init scripts, so the correct answer is C.
Some examples of tasks performed by init scripts include:
- Install packages and libraries not included in Databricks Runtime. To install Python packages, use the Azure Databricks pip binary located at /databricks/python/bin/pip to ensure that Python packages install into the Azure Databricks Python virtual environment rather than the system Python environment. For example, /databricks/python/bin/pip install <package-name>.
- Modify the JVM system classpath in special cases.
- Set system properties and environment variables used by the JVM.
- Modify Spark configuration parameters.
ref: https://learn.microsoft.com/en-us/azure/databricks/clusters/init-scripts
There are two primary ways to install a library on a cluster:
- Install a workspace library that has already been uploaded to the workspace.
- Install a library for use with a specific cluster only.
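For the cluster-scoped case, the attachment can also be done through the Libraries API instead of the UI. A sketch with placeholder values, assuming a PyPI package:

    # Attach a PyPI library to one specific cluster.
    curl -s -X POST "$DATABRICKS_HOST/api/2.0/libraries/install" \
      -H "Authorization: Bearer $DATABRICKS_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"cluster_id": "1234-567890-abcde123",
           "libraries": [{"pypi": {"package": "some-package"}}]}'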
The best option in this scenario is to review the cluster event logs to identify why the additional library is not found on the Azure Databricks cluster.
Answer C.
A global init script runs on every cluster created in your workspace. Global init scripts are useful when you want to enforce organization-wide library configurations or security screens. Only admins can create global init scripts. You can create them using either the UI or REST API.
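For reference, creating one through the REST API looks roughly like this; I'm assuming the /api/2.0/global-init-scripts endpoint, and the script name, package, and values below are placeholders (the script body must be base64-encoded):

    # Create a global init script that installs an extra package on every new cluster.
    SCRIPT_B64=$(printf '#!/bin/bash\n/databricks/python/bin/pip install some-package\n' | base64)
    curl -s -X POST "$DATABRICKS_HOST/api/2.0/global-init-scripts" \
      -H "Authorization: Bearer $DATABRICKS_TOKEN" \
      -H "Content-Type: application/json" \
      -d "{\"name\": \"install-extra-libs\", \"script\": \"$SCRIPT_B64\", \"enabled\": true}"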
Community vote distribution: A (35%), C (25%), B (20%), other.