Exam DP-100 topic 1 question 14 discussion

Actual exam question from Microsoft's DP-100
Question #: 14
Topic #: 1

This question is one of a series of questions that present the same scenario. However, each question has a distinct outcome. Determine whether the recommendation satisfies the requirements.
You are in the process of creating a machine learning model. Your dataset includes rows with null and missing values.
You plan to make use of the Clean Missing Data module in Azure Machine Learning Studio to detect and fix the null and missing values in the dataset.
Recommendation: You make use of the Replace with median option.
Will the requirements be satisfied?

  • A. Yes
  • B. No
Suggested Answer: A

Comments

synapse
Highly Voted 3 years, 2 months ago
Selected Answer: A
This is an incomplete question. We don't know what type of data it is: continuous or categorical. If it's continuous then the answer is A; otherwise it's B.
upvoted 13 times
...
Xsytt419
Highly Voted 3 years, 5 months ago
should the answer be YES?
upvoted 6 times
...
lianaliam
Most Recent 2 days, 16 hours ago
Selected Answer: A
replace with mean for replace missing values
upvoted 1 times
...
Vinit9
8 months, 2 weeks ago
Selected Answer: A
Using the Clean Missing Data module in Azure Machine Learning Studio with the Replace with median option can help detect and fix null and missing values in your dataset. The Replace with median option replaces missing values in a dataset with the median value of the corresponding column. This method of imputing missing values can provide a good balance between preserving the overall distribution of the data and avoiding the introduction of extreme values. By using the Clean Missing Data module with the Replace with median option, you can help ensure that your dataset is cleaned and ready for use in creating a machine learning model, satisfying the requirements.
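To make the per-column behaviour concrete outside of Studio, here is a minimal sketch in plain Python. It only mimics the idea of median replacement; `replace_with_median` is a hypothetical helper and the `ages` values are made up, neither comes from Azure Machine Learning.

```python
from statistics import median


def replace_with_median(column):
    """Return a copy of `column` where None entries are replaced by the
    median of the values that are present (hypothetical helper)."""
    observed = [v for v in column if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in column]


# Illustrative numeric column with two missing entries.
ages = [25, None, 40, 31, None]
cleaned = replace_with_median(ages)
print(cleaned)
```

The median is computed from the observed values only, so the missing entries do not influence the replacement value.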
upvoted 1 times
...
james2033
8 months, 2 weeks ago
Selected Answer: A
https://learn.microsoft.com/en-us/previous-versions/azure/machine-learning/studio-module-reference/clean-missing-data#bkmk_ReplaceMissing
- Replace using MICE (Multivariate Imputation using Chained Equations)
- Custom substitution value
- Replace with mean
- Replace with median: Calculates the column median value, and uses the median value as the replacement for any missing value in the column. (*)
- Replace with mode
- Remove entire row
- Replace using Probabilistic PCA
'rows with null and missing values', so no removing entire row. I choose YES (*), it is A.
upvoted 2 times
...
Xsesi
10 months, 2 weeks ago
Selected Answer: B
We do not know the type of data. Replace with mode would be preferred if the data is categorical.
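The dependence on data type can be sketched in plain Python: use the median for numeric columns and fall back to the mode for everything else. This is only an illustration of the argument; `impute` is a hypothetical helper and the sample columns are made up, not taken from the question or from Azure Machine Learning.

```python
from statistics import median, mode


def impute(column):
    """Fill None entries with the median for numeric columns and with the
    mode (most frequent value) for any other column (hypothetical helper)."""
    observed = [v for v in column if v is not None]
    is_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool)
        for v in observed
    )
    fill = median(observed) if is_numeric else mode(observed)
    return [fill if v is None else v for v in column]


colors = ["red", None, "blue", "red"]   # categorical column
scores = [1.0, None, 3.0, 10.0]         # numeric column

filled_colors = impute(colors)
filled_scores = impute(scores)
print(filled_colors, filled_scores)
```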
upvoted 1 times
...
deyoz
1 year, 3 months ago
Answer is no, median is not appropriate to replace missing values of categorical columns.
upvoted 2 times
...
evangelist
1 year, 3 months ago
Selected Answer: A
A is correct
upvoted 1 times
...
Ratz
1 year, 6 months ago
Answer B: All the replace missing value options apply to the column. The question talks about randomly missing values in the row. Hence removing the row will be ideal.
upvoted 2 times
...
eternaleclipse
1 year, 7 months ago
No, because we don't know the TYPE of data it is. Simply replacing with the median may not work. What if it was text instead of numerical?
upvoted 2 times
...
rakeshmk
1 year, 8 months ago
Missing value replacement depends on the nature of your data. The median is robust to outliers. One can also use the mean.
upvoted 1 times
...
PradhanManva
1 year, 8 months ago
Selected Answer: A
This is the answer.
upvoted 1 times
...
mefor
1 year, 9 months ago
Selected Answer: B
Yes, using the "Replace with median" option in the Clean Missing Data module in Azure Machine Learning Studio can help satisfy the requirements of dealing with null and missing values in your machine learning dataset. The median is a suitable option for replacing missing values in numerical features because it's less sensitive to outliers compared to the mean. By choosing this option, the module will identify columns with missing values and replace those missing values with the median value of each respective column. This can help maintain the integrity of your dataset and ensure that your machine learning model receives meaningful input data. However, keep in mind that the choice of replacement strategy can also depend on the nature of your data and the specific requirements of your machine learning problem. It's always a good practice to assess the impact of different imputation methods on your model's performance to find the best strategy for your particular case.
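The point about outliers is easy to check with a few numbers. A minimal sketch using Python's standard `statistics` module; the `incomes` values are made up for illustration only.

```python
from statistics import mean, median

# Four typical values and one extreme outlier (illustrative data).
incomes = [30, 32, 35, 38, 1000]

mean_fill = mean(incomes)      # pulled far upward by the outlier
median_fill = median(incomes)  # stays at the middle observed value

print(mean_fill, median_fill)
```

A missing income filled with the mean would land far above every typical value, while the median fill stays in the middle of them.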
upvoted 1 times
...
endeesa
2 years ago
Selected Answer: B
We simply don't have enough information about the dataset to know if median substitution will work, so the answer is No.
upvoted 1 times
...
ManuelHenriques
2 years, 3 months ago
Selected Answer: B
You should not assume that it is correct to use the median if you don't know whether the data is continuous, so the answer is B.
upvoted 4 times
...
Obhee
2 years, 4 months ago
Replace with median: Calculates the column median value, and uses the median value as the replacement for any missing value in the column. Applies only to columns that have Integer or Double data types. See the Technical notes section for more information. https://learn.microsoft.com/en-us/previous-versions/azure/machine-learning/studio-module-reference/clean-missing-data
upvoted 1 times
...
KIshor1212
2 years, 6 months ago
Selected Answer: A
Replace with median: Calculates the column median value, and uses the median value as the replacement for any missing value in the column. Applies only to columns that have Integer or Double data types. https://learn.microsoft.com/en-us/azure/machine-learning/component-reference/clean-missing-data
upvoted 1 times
KIshor1212
2 years, 6 months ago
** Replace with median
upvoted 1 times
...
...