Exam DP-100 topic 5 question 14 discussion

Actual exam question from Microsoft's DP-100

Question #: 14
Topic #: 5

You are creating a binary classification by using a two-class logistic regression model.
You need to evaluate the model results for imbalance.
Which evaluation metric should you use?

A. Relative Absolute Error
B. AUC Curve
C. Mean Absolute Error
D. Relative Squared Error
E. Accuracy
F. Root Mean Square Error

Suggested Answer: B

by Askme101 at Dec. 27, 2020, 5:31 a.m.

Comments

akgarg00

Highly Voted 4 years, 5 months ago

Suppose the data is 99% class 1 and 1% class 2. If every prediction is class 1, we attain 99% accuracy even though the model never identifies a single class 2 case. So accuracy is the incorrect answer.
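To make the 99%/1% argument concrete, here is a minimal sketch in plain Python. The 1,000-row dataset, the 0/1 label encoding (0 for the majority class, 1 for the minority class), and the helper names `accuracy` and `roc_auc` are assumptions made up for illustration, not anything from the exam or from Azure ML.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive is scored above
    a random negative, with ties counted as half a win."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical data: 990 majority rows (label 0) and 10 minority rows (label 1).
y_true = [0] * 990 + [1] * 10

# A "model" that always predicts the majority class with the same score.
y_pred = [0] * 1000
constant_scores = [0.0] * 1000

majority_accuracy = accuracy(y_true, y_pred)      # 0.99
majority_auc = roc_auc(y_true, constant_scores)   # 0.5

print(f"accuracy = {majority_accuracy:.2f}, AUC = {majority_auc:.2f}")
```

The constant predictor looks excellent on accuracy but scores an AUC of 0.5, the same as random guessing, which is why AUC exposes the problem that accuracy hides.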

upvoted 15 times

pancman

3 years, 3 months ago

Absolutely not. Funny thing is, you proved yourself wrong on why it shouldn't be accuracy in the answer you gave.

upvoted 1 times

evangelist

Most Recent 1 year, 1 month ago

Selected Answer: B

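For evaluating a binary classification model, especially with imbalanced datasets, the area under the Receiver Operating Characteristic curve (AUC-ROC) is a good choice. The ROC curve is built from the true positive rate and the false positive rate, each of which is computed within a single class, so the metric is largely insensitive to class imbalance. It also summarizes the model's performance across all classification thresholds rather than at one fixed cutoff.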

upvoted 1 times

evangelist

1 year, 2 months ago

Selected Answer: B

AUC (Area Under the Curve): The ROC (Receiver Operating Characteristic) curve is a performance measurement for classification problems at various threshold settings, and AUC is the area under it. AUC represents the degree of separability, indicating how well the model distinguishes between classes. An AUC value of 0.5 suggests no discrimination (i.e., random guessing), whereas a value of 1.0 indicates perfect discrimination. AUC-ROC is particularly useful for evaluating models on imbalanced datasets because it is insensitive to changes in the class distribution. It provides a single number that summarizes the trade-off between sensitivity (true positive rate) and the false positive rate (1 - specificity) across all thresholds.
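The "insensitive to changes in the class distribution" claim can be checked with a small plain-Python sketch. The scores below are invented for illustration, and `roc_auc` and `accuracy_at` are hypothetical helpers, not library functions. The idea is to keep the per-class score distributions fixed, replicate the negative class 50 times, and compare AUC with accuracy at a fixed 0.5 threshold.

```python
def roc_auc(y_true, scores):
    """AUC as the probability that a random positive is scored above
    a random negative, with ties counted as half a win."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def accuracy_at(y_true, scores, threshold=0.5):
    """Accuracy after turning scores into labels at a fixed threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(t == p for t, p in zip(y_true, preds)) / len(y_true)


# Made-up model scores for each class.
pos_scores = [0.9, 0.8, 0.4]
neg_scores = [0.7, 0.3, 0.2, 0.1]

# Roughly balanced dataset: 3 positives, 4 negatives.
balanced_y = [1] * 3 + [0] * 4
balanced_s = pos_scores + neg_scores

# Imbalanced dataset: same positives, negatives replicated 50 times.
imbalanced_y = [1] * 3 + [0] * 200
imbalanced_s = pos_scores + neg_scores * 50

auc_balanced = roc_auc(balanced_y, balanced_s)
auc_imbalanced = roc_auc(imbalanced_y, imbalanced_s)
acc_balanced = accuracy_at(balanced_y, balanced_s)
acc_imbalanced = accuracy_at(imbalanced_y, imbalanced_s)

print(f"AUC:      {auc_balanced:.4f} vs {auc_imbalanced:.4f}")
print(f"accuracy: {acc_balanced:.4f} vs {acc_imbalanced:.4f}")
```

In this sketch the AUC is the same for both datasets because it depends only on how positives rank against negatives, while accuracy shifts as the class mix changes. Note that this only holds when the per-class score distributions stay the same; it does not mean AUC is the best metric for every imbalanced problem.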

upvoted 1 times

phdykd

2 years, 5 months ago

The appropriate evaluation metric to use for assessing imbalance in a binary classification model is the AUC Curve (B). AUC (Area Under the Curve) is a measure of the model's ability to distinguish between positive and negative classes. AUC ranges from 0 to 1, where an AUC of 1 indicates perfect separation between the positive and negative classes, and an AUC of 0.5 indicates random chance. A high AUC value indicates that the model has a strong ability to correctly classify positive and negative instances, which is especially important in imbalanced datasets where one class may have significantly fewer instances than the other. Therefore, the AUC curve is a commonly used metric to evaluate the performance of binary classification models in the presence of class imbalance.

upvoted 1 times

ning

3 years, 1 month ago

I guess weighted AUC is the best answer ...

upvoted 4 times

ning

3 years, 1 month ago

Or weighted accuracy

upvoted 1 times

[Removed]

3 years, 3 months ago

What does "evaluate the model results for imbalance" mean? Does it mean evaluating the extent or degree of imbalance in the dataset? Or does it simply mean evaluating the model when the underlying data is imbalanced?

upvoted 1 times
