Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 174 discussion

Exam question from Amazon's AWS Certified Machine Learning - Specialty

Question #: 174
Topic #: 1

[All AWS Certified Machine Learning - Specialty Questions]

An ecommerce company sends a weekly email newsletter to all of its customers. Management has hired a team of writers to create additional targeted content. A data scientist needs to identify five customer segments based on age, income, and location. The customers' current segmentation is unknown. The data scientist previously built an XGBoost model to predict the likelihood of a customer responding to an email based on age, income, and location.
Why does the XGBoost model NOT meet the current requirements, and how can this be fixed?

A. The XGBoost model provides a true/false binary output. Apply principal component analysis (PCA) with five feature dimensions to predict a segment.
B. The XGBoost model provides a true/false binary output. Increase the number of classes the XGBoost model predicts to five classes to predict a segment.
C. The XGBoost model is a supervised machine learning algorithm. Train a k-Nearest-Neighbors (kNN) model with K = 5 on the same dataset to predict a segment.
D. The XGBoost model is a supervised machine learning algorithm. Train a k-means model with K = 5 on the same dataset to predict a segment.

Show Suggested Answer

Suggested Answer: D 🗳️

by spaceexplorer at April 29, 2022, 8 p.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

spaceexplorer

Highly Voted 2 years, 6 months ago

Selected Answer: D

Answer is D! K-means used for customer segmentation

upvoted 16 times

Omijh

2 years, 5 months ago

well, both are used for customer segmentation Knn & kmeans but kmeans is for unsupervised learning and knn is for supervised learning. since we have the data it's better to use supervised learning in this case. Ref: https://rstudio-pubs-static.s3.amazonaws.com/599866_59be74824ca7482ba99dbc8466dc36a0.html#:~:text=The%20difference%20between%20the%20two,to%20predict%20the%20unlabelled%20data.

upvoted 4 times

...

tgaos

Highly Voted 2 years, 5 months ago

The answer is D. 1. "The current segmentation of consumers is unclear." so it is unsupervised learning. 2. Then K-means is for unsupervised learning.

upvoted 11 times

...

AIWave

Most Recent 8 months, 3 weeks ago

Selected Answer: D

Typical clustering problem - use K means

upvoted 1 times

...

Sharath1783

1 year, 2 months ago

Selected Answer: D

KNN is used to solve missing data in regression/supervised problems. Since the question says unknown segmentation, it is an unsupervised problem and K-Means is the right choice. So Option D it is.

upvoted 1 times

...

kaike_reis

1 year, 3 months ago

D is the correct C is wrong because kNN stills a supervised algorithm

upvoted 1 times

...

Mickey321

1 year, 3 months ago

Selected Answer: D

The XGBoost model is a supervised machine learning algorithm, which means it requires labeled data to learn from. However, the customers’ current segmentation is unknown, so there are no labels to train or evaluate the model. The data scientist needs an unsupervised machine learning algorithm, which can discover patterns and clusters in unlabeled data. A k-means model is an example of an unsupervised machine learning algorithm that can partition the data into K groups based on similarity. By setting K = 5, the data scientist can obtain five customer segments based on age, income, and location.

upvoted 1 times

...

Peeking

1 year, 11 months ago

Selected Answer: D

KNN has no k parameter in its input. C is not the answer.

upvoted 1 times

drcok87

1 year, 9 months ago

in K-means also there is no input parameter "K". What i mean to say here is in knn the k is nothing but "kNN classifier identifies the class of a data point using the majority voting principle. If k is set to 5, the classes of 5 nearest points are examined."

upvoted 1 times

...

matteocal

2 years, 3 months ago

D The key work is that the classification is "unclear", therefore k-means

upvoted 3 times

...

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 174 discussion

Comments

spaceexplorer

Omijh

tgaos

AIWave

Sharath1783

kaike_reis

Mickey321

Peeking

drcok87

matteocal

SY0-701