Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions

View all questions & answers for the AWS Certified Machine Learning Engineer - Associate MLA-C01 exam

Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 14 discussion

Exam question from Amazon's AWS Certified Machine Learning Engineer - Associate MLA-C01

Question #: 14
Topic #: 1

[All AWS Certified Machine Learning Engineer - Associate MLA-C01 Questions]

Case study -
An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.
The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.
The ML engineer needs to use an Amazon SageMaker built-in algorithm to train the model.
Which algorithm should the ML engineer use to meet this requirement?

A. LightGBM
B. Linear learner
C. К-means clustering
D. Neural Topic Model (NTM)

Show Suggested Answer

Suggested Answer: A 🗳️

by aragon_saa at Nov. 28, 2024, 12:51 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

Leo2023aws

Highly Voted 7 months ago

Selected Answer: A

https://docs.aws.amazon.com/en_kr/sagemaker/latest/dg/lightgbm.html

upvoted 8 times

...

aragon_saa

Highly Voted 7 months, 1 week ago

Selected Answer: B

Answer is B

upvoted 7 times

...

eesa

Most Recent 3 months ago

Selected Answer: A

1. Clase desbalanceada: LightGBM (Light Gradient Boosted Machine) es muy eficaz para trabajar con datasets desbalanceados, gracias a su capacidad para ajustar los pesos de clases y usar técnicas como el weighted loss o boosting adaptativo. 2. Interdependencia entre variables: LightGBM puede capturar relaciones no lineales e interacciones entre variables gracias a su estructura basada en árboles de decisión. Esto lo hace más adecuado que modelos lineales como el Linear Learner, que solo captura relaciones lineales. 3. No captura de patrones complejos: La descripción indica que el algoritmo actual no está capturando patrones subyacentes complejos, lo que sugiere la necesidad de un modelo más robusto como LightGBM, capaz de modelar relaciones complejas y no lineales en los datos

upvoted 1 times

...

Sadrik

3 months, 1 week ago

Selected Answer: B

Fraud detection is a binary classification problem, and Linear Learner is designed for classification tasks. Although LightGBM can handle binary classification tasks, including fraud detection and it is actually widely used for fraud detection, it is not available as a built-in SageMaker algorithm. Linear Learner is a built-in SageMaker algorithm.

upvoted 1 times

...

joy5135

3 months, 2 weeks ago

Selected Answer: B

because linear learner is a built-in algorithm while lgbm is not

upvoted 1 times

...

chris_spencer

3 months, 3 weeks ago

Selected Answer: B

Linear learner is a built-in algorithm whereas LightGBM can be used via custom container.

upvoted 1 times

...

doull

4 months, 3 weeks ago

Selected Answer: B

Linear learner is a built-in algorithm where LightBM is not

upvoted 1 times

...

shabak

4 months, 4 weeks ago

Selected Answer: A

ChatGPT say it's A: LightGBM

upvoted 1 times

...

Jacobog3

5 months ago

Selected Answer: A

Is supported by Sagemaker

upvoted 1 times

...

abrarjahin

5 months, 1 week ago

Selected Answer: B

Linear Learner is a built-in algorithm provided by SageMaker for supervised learning tasks like regression and classification. LightGBM is not a built-in algorithm in Amazon SageMaker. While it is a strong gradient-boosting algorithm, it would need to be implemented as a custom script in SageMaker, which increases operational overhead.

upvoted 2 times

ninomfr64

5 months ago

LightGBM is built-in https://docs.aws.amazon.com/sagemaker/latest/dg/lightgbm.html

upvoted 1 times

...

xukun

5 months, 2 weeks ago

Selected Answer: A

https://docs.aws.amazon.com/en_kr/sagemaker/latest/dg/lightgbm.html

upvoted 1 times

...

Makendran

5 months, 3 weeks ago

Selected Answer: B

In an ideal scenario, for a problem with these characteristics (fraud detection, class imbalance, feature interdependencies, complex patterns), a tree-based ensemble method like XGBoost (which is a SageMaker built-in algorithm) would be more suitable. XGBoost can handle non-linear relationships, is robust to class imbalance with proper tuning, and can capture complex patterns in the data. However, given the options provided and the requirement to use a SageMaker built-in algorithm, the Linear learner is the best available choice among these options for this specific fraud detection task.

upvoted 1 times

ninomfr64

5 months ago

LightGBM is better for this use case https://docs.aws.amazon.com/sagemaker/latest/dg/lightgbm.html

upvoted 1 times

...

gulf1324

5 months, 4 weeks ago

Selected Answer: B

A. Light BGM : It's suitable model, but not built-in model for SageMaker. Answer B. Linear learner : suitable model, built-in model for SageMaker. C. K-means clustering : groups similar data points, not suitable for classification problems, and it's unsupervised learning algorithm so doesn't fit in this case(fraud detection). D. Neural Topic Model: used for topic modeling and document classification, not suitable for fraud detection

upvoted 2 times

minhhnh

5 months, 3 weeks ago

Light BGM is built-in model for SageMaker https://docs.aws.amazon.com/sagemaker/latest/dg/lightgbm.html

upvoted 1 times

...

khchan123

6 months ago

Selected Answer: A

Here's why LightGBM is the most suitable algorithm for this fraud detection task: Handling Class Imbalance: LightGBM is particularly effective at handling imbalanced datasets, which is a key issue mentioned in the problem statement. It has built-in mechanisms to deal with class imbalance. Feature Interdependencies: LightGBM can capture complex feature interactions through its tree-based structure, addressing the issue of feature interdependencies mentioned in the problem. Capturing Underlying Patterns: As an advanced gradient boosting framework, LightGBM is excellent at capturing complex patterns in data, which the current algorithm is struggling with. Suitable for Fraud Detection: LightGBM is widely used in fraud detection tasks due to its high performance and ability to handle large datasets efficiently. Handling Various Data Types: It can work well with the mix of data types likely present in transaction logs, customer profiles, and database tables.

upvoted 3 times

...

ninomfr64

6 months ago

Selected Answer: A

We have an unbalanced dataset, this means we have labelled dataset thus we are going to use a supervised model training. This reduce options to A and B (K-means and NTM are unsupervised). Both LightGBM and Linear Learner provides hyperparameter to manage unbalanced datasets, respectively "scale-pos_weight" and "positive_example_weight_mult". I would go for LightGBM as this algorithm is more suited to handle complex relationship among features, while Linear Learner learns a linear function, or, for classification problems, a linear threshold function, and maps a vector x to an approximation of the label y.

upvoted 2 times

...

Ell89

6 months ago

Selected Answer: B

Linear Learner. LightGBM is NOT a built in algorithm which the question asks for.

upvoted 1 times

...

michaelcloud

6 months ago

Selected Answer: A

This is a binary classification problem so LightGBM so be used. Other algorithms are not for binary classification.

upvoted 1 times

...

Load full discussion...