Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 171 discussion

Exam question from Amazon's AWS Certified Machine Learning - Specialty

Question #: 171
Topic #: 1

[All AWS Certified Machine Learning - Specialty Questions]

A machine learning (ML) specialist wants to create a data preparation job that uses a PySpark script with complex window aggregation operations to create data for training and testing. The ML specialist needs to evaluate the impact of the number of features and the sample count on model performance.
Which approach should the ML specialist use to determine the ideal data transformations for the model?

A. Add an Amazon SageMaker Debugger hook to the script to capture key metrics. Run the script as an AWS Glue job.
B. Add an Amazon SageMaker Experiments tracker to the script to capture key metrics. Run the script as an AWS Glue job.
C. Add an Amazon SageMaker Debugger hook to the script to capture key parameters. Run the script as a SageMaker processing job.
D. Add an Amazon SageMaker Experiments tracker to the script to capture key parameters. Run the script as a SageMaker processing job.

Show Suggested Answer

Suggested Answer: D 🗳️

by bluer1 at April 28, 2022, 10:34 p.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

dolorez

Highly Voted 3 years, 1 month ago

Selected Answer: D

while I agree that Sagemaker Experiments is the way to go, it only supports Training, Processing, and Transform jobs, so the right answer is to run the job as a processing job, hence D not B https://docs.aws.amazon.com/sagemaker/latest/dg/experiments-create.html#:~:text=CreateTrainingJob-,Processing,-Processor.run

upvoted 12 times

Jerry84

2 years, 5 months ago

“Generally, you use load_run with no arguments to track metrics, parameters, and artifacts within a SageMaker training or processing job script.” https://docs.aws.amazon.com/sagemaker/latest/dg/experiments-create.html

upvoted 2 times

...

Jerry84

2 years, 4 months ago

Run PySpark script in SageMaker processing job https://sagemaker.readthedocs.io/en/stable/amazon_sagemaker_processing.html

upvoted 1 times

...

bluer1

Highly Voted 3 years, 1 month ago

B - https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html

upvoted 7 times

KlaudYu

3 years ago

But It doesn't describe glue job.

upvoted 4 times

...

salim1905

Most Recent 1 year ago

Selected Answer: B

Pyspark -> AWS Glue

upvoted 1 times

ef12052

3 months ago

https://sagemaker.readthedocs.io/en/stable/amazon_sagemaker_processing.html#pysparkprocessor -> D

upvoted 1 times

...

3eb0542

1 year, 2 months ago

Selected Answer: B

AWS Glue is a fully managed extract, transform, and load (ETL) service that is purpose-built for processing large datasets and executing PySpark scripts. It's more aligned with the task of running a PySpark script with complex window aggregation operations to prepare data for training and testing

upvoted 1 times

...

sanjosh

1 year, 7 months ago

D https://sagemaker-experiments.readthedocs.io/en/latest/tracker.html

upvoted 2 times

...

Mickey321

1 year, 10 months ago

Selected Answer: D

A PySpark script can be run as a SageMaker processing job by using the SparkProcessor class. A SageMaker processing job can use Amazon SageMaker Experiments to track the input parameters, output metrics, and artifacts of each run. A SageMaker processing job can also use Amazon SageMaker Debugger to capture tensors and analyze the training behavior, but this is more useful for deep learning models than for data preparation tasks. Running the script as an AWS Glue job would not allow the ML specialist to use Amazon SageMaker Experiments or Amazon SageMaker Debugger, as these features are specific to SageMaker.

upvoted 2 times

...

ADVIT

1 year, 11 months ago

D: SageMaker Experiments automatically tracks the inputs, parameters, configurations, and results of your iterations as runs.

upvoted 1 times

...

dkx

2 years, 1 month ago

Selected Answer: D

The PySpark script defined above is passed via via the submit_app parameter https://github.com/aws/amazon-sagemaker-examples/blob/main/sagemaker_processing/spark_distributed_data_processing/sagemaker-spark-processing.ipynb

upvoted 1 times

...

Mllb

2 years, 2 months ago

Selected Answer: B

Key metrics is the "key". Then D is not a correct answer

upvoted 1 times

ZSun

2 years, 2 months ago

what is the difference between key metrics and key parameteres? why we care about key metrics, because we can compare the key metrics of different parametes and then find impact of the number of features. so the key is "glue" or "SageMaker processing"

upvoted 2 times

...

blanco750

2 years, 3 months ago

Selected Answer: D

D looks the right answer

upvoted 2 times

...

SANDEEP_AWS

2 years, 3 months ago

Selected Answer: B

https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html ---- Use SageMaker Experiments to view, manage, analyze, and compare both custom experiments that you programmatically create and experiments automatically created from SageMaker jobs.

upvoted 1 times

ZSun

2 years, 1 month ago

"SageMaker jobs" not "Glue job", it is D!

upvoted 2 times

...

jhonivy

2 years, 4 months ago

B: Glue job goes with window aggregation operations

upvoted 1 times

...

aScientist

2 years, 7 months ago

Selected Answer: D

https://sagemaker-examples.readthedocs.io/en/latest/sagemaker_processing/spark_distributed_data_processing/sagemaker-spark-processing.html

upvoted 3 times

...

ovokpus

2 years, 12 months ago

Selected Answer: B

here: https://aws.amazon.com/about-aws/whats-new/2018/10/aws-glue-now-supports-connecting-amazon-sagemaker-notebooks-to-development-endpoints/#:~:text=AWS%20Glue%20now%20supports%20connecting%20Amazon%20SageMaker%20notebooks%20to%20development%20endpoints,-Posted%20On%3A%20Oct&text=You%20can%20now%20create%20an,an%20AWS%20Glue%20development%20endpoint.

upvoted 5 times

...