exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 171 discussion

A machine learning (ML) specialist wants to create a data preparation job that uses a PySpark script with complex window aggregation operations to create data for training and testing. The ML specialist needs to evaluate the impact of the number of features and the sample count on model performance.
Which approach should the ML specialist use to determine the ideal data transformations for the model?

  • A. Add an Amazon SageMaker Debugger hook to the script to capture key metrics. Run the script as an AWS Glue job.
  • B. Add an Amazon SageMaker Experiments tracker to the script to capture key metrics. Run the script as an AWS Glue job.
  • C. Add an Amazon SageMaker Debugger hook to the script to capture key parameters. Run the script as a SageMaker processing job.
  • D. Add an Amazon SageMaker Experiments tracker to the script to capture key parameters. Run the script as a SageMaker processing job.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
dolorez
Highly Voted 2 years, 11 months ago
Selected Answer: D
while I agree that Sagemaker Experiments is the way to go, it only supports Training, Processing, and Transform jobs, so the right answer is to run the job as a processing job, hence D not B https://docs.aws.amazon.com/sagemaker/latest/dg/experiments-create.html#:~:text=CreateTrainingJob-,Processing,-Processor.run
upvoted 12 times
Jerry84
2 years, 3 months ago
“Generally, you use load_run with no arguments to track metrics, parameters, and artifacts within a SageMaker training or processing job script.” https://docs.aws.amazon.com/sagemaker/latest/dg/experiments-create.html
upvoted 2 times
...
Jerry84
2 years, 2 months ago
Run PySpark script in SageMaker processing job https://sagemaker.readthedocs.io/en/stable/amazon_sagemaker_processing.html
upvoted 1 times
...
...
bluer1
Highly Voted 3 years ago
B - https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html
upvoted 7 times
KlaudYu
2 years, 10 months ago
But It doesn't describe glue job.
upvoted 4 times
...
...
salim1905
Most Recent 10 months, 3 weeks ago
Selected Answer: B
Pyspark -> AWS Glue
upvoted 1 times
ef12052
1 month, 1 week ago
https://sagemaker.readthedocs.io/en/stable/amazon_sagemaker_processing.html#pysparkprocessor -> D
upvoted 1 times
...
...
3eb0542
1 year ago
Selected Answer: B
AWS Glue is a fully managed extract, transform, and load (ETL) service that is purpose-built for processing large datasets and executing PySpark scripts. It's more aligned with the task of running a PySpark script with complex window aggregation operations to prepare data for training and testing
upvoted 1 times
...
sanjosh
1 year, 5 months ago
D https://sagemaker-experiments.readthedocs.io/en/latest/tracker.html
upvoted 2 times
...
Mickey321
1 year, 9 months ago
Selected Answer: D
A PySpark script can be run as a SageMaker processing job by using the SparkProcessor class. A SageMaker processing job can use Amazon SageMaker Experiments to track the input parameters, output metrics, and artifacts of each run. A SageMaker processing job can also use Amazon SageMaker Debugger to capture tensors and analyze the training behavior, but this is more useful for deep learning models than for data preparation tasks. Running the script as an AWS Glue job would not allow the ML specialist to use Amazon SageMaker Experiments or Amazon SageMaker Debugger, as these features are specific to SageMaker.
upvoted 2 times
...
ADVIT
1 year, 10 months ago
D: SageMaker Experiments automatically tracks the inputs, parameters, configurations, and results of your iterations as runs.
upvoted 1 times
...
dkx
1 year, 11 months ago
Selected Answer: D
The PySpark script defined above is passed via via the submit_app parameter https://github.com/aws/amazon-sagemaker-examples/blob/main/sagemaker_processing/spark_distributed_data_processing/sagemaker-spark-processing.ipynb
upvoted 1 times
...
Mllb
2 years, 1 month ago
Selected Answer: B
Key metrics is the "key". Then D is not a correct answer
upvoted 1 times
ZSun
2 years ago
what is the difference between key metrics and key parameteres? why we care about key metrics, because we can compare the key metrics of different parametes and then find impact of the number of features. so the key is "glue" or "SageMaker processing"
upvoted 2 times
...
...
blanco750
2 years, 1 month ago
Selected Answer: D
D looks the right answer
upvoted 2 times
...
SANDEEP_AWS
2 years, 1 month ago
Selected Answer: B
https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html ---- Use SageMaker Experiments to view, manage, analyze, and compare both custom experiments that you programmatically create and experiments automatically created from SageMaker jobs.
upvoted 1 times
ZSun
1 year, 12 months ago
"SageMaker jobs" not "Glue job", it is D!
upvoted 2 times
...
...
jhonivy
2 years, 3 months ago
B: Glue job goes with window aggregation operations
upvoted 1 times
...
aScientist
2 years, 5 months ago
Selected Answer: D
https://sagemaker-examples.readthedocs.io/en/latest/sagemaker_processing/spark_distributed_data_processing/sagemaker-spark-processing.html
upvoted 3 times
...
ovokpus
2 years, 10 months ago
Selected Answer: B
here: https://aws.amazon.com/about-aws/whats-new/2018/10/aws-glue-now-supports-connecting-amazon-sagemaker-notebooks-to-development-endpoints/#:~:text=AWS%20Glue%20now%20supports%20connecting%20Amazon%20SageMaker%20notebooks%20to%20development%20endpoints,-Posted%20On%3A%20Oct&text=You%20can%20now%20create%20an,an%20AWS%20Glue%20development%20endpoint.
upvoted 5 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago