exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 119 discussion

A data scientist wants to use Amazon Forecast to build a forecasting model for inventory demand for a retail company. The company has provided a dataset of historic inventory demand for its products as a .csv file stored in an Amazon S3 bucket. The table below shows a sample of the dataset.

How should the data scientist transform the data?

  • A. Use ETL jobs in AWS Glue to separate the dataset into a target time series dataset and an item metadata dataset. Upload both datasets as .csv files to Amazon S3.
  • B. Use a Jupyter notebook in Amazon SageMaker to separate the dataset into a related time series dataset and an item metadata dataset. Upload both datasets as tables in Amazon Aurora.
  • C. Use AWS Batch jobs to separate the dataset into a target time series dataset, a related time series dataset, and an item metadata dataset. Upload them directly to Forecast from a local machine.
  • D. Use a Jupyter notebook in Amazon SageMaker to transform the data into the optimized protobuf recordIO format. Upload the dataset in this format to Amazon S3.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
[Removed]
Highly Voted 3 years, 1 month ago
I would answer A. Target and metadata must be in two files and loaded from S3, based on documentation: https://docs.aws.amazon.com/forecast/latest/dg/dataset-import-guidelines-troubleshooting.html
upvoted 26 times
ZSun
1 year, 6 months ago
1. I cannot find any evidence support the seperate file defination. 2. A,B,C all seperate datasets, this explanation is weak.
upvoted 1 times
...
...
AjoseO
Highly Voted 1 year, 7 months ago
Selected Answer: A
Amazon Forecast requires the input data to be separated into a target time series dataset and an item metadata dataset. The target time series dataset should include the time series data that you want to use for forecasting, such as inventory demand in this case. The item metadata dataset should include the metadata that describes the items in the time series, such as product IDs, categories, and attributes. Therefore, the data scientist should use ETL jobs in AWS Glue to separate the dataset into a target time series dataset and an item metadata dataset. Both datasets should be uploaded as .csv files to Amazon S3, which is a suitable storage option for input data to Amazon Forecast.
upvoted 9 times
ccpmad
1 year, 3 months ago
thank you chatgpt
upvoted 2 times
...
...
AIWave
Most Recent 8 months, 3 weeks ago
I would go with A Input formats for forecast -> Json, CSV and paraquet (Selects A & eliminates B, C, D) Data needs to be split in target time series dataset and an item metadata dataset
upvoted 1 times
...
Mickey321
1 year, 2 months ago
Selected Answer: A
Target and metadata must be in two files
upvoted 1 times
...
kaike_reis
1 year, 3 months ago
Selected Answer: A
Letter A is correct, as it uses a specific transformation service (AWS Glue) and saves it in a cloud database for AWS Forecast to access. By default in ML, our storage option will be AWS S3 (unless caveats or issue specifications). That said, we discard B and C. Letter D is discarded due to the format requested by AWS Forecast being csv.
upvoted 1 times
...
ystotest
1 year, 11 months ago
Selected Answer: A
I would vote for A
upvoted 3 times
...
tgaos
2 years, 5 months ago
The answer is A. According to the https://docs.aws.amazon.com/forecast/latest/dg/forecast.dg.pdf , page 51. Target Time Series Dataset: Required: timestamp, item_id, demand Additional: lead_time Item Metadata Dataset: item_id, category
upvoted 3 times
tgaos
2 years, 5 months ago
You can find the same question with the picture at https://ccnav7.net/a-data-scientist-wants-to-use-amazon-forecast-to-build-a-forecasting-model-for-inventory-demand-for-a-retail-company/
upvoted 1 times
...
...
DSJingguo
3 years ago
The correct answer is A ''Forecast supports only the comma-separated values (CSV) file format. You can't separate values using tabs, spaces, colons, or any other characters. Guideline: Convert your dataset to CSV format (using only commas as your delimiter) and try importing the file again.''
upvoted 1 times
...
achiko
3 years ago
lead time belongs to related time series, as its not a target variable
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago