exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 58 discussion

A Machine Learning Specialist needs to be able to ingest streaming data and store it in Apache Parquet files for exploration and analysis.
Which of the following services would both ingest and store this data in the correct format?

  • A. AWS DMS
  • B. Amazon Kinesis Data Streams
  • C. Amazon Kinesis Data Firehose
  • D. Amazon Kinesis Data Analytics
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
JayK
Highly Voted 3 years, 7 months ago
the answer is C. as the main point of the question is data transformation to Parquet format which is done by Kinesis Data Firehose not Data Stream. Coming to the data store the data store in Kinesis Data Stream is only for couple of days so it does not serve the purpose here
upvoted 52 times
shammous
8 months, 3 weeks ago
The storage part will be taken care of by S3 anyway. Firehose would just transform to Parquet on the fly.
upvoted 1 times
...
...
eganilovic
Highly Voted 3 years, 5 months ago
Firehose
upvoted 5 times
...
earthMover
Most Recent 1 year, 11 months ago
Not sure Firehose can store the data .... Data Stream can store the data. Someone please explain the answer
upvoted 1 times
kaike_reis
1 year, 9 months ago
Firehose is to Store the data. Stream requires other service to do that.
upvoted 1 times
...
...
GOSD
2 years ago
Kinesis Data Streams can Store for up to 365 days, While Firehouse sends it to S3. Which is correct?
upvoted 1 times
...
Valcilio
2 years, 1 month ago
Selected Answer: C
Firehose can do it if the data is in JSON or ORC format initially!
upvoted 2 times
...
DS2021
2 years, 2 months ago
It should be KDS
upvoted 1 times
...
AjoseO
2 years, 2 months ago
Selected Answer: C
Amazon Kinesis Data Firehose is a fully managed service that can automatically load streaming data into data stores and analytics tools. It can ingest real-time streaming data such as application logs, website clickstreams, and IoT telemetry data, and then store it in the correct format, such as Apache Parquet files, for exploration and analysis. This makes it a suitable option for the requirement described in the question.
upvoted 1 times
...
Thai_Xuan
3 years, 6 months ago
B https://github.com/ravsau/aws-exam-prep/issues/10
upvoted 2 times
...
weslleylc
3 years, 6 months ago
B) Only Amazon Kinesis Data Streams can store and Ingest data. We don't need to apply any transformation; the question asks to ingest and store data in Apache Parquet format, There is no assumption that the data coming in a different format than parquet.
upvoted 3 times
joe3232
2 years, 3 months ago
KDS cant store to s3 https://stackoverflow.com/questions/66097886/writing-to-s3-via-kinesis-stream-or-firehose
upvoted 1 times
...
...
In
3 years, 6 months ago
It is C with no doubt https://aws.amazon.com/about-aws/whats-new/2018/05/stream_real_time_data_in_apache_parquet_or_orc_format_using_firehose/
upvoted 5 times
...
GeeBeeEl
3 years, 6 months ago
It appears all agree that the answer is between Firehose and Analytics. Kinesis Firehose is used for ingestion. Both firehose and analytics can store, only firehose can ingest. https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html shows firehose can store parquet to S3
upvoted 3 times
GeeBeeEl
3 years, 6 months ago
It appears all agree that the answer is between Firehose and Analytics. Data Streams handle stuff like event data, clickstream etc. Its not interested in special format, the focus is speed. The question did not talk of transformation, only ingestion. Kinesis Firehose is used for ingestion. Both firehose and analytics can store, only firehose can ingest. https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html shows firehose can store parquet to S3
upvoted 1 times
...
...
Urban_Life
3 years, 6 months ago
Think just like this -- batch process Glue ETL and Streaming process Firehose ETL ......covert to parquet or any other format.
upvoted 1 times
...
CMMC
3 years, 7 months ago
C for Firehose
upvoted 2 times
...
Erso
3 years, 7 months ago
Just in case https://acloud.guru/forums/aws-certified-big-data-specialty/discussion/-KhI3MgPEo-FY5rfgl3J/what_is_difference_between_kin
upvoted 2 times
...
BigEv
3 years, 7 months ago
Amazon Kinesis Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. https://github.com/awsdocs/amazon-kinesis-data-firehose-developer-guide/blob/master/doc_source/record-format-conversion.md
upvoted 3 times
...
rsimham
3 years, 7 months ago
I would go with B. Kinesis data streams stores data, while Firehose not.
upvoted 3 times
cloud_trail
3 years, 6 months ago
It's the other way around. Firehouses stores data; data streams does not.
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago