exam questions

Exam AWS Certified Solutions Architect - Associate SAA-C02 All Questions

View all questions & answers for the AWS Certified Solutions Architect - Associate SAA-C02 exam

Exam AWS Certified Solutions Architect - Associate SAA-C02 topic 1 question 244 discussion

A company is using a fleet of Amazon EC2 instances to ingest data from on-premises data sources. The data is in JSON format and ingestion rates can be as high as 1 MB/s. When an EC2 instance is rebooted, the data in-flight is lost. The company's data science team wants to query ingested data in near-real time.
Which solution provides near-real-time data querying that is scalable with minimal data loss?

  • A. Publish data to Amazon Kinesis Data Streams. Use Kinesis Data Analytics to query the data.
  • B. Publish data to Amazon Kinesis Data Firehose with Amazon Redshift as the destination. Use Amazon Redshift to query the data.
  • C. Store ingested data in an EC2 instance store. Publish data to Amazon Kinesis Data Firehose with Amazon S3 as the destination. Use Amazon Athena to query the data.
  • D. Store ingested data in an Amazon Elastic Block Store (Amazon EBS) volume. Publish data to Amazon ElastiCache for Redis. Subscribe to the Redis channel to query the data.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
aguy9
Highly Voted 3 years, 7 months ago
Answer is B. Kinesis data streams consists of shards. The more througput is needed, the more shards you add, the less throughput, the more shards you remove, so it’s scalable. Each shard can handle up to 1MB/s of writes. However Kinesis data streams stores ingested data for only 1 to 7 days so there is a chance of data loss. Additionally, Kinesis data analytics and kinesis data streams are both for real-time ingestion and analytics. Firehouse on the other hand is also scalable and processes data in near real time as per the requirement. It also transfers data into Redshift which is a data warehouse so data won’t be lost. Redshift also has a SQL interface for performing queries for data analytics. This information was sourced from ultimate AWS certified solutions architect 2020 course with Stephane Maarek.
upvoted 72 times
hseeker1
3 years, 6 months ago
Data stream not store data, instance store lost data when EC2 instance rebooted. So B is right
upvoted 3 times
...
guruaws2021
3 years, 6 months ago
A Kinesis data stream is an ordered sequence of data records meant to be written to and read from in real time. Data records are therefore stored in shards in your stream temporarily. The time period from when a record is added to when it is no longer accessible is called the retention period. A Kinesis data stream stores records from 24 hours by default, up to 8760 hours (365 days).
upvoted 4 times
...
guruaws2021
3 years, 6 months ago
Amazon Kinesis stores your data for up to 24 hours by default. You can raise data retention period to up to 7 days by enabling extended data retention or up to 365 by enabling long-term data retention using the console, the CLI or the API call. https://aws.amazon.com/kinesis/data-streams/faqs/#:~:text=Amazon%20Kinesis%20stores%20your%20data,CLI%20or%20the%20API%20call.
upvoted 2 times
Criticalhitx
3 years, 5 months ago
now its 365 days
upvoted 1 times
...
...
Goozian
3 years, 6 months ago
it doesn't mention data needs to be maintain for long term. "query ingested data in near-real time" => Kinesis Data Stream + Analytics are enough Answer is A
upvoted 5 times
...
...
aws_guru1
Highly Voted 3 years, 6 months ago
I am completely confused with the comments here. More than 80 comments saying ALL options are correct. This is sad! I wish people will comment only if they are sure of the answer.
upvoted 36 times
andwill1001
3 years, 6 months ago
It's a bad question with multiple possible right answers. That's why. It's not their fault.
upvoted 11 times
...
...
orbpig
Most Recent 2 years, 6 months ago
Selected Answer: A
B.Amazon Kinesis Data Firehose could lose data C.EC2 instance store could lose data D:Elastic Block Store and Redis could lose data
upvoted 1 times
...
Sinaneos
2 years, 7 months ago
Selected Answer: B
The answer is B, it's more resilient than A and compatible with the near-real-time requirements.
upvoted 1 times
...
Curious76
2 years, 8 months ago
Selected Answer: A
Query ingested data in near-real time= kinnesis data analytics.
upvoted 1 times
...
Moathov
2 years, 9 months ago
Selected Answer: A
AAAAAAAAA
upvoted 1 times
...
Janan
2 years, 9 months ago
Selected Answer: A
Kinesis Data Streams for real time data ingestion and KDA for "real-time" analytics. REal time analytics is the key. Load in Redshift is not real time..
upvoted 1 times
...
Nikpati
2 years, 11 months ago
Selected Answer: A
For near real time streaming and processing answer A is correct.
upvoted 2 times
...
Venki_dev
3 years, 1 month ago
Selected Answer: A
Answer is A , Amazon Kinesis Data Streams to collect and store streaming data Amazon Kinesis Data Analytics to process and analyze the streaming data continuously https://aws.amazon.com/blogs/big-data/perform-near-real-time-analytics-on-streaming-data-with-amazon-kinesis-and-amazon-elasticsearch-service/ Redshift is nowhere required.
upvoted 4 times
...
awsnoobster
3 years, 1 month ago
Kinesis Strem = Real time. so A for me.
upvoted 1 times
...
sayed
3 years, 3 months ago
Selected Answer: A
Near Real time -> firehouse JSON -> so redshift is not suitable I will go with A as KDS ingest data in real-time but KDA is near real time plus JSON data lead me to not use redshift I hope this one is not in my exam
upvoted 5 times
...
weilun_tann
3 years, 4 months ago
Selected Answer: A
- Wrong. Answer is A. - "near-real-time data querying" --> Kinesis Data Analytics - "minimal data loss" --> Kinesis Data Streams, which has configurable persistence (1 - 7 days) - https://docs.aws.amazon.com/streams/latest/dev/kinesis-extended-retention.html
upvoted 1 times
...
Subhankar89
3 years, 4 months ago
My bad ... after analysing the question. Answer is B
upvoted 1 times
...
Subhankar89
3 years, 4 months ago
1. an Amazon Kinesis Data Analytics for Apache Flink application transforms and aggregates streaming data in near-real time 2. Amazon Kinesis Data Firehose extends data from Kinesis Data Streams to your data lake stored in Amazon Simple Storage Service (Amazon S3) Answer is A. https://aws.amazon.com/blogs/database/near-real-time-processing-with-amazon-kinesis-amazon-timestream-and-grafana/
upvoted 1 times
...
Osemk
3 years, 5 months ago
Ans is A
upvoted 1 times
...
SFXY
3 years, 5 months ago
Selected Answer: B
Kinesis Data Firehose Load streaming data into S3 / Redshift /ES / 3rd party / custom HTTP Near real-time (buffer time min. 60 sec) Automatic scaling
upvoted 2 times
...
AMEJack
3 years, 5 months ago
This is a data analytics question, the answer should be A, you need to ingest the data in real time so you should use Kinesis Data Stream, the do the analytics with near real time analysis tool which is Kinesis Data Analytics. Kinesis Data Firehose can't ingest 1MB/sec.
upvoted 2 times
ChrisG1454
2 years, 9 months ago
By default, each delivery stream can intake up to 2,000 transactions/second, 5,000 records/second, and 5 MB/second. You can have this limit increased easily by submitting a service limit increase form. Reference https://aws.amazon.com/kinesis/data-firehose/faqs/
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago