Exam AWS Certified Solutions Architect - Professional topic 1 question 199 discussion

With Amazon Elastic MapReduce (Amazon EMR) you can analyze and process vast amounts of data. The cluster is managed using an open-source framework called Hadoop. You have set up an application to run Hadoop jobs. The application reads data from DynamoDB and generates a temporary file of 100 TB.
The whole process runs for 30 minutes and the output of the job is stored to S3.
Which of the below mentioned options is the most cost effective solution in this case?

  • A. Use Spot Instances to run Hadoop jobs and configure them with EBS volumes for persistent data storage.
  • B. Use Spot Instances to run Hadoop jobs and configure them with ethereal storage for output file storage.
  • C. Use an on demand instance to run Hadoop jobs and configure them with EBS volumes for persistent storage.
  • D. Use an on demand instance to run Hadoop jobs and configure them with ephemeral storage for output file storage.
Suggested Answer: B
Amazon EC2 Spot Instances let users name their own price for EC2 compute capacity. A user bids on spare Amazon EC2 instances, and the instances run whenever the bid exceeds the current Spot price. The Spot Instance pricing model complements the On-Demand and Reserved Instance pricing models and, depending on the application, is potentially the most cost-effective way to obtain compute capacity. The only challenge with a Spot Instance is data persistence, because the instance can be terminated whenever the Spot price exceeds the bid price. In this scenario the Hadoop job is temporary and does not run for long, and it fetches its data from DynamoDB, which is persistent. Thus, even if the instance is terminated there is no data loss and the job can be re-run. Because the output files are large temporary files, storing them on ephemeral storage (written as "ethereal" in option B) saves cost.
Reference:
http://aws.amazon.com/ec2/purchasing-options/spot-instances/

Comments

amog
Highly Voted 3 years, 8 months ago
Answer is B: "the output of the job is stored on S3". Ephemeral storage is used to store temporary data in this case.
upvoted 8 times
robertomartinez
3 years, 7 months ago
B says ethereal storage, so it's A
upvoted 1 times
bobsmith2000
3 years, 1 month ago
It's a typo. Don't read into it. There are a whole bunch of questions here with typos.
upvoted 1 times
...
...
...
amministrazione
Most Recent 10 months, 2 weeks ago
B. Use Spot Instances to run Hadoop jobs and configure them with Ephemeral storage for output file storage.
upvoted 1 times
...
KungLjao
1 year, 10 months ago
Ephemeral storage does not support more than a few TB. https://allcloud.io/blog/ec2-ephemeral-storage-size-per-instance-type/ Answer is A.
upvoted 1 times
...
cldy
3 years, 6 months ago
B. Use Spot Instances to run Hadoop jobs and configure them with ethereal storage for output file storage.
upvoted 1 times
...
01037
3 years, 8 months ago
Sure B
upvoted 1 times
...
Madwyn
3 years, 8 months ago
Use Spot for cost. Not A, because EBS for persistent data storage is not cost efficient; you may use S3. B, but not because EBS is limited to 16 TB: Hadoop forms a cluster, so it doesn't require each node to have 100 TB of storage. That would be silly.
upvoted 1 times
...
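A rough sketch of the sizing argument above: in a cluster, the 100 TB temporary file is spread over many nodes, so what matters is the combined local storage rather than the size of any single volume. The per-node capacities and the `nodes_needed` helper below are hypothetical, chosen only for illustration; real instance store sizes depend on the instance type.

```python
import math


def nodes_needed(total_tb: float, per_node_tb: float, replication: int = 1) -> int:
    """Smallest node count whose combined local storage holds the data.

    total_tb     -- size of the temporary data set (100 TB in the question)
    per_node_tb  -- usable local storage per node (hypothetical value)
    replication  -- number of copies kept of each block (assumed, 1 = no copies)
    """
    if total_tb <= 0 or per_node_tb <= 0 or replication < 1:
        raise ValueError("sizes must be positive and replication at least 1")
    return math.ceil(total_tb * replication / per_node_tb)


if __name__ == "__main__":
    # Hypothetical per-node capacities, not taken from any AWS price list.
    for per_node_tb in (2, 4, 8):
        count = nodes_needed(100, per_node_tb)
        print(f"{per_node_tb} TB per node: {count} nodes")
```

With these assumed numbers, no single node comes close to holding 100 TB, yet a cluster of a few dozen nodes does.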
newme
3 years, 8 months ago
Where to find price of instance store? And how to add instance store to an instance? I followed instructions from https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/add-instance-store-volumes.html#adding-instance-storage-instance but there is no instance store in Volume Type.
upvoted 1 times
...
MarkDillon1075
3 years, 8 months ago
'Ethereal' is just a typo. The same question pops up on other sites, and there it's 'Ephemeral'. So the answer is B.
upvoted 1 times
robertomartinez
3 years, 7 months ago
How can you affirm it's a typo? To me the answer is A, because "ethereal storage" does not exist. If it's really a typo (that's a lot of misplaced characters), then the answer is B. If I had this on the day of the exam, I'd choose A, assuming B is misleading because of the wrong use of the term "ethereal storage", which means nothing.
upvoted 2 times
...
...
Paramg1234
3 years, 8 months ago
This is tricky... B is the right answer because EBS will not support more than 16 TB, so the remaining choice is B (although there is no such thing as ethereal storage).
upvoted 1 times
...
JAWS1600
3 years, 8 months ago
A is the right answer
upvoted 1 times
...
JAWS1600
3 years, 8 months ago
Ephemeral may not work in this case because of the 100 TB file size. Instance stores are not available to accommodate this size. https://docs.aws.amazon.com/AWSEC2/latest/WindowsGuide/volume_limits.html
upvoted 3 times
...
NKnab
3 years, 8 months ago
B is what AWS wants you to do: https://aws.amazon.com/blogs/aws/run-amazon-elastic-mapreduce-on-ec2-spot-instances/
upvoted 2 times
...
NNHAN
3 years, 8 months ago
Answer is C. With B, the Spot Instance might be interrupted at any time.
upvoted 1 times
sam422
3 years, 8 months ago
The requirement is cost effectiveness; amog's comment makes sense.
upvoted 3 times
...
...
manishgp
3 years, 8 months ago
Answer is B. See the emphasis on "most cost-effective".
upvoted 4 times
...
skywalker
3 years, 9 months ago
Shouldn't this be "A", since there is no such thing as "ethereal storage" as mentioned in B?
upvoted 2 times
skywalker
3 years, 9 months ago
If "B" refers to ephemeral storage, then the output will be lost the moment the Spot Instance is terminated. Having the output stored in EBS, which is persistent, is a better option, right? So the answer should still be "A"?
upvoted 2 times
examacc
3 years, 9 months ago
I think B is OK, assuming ephemeral storage, as it is not the final destination. The final destination is still S3. It is talking about processed data, I believe.
upvoted 5 times
...
...
...