exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 98 discussion

A company is migrating from an on-premises Apache Hadoop cluster to an Amazon EMR cluster. The cluster runs only during business hours. Due to a company requirement to avoid intraday cluster failures, the EMR cluster must be highly available. When the cluster is terminated at the end of each business day, the data must persist.
Which configurations would enable the EMR cluster to meet these requirements? (Choose three.)

  • A. EMR File System (EMRFS) for storage
  • B. Hadoop Distributed File System (HDFS) for storage
  • C. AWS Glue Data Catalog as the metastore for Apache Hive
  • D. MySQL database on the master node as the metastore for Apache Hive
  • E. Multiple master nodes in a single Availability Zone
  • F. Multiple master nodes in multiple Availability Zones
Show Suggested Answer Hide Answer
Suggested Answer: ACE 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
AjithkumarSL
Highly Voted 3 years, 10 months ago
yes.. I go with ACE.. https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-plan-ha.html "Note : The cluster can reside only in one Availability Zone or subnet."
upvoted 35 times
juanife
2 years, 1 month ago
I did not know that, thank you very much. Undoubtedly the correct answer is ACF.
upvoted 1 times
...
...
yogen
Highly Voted 3 years, 7 months ago
ACE, for those in doubts for F - EMR cluster can only be launched in single availability zone, if availability zone failure is to be considered then a read replica of EMR cluster is configured in another availability zone with shared storage space. But this option is not there in the choice. so ACE is the correct answer https://aws.amazon.com/getting-started/hands-on/optimize-amazon-emr-clusters-with-ec2-spot/
upvoted 9 times
...
pk349
Most Recent 2 years, 3 months ago
ACE: I passed the test
upvoted 1 times
...
mawsman
2 years, 4 months ago
Selected Answer: ACE
E not F because Amazon EMR clusters with multiple primary nodes are not tolerant to Availability Zone failures. https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-plan-ha-considerations.html
upvoted 1 times
...
Aina
2 years, 4 months ago
Selected Answer: ACE
This question appears in the Stephane Maarek's Udemy course.
upvoted 1 times
Tabby_cloudy
1 year, 8 months ago
where ? I must have missed it.
upvoted 2 times
...
...
Chelseajcole
2 years, 6 months ago
https://repost.aws/questions/QUvOaZvA5BT56skWO0iu2kZA/emr-in-2-a-zs-and-high-availability
upvoted 1 times
...
Mang2000
2 years, 6 months ago
Amazon EMR clusters with multiple primary nodes are not tolerant to Availability Zone failures. In the case of an Availability Zone outage, you lose access to the Amazon EMR cluster. E - is not correct here
upvoted 1 times
...
cloudlearnerhere
2 years, 9 months ago
Correct answers are A, C & E Option A as the cluster is not persistent and terminated each business day, it would be best to use EMRFS and S3 as an external persistence layer. Option C as AWS Glue Data Catalog can be used as the metastore for Apache Hive. Option E as Multiple master nodes are hosted in a single AZ or subnet. Option B is wrong as HDFS would need a persistent cluster. Option D is wrong as the MySQL database should be external and not installed on the master nodes. Option F is wrong as multiple master nodes cannot be hosted in multiple AZs but in a single AZ.
upvoted 6 times
...
JHJHJHJHJ
2 years, 11 months ago
E is correct (ACE) Amazon docs lists E
upvoted 1 times
...
Dun6
3 years ago
I go with ACE
upvoted 1 times
...
rocky48
3 years ago
Selected Answer: ACE
Selected Answer: ACE
upvoted 1 times
...
samsanta2012
3 years, 2 months ago
Selected Answer: ACF
Amazon EMR supports multiple master nodes to enable high availability for EMR applications. EMR clusters with multiple master nodes are not tolerant of Availability Zone failures. In the case of an Availability Zone outage, you lose access to the EMR cluster. In the event that the primary cluster becomes unavailable, you can access the data from the read-replica cluster to perform read operations simultaneously.
upvoted 1 times
...
Bik000
3 years, 3 months ago
Selected Answer: ACE
My Answer is A, C & E
upvoted 1 times
...
MWL
3 years, 3 months ago
Selected Answer: ACE
Many explaination below.
upvoted 1 times
...
Japanese1
3 years, 6 months ago
A, C, F https://aws.amazon.com/jp/blogs/news/setting-up-read-replica-clusters-with-hbase-on-amazon-s3/
upvoted 1 times
Japanese1
3 years, 6 months ago
I'm not a fluent speaker of English, so it's possible that I didn't understand the intent of the question.
upvoted 1 times
...
...
aws2019
3 years, 9 months ago
I go with ACE..
upvoted 1 times
...
nirmalmarathon
3 years, 9 months ago
The correct answer shown is BCF but in discussions it’s apparent the answe should be ACF. What’s the actual answer?
upvoted 1 times
cnmc
3 years, 5 months ago
You must be new here... Always open the discussion to see the actual, correct answer
upvoted 3 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...