Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 20 discussion

A company is migrating on-premises workloads to AWS. The company wants to reduce overall operational overhead. The company also wants to explore serverless options.
The company's current workloads use Apache Pig, Apache Oozie, Apache Spark, Apache Hbase, and Apache Flink. The on-premises workloads process petabytes of data in seconds. The company must maintain similar or better performance after the migration to AWS.
Which extract, transform, and load (ETL) service will meet these requirements?

  • A. AWS Glue
  • B. Amazon EMR
  • C. AWS Lambda
  • D. Amazon Redshift
Show Suggested Answer Hide Answer
Suggested Answer: C ūüó≥ÔłŹ

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
milofficial
Highly Voted 4 months, 3 weeks ago
Selected Answer: B
Glue is like the more good-looking one, but weaker brother of EMR. So when it's about petabyte scales, let EMR do the work and have Glue stay away from the action.
upvoted 9 times
...
pypelyncar
Most Recent 6 days, 21 hours ago
Selected Answer: B
EMR provides a managed Hadoop framework that natively supports Apache Pig, Oozie, Spark, and Flink. This allows the company to migrate their existing workloads with minimal code changes, reducing development effort
upvoted 2 times
...
tgv
1 week, 6 days ago
Selected Answer: B
That's exactly the purpose of EMR. "Amazon EMR is the industry-leading cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning using open-source frameworks such as Apache Spark, Apache Hive, and Presto." https://aws.amazon.com/emr/
upvoted 1 times
...
Just_Ninja
1 month, 1 week ago
Selected Answer: A
Glue is Serverless :)
upvoted 1 times
...
wa212
2 months, 1 week ago
Selected Answer: B
https://docs.aws.amazon.com/ja_jp/emr/latest/ManagementGuide/emr-what-is-emr.html
upvoted 1 times
...
certplan
2 months, 3 weeks ago
- While AWS Glue is a fully managed ETL service and offers serverless capabilities, it might not provide the same level of performance and flexibility as Amazon EMR for handling petabyte-scale workloads with complex processing requirements. - AWS Glue is optimized for data integration, cataloging, and ETL jobs but may not be as well-suited for heavy-duty processing tasks that require frameworks like Apache Spark, Apache Flink, etc., which are commonly used for large-scale data processing. - Documentation on AWS Glue can be found in the AWS Glue Developer Guide https://docs.aws.amazon.com/glue/index.html.
upvoted 1 times
...
certplan
2 months, 3 weeks ago
A. AWS Glue: AWS Glue is a fully managed extract, transform, and load (ETL) service provided by Amazon Web Services (AWS). It allows users to prepare and load data for analytics purposes B. Amazon EMR: Amazon Elastic MapReduce (EMR) is a cloud-based big data platform provided by AWS. It allows users to process and analyze large amounts of data using popular frameworks such as Apache Hadoop, Apache Spark, Apache Hive, Apache HBase, and more. https://docs.aws.amazon.com/emr/index.html https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-best-practices.html https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-manage.html https://docs.aws.amazon.com/emr/latest/DeveloperGuide/emr-developer-guide.html As per the AWS/Amazon docs, option B specifically calls out it out with the specific features/options that the question asked directly about.
upvoted 2 times
...
GiorgioGss
3 months, 1 week ago
Selected Answer: B
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-release-components.html
upvoted 1 times
...
TonyStark0122
4 months, 2 weeks ago
A. AWS Glue
upvoted 1 times
...
[Removed]
4 months, 3 weeks ago
Selected Answer: B
https://aws.amazon.com/emr/features/
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...