A company uses an Amazon EMR cluster to process data once a day. The raw data comes from Amazon S3, and the resulting processed data is also stored in
Amazon S3. The processing must complete within 4 hours; currently, it only takes 3 hours. However, the processing time is taking 5 to 10 minutes longer each week due to an increasing volume of raw data.
The team is also concerned about rising costs as the compute capacity increases. The EMR cluster is currently running on three m3.xlarge instances (one master and two core nodes).
Which of the following solutions will reduce costs related to the increasing compute needs?
donathon
Highly Voted 3 years, 8 months agoWaiweng
Highly Voted 3 years, 7 months agotvs
3 years, 7 months agoTiredDad
3 years, 7 months agomaxh8086
Most Recent 2 years, 5 months agoBinoj_1985
3 years, 6 months agocldy
3 years, 6 months agoAzureDP900
3 years, 6 months agoDerekKey
3 years, 7 months agoWhyIronMan
3 years, 7 months agoSunflyhome
3 years, 7 months agoPupu86
3 years, 7 months agonitinz
3 years, 7 months agoawsnoob
3 years, 7 months agonatpilot
3 years, 7 months agoKian1
3 years, 7 months agonewme
3 years, 7 months agoT14102020
3 years, 7 months agoKau123
3 years, 7 months agojackdryan
3 years, 8 months ago