A company is building a new version of a recommendation engine. Machine learning (ML) specialists need to keep adding new data from users to improve personalized recommendations. The ML specialists gather data from the users' interactions on the platform and from sources such as external websites and social media.
The pipeline cleans, transforms, enriches, and compresses terabytes of data daily, and this data is stored in Amazon S3. A set of Python scripts was coded to do the job and is stored in a large Amazon EC2 instance. The whole process takes more than 20 hours to finish, with each script taking at least an hour. The company wants to move the scripts out of Amazon EC2 into a more managed solution that will eliminate the need to maintain servers.
Which approach will address all of these requirements with the LEAST development effort?
spaceexplorer
Highly Voted 2 years, 6 months agockkobe24
2 years, 5 months agodaidaidai
1 year, 5 months agoStokvisss
Most Recent 8 months, 1 week agoendeesa
11 months, 1 week agogiustino98
12 months agoteka112233
1 year, 1 month agokaike_reis
1 year, 2 months agoMickey321
1 year, 3 months agoMickey321
1 year, 3 months agoMickey321
1 year, 2 months agoAjoseO
1 year, 8 months agomaxkm
1 year, 9 months agoGiyeonShin
1 year, 8 months agomilan_ml
2 years, 3 months agoovokpus
2 years, 4 months ago