A company uses Amazon S3 to store semi-structured data in a transactional data lake. Some of the data files are small, but other data files are tens of terabytes.
A data engineer must perform a change data capture (CDC) operation to identify changed data from the data source. The data source sends a full snapshot as a JSON file every day and ingests the changed data into the data lake.
Which solution will capture the changed data MOST cost-effectively?
GiorgioGss
Highly Voted 1 year, 5 months agoplutonash
Most Recent 7 months, 1 week agoJuan_pc
3 months, 4 weeks agoinfluxy
1 year agoFunkyFresco
1 year, 2 months agocertplan
1 year, 5 months agodamaldon
1 year, 5 months agoJuan_pc
3 months, 4 weeks agoGiorgioGss
1 year, 5 months ago[Removed]
1 year, 7 months agoHouyon
1 year, 6 months ago