A company uses Amazon S3 to store semi-structured data in a transactional data lake. Some of the data files are small, but other data files are tens of terabytes.
A data engineer must perform a change data capture (CDC) operation to identify changed data from the data source. The data source sends a full snapshot as a JSON file every day and ingests the changed data into the data lake.
Which solution will capture the changed data MOST cost-effectively?
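To make the scenario concrete, here is a minimal sketch of what identifying changed data from two consecutive full snapshots means. It is not one of the answer options and is not meant for terabyte-scale files: it loads everything into memory purely to show the logic. The function name `diff_snapshots`, the record key `id`, and the sample records are all hypothetical and do not come from the question.

```python
import json


def diff_snapshots(previous, current, key="id"):
    """Classify records from two full snapshots as inserted, updated, or deleted.

    Assumes every record is a dict carrying a unique value under `key`
    (a hypothetical primary key; the question does not name one).
    """
    prev_by_key = {rec[key]: rec for rec in previous}
    curr_by_key = {rec[key]: rec for rec in current}

    # Present today but not yesterday.
    inserts = [rec for k, rec in curr_by_key.items() if k not in prev_by_key]
    # Present on both days but with different contents.
    updates = [
        rec
        for k, rec in curr_by_key.items()
        if k in prev_by_key and prev_by_key[k] != rec
    ]
    # Present yesterday but missing today.
    deletes = [rec for k, rec in prev_by_key.items() if k not in curr_by_key]

    return {"inserts": inserts, "updates": updates, "deletes": deletes}


# Hypothetical sample snapshots standing in for two daily JSON files.
yesterday = json.loads(
    '[{"id": 1, "status": "new"}, {"id": 2, "status": "paid"}, {"id": 3, "status": "paid"}]'
)
today = json.loads(
    '[{"id": 1, "status": "shipped"}, {"id": 2, "status": "paid"}, {"id": 4, "status": "new"}]'
)

changes = diff_snapshots(yesterday, today)
```

With these sample records, `id` 1 is classified as an update, `id` 4 as an insert, and `id` 3 as a delete, while `id` 2 is unchanged and left out. At the file sizes the question describes, the same keyed comparison would have to run in a distributed engine rather than in a single process.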