A company uses Amazon S3 to store semi-structured data in a transactional data lake. Some of the data files are small, but other data files are tens of terabytes.
A data engineer must perform a change data capture (CDC) operation to identify changed data from the data source. The data source sends a full snapshot as a JSON file every day, and the changed data must be ingested into the data lake.
Which solution will capture the changed data MOST cost-effectively?
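
For readers unfamiliar with snapshot-based CDC, here is a minimal sketch of what "identifying changed data" between two daily full snapshots can mean. It is an illustration only, not one of the answer options. The record key `id`, the `status` field, and the sample records are all hypothetical, and the in-memory comparison would not scale to files of tens of terabytes, where a distributed engine or a table format with merge support would be needed.

```python
import json


def diff_snapshots(previous, current, key="id"):
    """Classify records as inserted, updated, or deleted between two snapshots.

    `key` is a hypothetical primary-key field; the question does not name one.
    """
    prev_by_key = {rec[key]: rec for rec in previous}
    curr_by_key = {rec[key]: rec for rec in current}

    # Present today but not yesterday
    inserts = [rec for k, rec in curr_by_key.items() if k not in prev_by_key]
    # Present on both days but with different content
    updates = [
        rec
        for k, rec in curr_by_key.items()
        if k in prev_by_key and prev_by_key[k] != rec
    ]
    # Present yesterday but missing today
    deletes = [rec for k, rec in prev_by_key.items() if k not in curr_by_key]

    return {"inserts": inserts, "updates": updates, "deletes": deletes}


# Hypothetical daily snapshots, parsed from JSON as the source would deliver them
yesterday = json.loads(
    '[{"id": 1, "status": "new"}, {"id": 2, "status": "open"}, {"id": 3, "status": "open"}]'
)
today = json.loads(
    '[{"id": 1, "status": "new"}, {"id": 2, "status": "closed"}, {"id": 4, "status": "new"}]'
)

changes = diff_snapshots(yesterday, today)
print(changes)
```

Only the records in `changes` would then need to be written to the data lake, rather than the full snapshot.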