You want to schedule a number of sequential load and transformation jobs. Data files will be added to a Cloud Storage bucket by an upstream process. There is no fixed schedule for when the new data arrives. Next, a Dataproc job is triggered to perform some transformations and write the data to BigQuery. You then need to run additional transformation jobs in BigQuery. The transformation jobs are different for every table. These jobs might take hours to complete. You need to determine the most efficient and maintainable workflow to process hundreds of tables and provide the freshest data to your end users. What should you do?
cuadradobertolinisebastiancami
Highly Voted 1 year, 2 months agochoprat1
Most Recent 3 months, 1 week agof74ca0c
4 months, 3 weeks ago8ad5266
10 months, 3 weeks agoplum21
3 months agoJyoGCP
1 year, 2 months agoMatt_108
1 year, 4 months agoJordan18
1 year, 4 months agocuadradobertolinisebastiancami
1 year, 2 months agoAllenChen123
1 year, 4 months agoraaad
1 year, 4 months agoscaenruy
1 year, 4 months ago