You need to modernize your existing on-premises data strategy. Your organization currently uses:
• Apache Hadoop clusters for processing multiple large data sets, including on-premises Hadoop Distributed File System (HDFS) for data replication.
• Apache Airflow to orchestrate hundreds of ETL pipelines with thousands of job steps.
You need to set up a new architecture in Google Cloud that can handle your Hadoop workloads and requires minimal changes to your existing orchestration processes. What should you do?
raaad
Highly Voted 1 year, 1 month agodatasmg
Most Recent 11 months agocuadradobertolinisebastiancami
11 months, 2 weeks agoJyoGCP
11 months, 3 weeks agoMatt_108
1 year agoscaenruy
1 year, 1 month ago