You need to modernize your existing on-premises data strategy. Your organization currently uses:
• Apache Hadoop clusters for processing multiple large data sets, including on-premises Hadoop Distributed File System (HDFS) for data replication.
• Apache Airflow to orchestrate hundreds of ETL pipelines with thousands of job steps.
You need to set up a new architecture in Google Cloud that can handle your Hadoop workloads and requires minimal changes to your existing orchestration processes. What should you do?
raaad
Highly Voted 10 months agodatasmg
Most Recent 7 months, 4 weeks agocuadradobertolinisebastiancami
8 months, 1 week agoJyoGCP
8 months, 2 weeks agoMatt_108
9 months, 3 weeks agoscaenruy
10 months ago