You have an upstream process that writes data to Cloud Storage. This data is then read by an Apache Spark job that runs on Dataproc. These jobs are run in the us-central1 region, but the data could be stored anywhere in the United States. You need to have a recovery process in place in case of a catastrophic single region failure. You need an approach with a maximum of 15 minutes of data loss (RPO=15 mins). You want to ensure that there is minimal latency when reading the data. What should you do?
raaad
Highly Voted 9 months, 4 weeks agoJyoGCP
Most Recent 8 months, 1 week agoMatt_108
9 months, 3 weeks agoscaenruy
10 months ago