You encounter a large number of outages in the production systems you support. You receive alerts for all the outages, the alerts are due to unhealthy systems that are automatically restarted within a minute. You want to set up a process that would prevent staff burnout while following Site Reliability Engineering (SRE) practices. What should you do?
enter_co
7 months, 1 week agoxhilmi
1 year, 6 months agomshafa
1 year, 7 months agolelele2023
1 year, 7 months agolelele2023
1 year, 7 months agoJason_Cloud_at
1 year, 7 months ago