You encounter a large number of outages in the production systems you support. You receive an alert for every outage, and the alerts wake you up at night. The alerts are caused by unhealthy systems that are automatically restarted within a minute. You want to set up a process that prevents staff burnout while following Site Reliability Engineering practices. What should you do?