You encounter a large number of outages in the production systems you support. You receive alerts for all the outages that wake you up at night. The alerts are due to unhealthy systems that are automatically restarted within a minute. You want to set up a process that would prevent staff burnout while following Site
Reliability Engineering practices. What should you do?
AL12
Highly Voted 3 years, 3 months agoMF2C
3 years, 2 months ago09bd94b
Most Recent 5 months, 2 weeks agoJonathanSJ
2 years agoGreg123123
2 years, 1 month agossmb
2 years, 3 months agozygomar
2 years, 11 months agoSekierer
3 years agoKyubiBlaze
3 years agogcpz
3 years, 1 month agoESP_SAP
3 years, 1 month agoESP_SAP
3 years, 1 month agoManh
3 years, 2 months agoNXD
3 years, 3 months agoFeliphus
1 year, 1 month agoFeliphus
1 year, 1 month agoTNT87
3 years, 3 months agoTNT87
3 years, 1 month agoTNT87
3 years, 1 month agoneutrino9
3 years, 3 months agojob_search83
3 years, 3 months ago