Learning from Incidents
Session
Incidents
When Incidents Refuse to End
Wednesday Nov 19 / 11:45AM PST
As engineers, we’re used to managing failure, but long-running outages hit differently. They stretch teams, systems, and assumptions about how incidents “should” play out.
Vanessa Huerta Granda
Resiliency Manager @Enova, Co-Author of the Howie Guide on Post Incident Analysis
Session
Incident Analysis
The Time it Wasn't DNS
Wednesday Nov 19 / 03:55PM PST
In January of 2023, the Microsoft Azure Wide Area Network experienced a global outage. If you were a Microsoft customer at the time, you were impacted by this outage.
Sean Klein
Principal Technical Program Manager - Modern Incident Analysis @Microsoft Azure