Resilience
Architecting a Centralized Platform for Data Deletion at Netflix
Monday Nov 17 / 03:55PM PST
What does it take to safely delete data at Netflix scale? In large-scale systems, data deletion cuts across infrastructure, reliability, and performance concerns.
Vidhya Arvind
Tech Lead & Founding Architect for the Data Abstraction Platform @Netflix, Previously @Box and @Verizon
Shawn Liu
Senior Software Engineer @Netflix, Building Reliable and Extensible Systems for Consumer Data Lifecycle at Scale
Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix
Monday Nov 17 / 05:05PM PST
How does Netflix maintain a seamless viewing experience for millions of users, especially during traffic spikes or when backend datastores are overloaded? Autoscaling can help during traffic spikes, but it costs money, takes a few minutes to kick in, and capacity may not always be available.
Anirudh Mendiratta
Staff Software Engineer, Playback Lifecycle @Netflix, Previously @Amazon Prime Video and @fuboTV
Benjamin Fedorka
Staff Software Engineer, Productivity Engineering @Netflix
When Incidents Refuse to End
Wednesday Nov 19 / 11:45AM PST
As engineers, we’re used to managing failure, but long-running outages hit differently. They stretch teams, systems, and assumptions about how incidents “should” play out.
Vanessa Huerta Granda
Resiliency Manager @Enova, Co-Author of the Howie Guide on Post Incident Analysis