Abstract
In large-scale systems, data deletion is a complex challenge that touches infrastructure, reliability, and performance. As data spreads across diverse data stores, ensuring timely and consistent deletion becomes increasingly difficult. Without a central strategy, teams often build isolated solutions, leading to fragmented behavior, inconsistent processes, and increased operational overhead.
At Netflix, we have developed an innovative architecture for managing data deletion across multiple data stores, effectively addressing these challenges while improving overall system efficiency. The centralized and extensible platform automates the entire lifecycle—from detection and auditing to verification and deletion—while providing observability, journaling, a recovery mechanism, and configurable deletion rate controls to ensure safe and reliable operation at scale.
In this talk, we will present the architectural design and execution strategies behind the system. We will share how we used various technologies to build a reliable and auditable deletion framework, and discuss key engineering tradeoffs, including how we balanced throughput, safety, and achieved scalability across diverse systems, while maintaining operational resilience with live incoming traffic.
By the end of the talk, you will:
- Learn about the challenges of managing deletions across diverse and large-scale operational data stores
- Understand why a standardized approach is critical for reliable and scalable data deletion
- See how central teams can enable deletion through platform-managed orchestration and shared infrastructure
- Explore Netflix’s architectural strategies for solving datastore-specific challenges and scaling deletion workflows
- Gain practical insights from real-world engineering decisions and trade-offs made in production
Speaker

Vidhya Arvind
Tech Lead @Netflix Data Platform, Founding Member of Data Abstractions at Netflix, Previously @Box and @Verizon
Vidhya Arvind is a Tech Lead at Netflix and a founding architect of Netflix’s cutting-edge data abstraction platform. She is a recognized expert in designing and delivering scalable, high-impact data abstractions that empower thousands of developers across the organization to move faster with confidence. With expertise in crafting robust APIs and high-performance abstractions, Vidhya drives the seamless operation of complex abstractions at massive scale. She is known for her strategic thinking, curiosity, and a systems-level mindset that fuels her passion for debugging, innovating, and solving deeply technical challenges. Vidhya has played a pivotal role in shaping the evolution of Netflix's data infrastructure, enabling mission-critical systems to run with exceptional efficiency, reliability, and resilience.
Find Vidhya Arvind at:
Speaker

Shawn Liu
Software Engineer @Netflix
Shawn Liu is a seasoned Software Engineer at Netflix, where he builds highly available consumer identity systems, manages account lifecycles, and supports data deletion at scale. His work powers large-scale backend infrastructure serving over 300 million members worldwide. He brings deep experience in distributed datastores, event-driven architectures, and high-throughput data pipelines, with a focus on building resilient, high-performance systems.