In late 2024, Netflix made a bet: consolidate the vast majority of our relational database use cases onto a single engine: Amazon Aurora PostgreSQL. This talk uses Netflix’s Aurora PostgreSQL consolidation as a case study in platform engineering: how we turned risky, bespoke database migrations into a repeatable internal platform capability.

Over 12 months, we migrated 100+ unique workloads off a third-party, distributed, PostgreSQL-compatible store and onto Aurora PostgreSQL. The workloads varied widely in size, query patterns, availability requirements, and team risk tolerance, including use cases with tens of terabytes of data and business-critical traffic. To make this tractable, we built a migration platform that standardized the common path while preserving escape hatches for the hard cases: automated data movement, byte-for-byte validation, resumable workflows and transparent cutovers with only single-digit minutes of write downtime.

We’ll cover both the technical and organizational sides of the effort: how we minimized application-team churn, built confidence in our tooling, worked with external and internal partners, and coordinated a high-risk infrastructure change across hundreds of stakeholders without turning the platform team into a blocking approval gate.

Takeaways:

How to turn a large migration program into an internal platform capability with reusable tooling, standardized workflows, and clear escape hatches.
How to migrate business-critical relational workloads with near-zero downtime while keeping application-team effort low.
How to build trust in migration tooling through validation harnesses, byte-for-byte correctness checks, resumable execution, and cutovers.
How to make build-vs-buy decisions when vendor tools and internal platform requirements do not fully overlap.
How to coordinate a high-risk infrastructure change across many teams without adding overhead for developer teams.

From the same track

Session

Why Most Platform Teams Fail: The Adoption Problem Nobody Wants to Own

We have all seen the moment: the platform goes live, the launch deck looks sharp, the portal is polished, the golden paths are documented, and yet teams quietly continue doing things the old way. Not always because the platform is bad, but because adoption was assumed, not owned.

Shweta Vohra

Architecture Leader @Booking.com, Author of "Decoding Platform Engineering Patterns" & "Dear Software and AI Architect", 24+ Years Experience Building Cloud, Platform, and AI Systems

Session

Platform Engineering’s Second Act: From Vending Machine to Passport Control

Three years ago on the QCon SF stage, I made the case for “Acceleration, Autonomy, and Accountability” as the pillars of a successful platform. Those pillars haven't moved. AI has just rewritten what each one requires, and the platform team's job along with it.

Smruti Patel

SVP of Engineering @apollographql

Alex Mann

Senior Engineering Manager @apollographql

Session

The Rise of Agent Enablement: Coding Agents Don't Scale Themselves. Neither Do Your Teams.

In 2009, every big enterprise said "continuous delivery won't work here." In 2026, the same rooms say "the dark factory won't work here." It was never about the technology being ready. It's about the organization being ready. Readiness is a socio-technical problem, not a technical one.

Patrick Debois

AI Product Engineer @Tessl, Co-Author of the "DevOps Handbook", Content Curator at AI Native Developer Community

Building a Migration Platform: Moving 100+ Netflix RDBMS Workloads to Aurora PostgreSQL

Abstract

Takeaways:

Date

Track

Share

From the same track

Why Most Platform Teams Fail: The Adoption Problem Nobody Wants to Own

Platform Engineering’s Second Act: From Vending Machine to Passport Control

The Rise of Agent Enablement: Coding Agents Don't Scale Themselves. Neither Do Your Teams.

Follow QCon

Contact

Menu

Conferences around the World