Parting the Clouds: The Rise of Disaggregated Systems

Abstract

Cloud systems are undergoing an architectural shift. Traditional shared-nothing designs struggle to deliver the elasticity, availability, and operational simplicity that the cloud demands. The new generation of cloud systems (Amazon Aurora, Microsoft Socrates, Google AlloyDB, Snowflake, S3 Athena, Neon) embrace disaggregation, and decouple compute, storage, and increasingly, logs. This separation allows independent scaling, faster failover, and shared durable storage across tenants, but it also brings new performance tradeoffs and system design challenges.

In this talk, I will survey why disaggregation matters, what we have learned from the past decade of industrial systems, and where research and practice are headed next. I will explain the economic and operational motivations (elastic scaling, fault isolation, pooling) and discuss how disaggregation reshapes core database components like logging, recovery, and concurrency control. I'll discuss how design choices like "log-as-database", shared-storage replication, and caching tiers affect throughput, latency, and cost.

Finally, I’ll connect these modern architectures back to the timeless ideas in distributed systems, using Lamport's proposers, acceptors, and learners as a way to reason about how we disaggregate coordination, availability, durability, and computation. Viewed through this lens, disaggregation is our new Paxos: an architecture for separating roles so systems can scale, fail, and recover gracefully.


Speaker

Murat Demirbas

Principal Research Scientist @MongoDB Research, Previously Principal Applied Scientist @AWS and a Professor of Computer Science at the University at Buffalo (SUNY)

Murat Demirbas is a Principal Research Scientist at MongoDB Research. Before joining MongoDB, he was a Principal Applied Scientist at AWS for 3 years, and a Professor of Computer Science at the University at Buffalo (SUNY) for 16 years. His work spans distributed systems and databases, with contributions to hybrid logical clocks, WPaxos, PigPaxos, and Paxos Quorum Reads. He received the NSF CAREER Award in 2008 and the UB School of Engineering Senior Researcher of the Year Award in 2016. Murat writes a widely read blog on distributed systems at http://muratbuffalo.blogspot.com, with over 5.6 million views.

Read more

From the same track

Session

How to Build an Exchange: Sub Millisecond Response Times and 24/7 Uptimes in the Cloud

Monday Nov 17 / 10:35AM PST

These days it is possible to achieve fairly good performance on cloud provisioned systems. We discuss the design of a high performance, strongly consistent system which maintains constant service in the face of regular updates to core logic.

Speaker image - Frank Yu

Frank Yu

Director of Engineering @Coinbase, Previously Principal Engineer and Director @FairX

Session

Building Resilient Platforms: Insights from 20+ Years in Mission-Critical Infrastructure

Monday Nov 17 / 02:45PM PST

In this talk, Matthew will describe lessons learned from over 20+ years of building scalable, secure and stable infrastructure platforms for software in financial services (electronic trading, credit card processing etc.), the talk is relevant to anyone building platforms for mission-critic

Speaker image - Matthew Liste

Matthew Liste

Head of Infrastructure @American Express, Previously @JPMorgan Chase and @Goldman Sachs

Session

Unconference: Architectures You've Always Wondered About

Monday Nov 17 / 05:05PM PST

Session

Architecting a Centralized Platform for Data Deletion at Netflix

Monday Nov 17 / 03:55PM PST

What does it take to safely delete data at Netflix scale? In large-scale systems, data deletion cuts across infrastructure, reliability, and performance complexities.

Speaker image - Vidhya Arvind

Vidhya Arvind

Tech Lead & a Founding Architect for the Data Abstraction Platform @Netflix, Previously @Box and @Verizon

Speaker image - Shawn Liu

Shawn Liu

Senior Software Engineer @Netflix, Building Reliable and Extensible Systems for Consumer Data Lifecycle at Scale

Session

Compiling Workflows into Databases: The Architecture That Shouldn't Work (But Does)

Monday Nov 17 / 11:45AM PST

What if everything you know about building distributed systems is backwards?

Speaker image - Jeremy Edberg

Jeremy Edberg

CEO of DBOS, Creator of Chaos Engineering, Tech Editor for 'AWS for Dummies'; Previously Founding Reliability Engineer @Netflix, and First Engineer @Reddit

Speaker image - Qian Li

Qian Li

Co-founder, Architect @DBOS, Stanford CS Ph.D., Co-organizer of South Bay Systems