You are viewing content from a past/completed QCon

Presentation: Netflix Play API - An Evolutionary Architecture

Track: Architectures You've Always Wondered About

Location: Ballroom A

Duration: 1:40pm - 2:30pm

Day of week: Tuesday

Level: Intermediate

Persona: Architect, Developer

Share this on:

This presentation is now available to view on InfoQ.com

Watch video with transcript

What You’ll Learn

  • Learn how to make decisions in your architectural design using the Type1/Type2 decision framework.

  • Understand how to ensure original architectural principles and decisions are maintained as architecture evolves.

  • Hear why evolvability and identity act as top guiding principles in the success of your architecture

Abstract

The Play API Service is responsible for orchestrating all playback workflows whenever a user watches a title on Netflix. If the Play API service is down, Netflix is down. In its seven-year journey, the service has gone through three major re-architectures: Take-1 began with a traditional monolith where the API and its associated business functions were all part of the same service. Take-2 was about decomposing the monolith into a few key microservices - but, in the process, we inadvertently built a distributed monolith with tight coupling and thick client libraries. With Take-3, our goal is to break away from a distributed monolith and to build an evolutionary microservice architecture which puts change-driven-design first above all other principles.
This talk will deep dive into how we used a set of three core foundational principles to iteratively develop our architecture. Along with providing the necessary motivation behind our goals, I will specifically talk about what patterns we observed in our previous architectures and how we arrived at a list of practices to create an Evolutionary Architecture. If you are building a new set of services or considering a major re-architecture, this talk provides you with a framework to reason about your use-cases.
This talk is intended for engineers with experience in service-based architectures
Key Takeaways:
  • Three foundational principles that can guide you in designing a microservice architecture:
    • Why focussing on “identity” lays the foundation for your architectural design
    • Why it is important to identify Type1/Type2 decisions
    • Why evolvability is crucial, specifically when it comes to “known unknowns”!
  • Concrete lessons learned around:
    • Different types of coupling and its limitations
    • How we can choose a mix of async/sync architectures to unlock evolvability
    • Ensuring we are not building a Data Monolith
Question: 

QCon: What is the focus of your work today?

Answer: 

Suudhan: I work on the Playback API team whose identity is to deliver Playback Functionality 24/7. I, along with a team of stunning colleagues, own and operate the critical Play API service which orchestrates playback functions like deciding the best playback experience, authorize every playback and collect playback data for business intelligence. For the past 2 years, i lead the initiative to re-architect this service to significantly improve our scalability, availability and developer velocity.

Question: 

QCon: What’s the motivation for this talk?

Answer: 

Suudhan: Our API service have gone through 3 architectures. With Netflix scale, we often hit limitations of our architecture every 3 or so years. In this third iteration, we have architected a solution to optimize for evolvability. We fully expect things to change in 3-5 years time and we want an architecture in which each aspect of the architecture can be replaced with minimal overhead. My motivation for the talk is to share our learnings with this architecture and have an exchange of ideas with the qCon audience. I am particularly interested in hearing opinions about aspects which we might have overlooked or how some elements of our architecture can be applied to different domains.

Question: 

QCon: How you you describe the persona and level of the target audience?

Answer: 

Suudhan: Target audience is a senior software engineer who has had some experience with owning and operating services. The talk will appeal to persons who are thinking about re-architecting any part of their services; or starting afresh on a new architecture.

Question: 

What technology problem keeps you up at night?

Answer: 

Suudhan: These are some of the problems i am thinking about:

1) How to convert real-time data services to materialized views to significantly improve our throughput and reduce point of failures in our critical systems.

2) How to isolate request-response style APIs from fire and forget ones to optimize for different attributes (like availability for the former and consistency for the latter)

3) When we see an availability drop, how to shed load by smartly degrading our user experience till the issue is resolved

Speaker: Suudhan Rangarajan

Senior Software Engineer @Netflix

Suudhan Rangarajan works on the Playback API team at Netflix, responsible for ensuring that customers receive the best possible playback experience every time they click play. A few dozen playback microservices fill a key role in enabling Netflix to stream amazing content to 125M+ members on 1000+ devices. Prior to Netflix, Suudhan worked on the Audio/Video decoding pipeline in Adobe Flash and Adobe Primetime products helping many partners create a great video streaming client. Suudhan enjoys developing and running large scale distributed services, with a special focus on Netflix Playback API architecture and design. Suudhan has received a Masters in Computer Science from University of Texas at Austin.

Find Suudhan Rangarajan at

Proposed Tracks

  • Human Systems: Hacking the Org

    Power of leadership, Engineering Metrics and strategies for shaping the org for velocity.

  • Building High-Performing Teams

    Building, maintaining, and growing a team balanced for skills and aptitudes. Constraint theory, systems thinking, lean, hiring/firing and performance improvement

  • Software Defined Infrastructure: Kubernetes, Service Meshes & Beyond

    Deploying, scaling and managing your services is undifferentiated heavy lifting. Hear stories, learn techniques and dive deep into what it means to code your infrastructure.

  • Practices of DevOps & Lean Thinking

    Practical approaches using DevOps and a lean approach to delivering software.

  • Operationalizing Microservices: Design, Deliver, Operate

    What's the last mile for deploying your service? Learn techniques from the world's most innovative shops on managing and operating Microservices at scale.

  • Developer Experience: Level up your Engineering Effectiveness

    Improving the end to end developer experience - design, dev, test, deploy and operate/understand.

  • Architectures You've Always Wondered About

    Next-gen architectures from the most admired companies in software, such as Netflix, Google, Facebook, Twitter, & more

  • Machine Learning without a PhD

    AI/ML is more approachable than ever. Discover how deep learning and ML is being used in practice. Topics include: TensorFlow, TPUs, Keras, PyTorch & more. No PhD required.

  • Production Readiness: Building Resilient Systems

    Making systems resilient involves people and tech. Learn about strategies being used from chaos testing to distributed systems clustering.

  • Building Predictive Data Pipelines

    From personalized news feeds to engaging experiences that forecast demand: learn how innovators are building predictive systems in modern application development.

  • Modern Languages: The Right Language for the Job

    We're polyglot developers. Learn languages that excel at very specific tasks and remove undifferentiated heavy lifting at the language level.

  • Delivering on the Promise of Containers

    Runtime containers, libraries and services that power microservices.

  • Evolving Java & the JVM

    6 month cadence, cloud-native deployments, scale, Graal, Kotlin, and beyond. Learn how the role of Java and the JVM is evolving.

  • Trust, Safety & Security

    Privacy, confidentiality, safety and security: learning from the frontlines.

  • Beyond the Web: What’s Next for JavaScript

    JavaScript is the language of the web. Latest practices for JavaScript development in and out of the browser topics: react, serverless, npm, performance, & less traditional interfaces.

  • Modern Operating Systems

    Applied, practical & real-world deep-dive into industry adoption of OS, containers and virtualization, including Linux on.

  • Optimizing You: Human Skills for Individuals

    Better teams start with a better self. Learn practical skills for IC.

  • Modern CS in the Real World

    Thoughts pushing software forward, including consensus, CRDT's, formal methods & probabilistic programming.