You are viewing content from a past/completed QCon

Presentation: Netflix Play API - An Evolutionary Architecture

Track: Architectures You've Always Wondered About

Location: Ballroom A

Duration: 1:40pm - 2:30pm

Day of week: Tuesday

Level: Intermediate

Persona: Architect, Developer

Share this on:

This presentation is now available to view on InfoQ.com

Watch video with transcript

What You’ll Learn

  • Learn how to make decisions in your architectural design using the Type1/Type2 decision framework.

  • Understand how to ensure original architectural principles and decisions are maintained as architecture evolves.

  • Hear why evolvability and identity act as top guiding principles in the success of your architecture

Abstract

The Play API Service is responsible for orchestrating all playback workflows whenever a user watches a title on Netflix. If the Play API service is down, Netflix is down. In its seven-year journey, the service has gone through three major re-architectures: Take-1 began with a traditional monolith where the API and its associated business functions were all part of the same service. Take-2 was about decomposing the monolith into a few key microservices - but, in the process, we inadvertently built a distributed monolith with tight coupling and thick client libraries. With Take-3, our goal is to break away from a distributed monolith and to build an evolutionary microservice architecture which puts change-driven-design first above all other principles.
This talk will deep dive into how we used a set of three core foundational principles to iteratively develop our architecture. Along with providing the necessary motivation behind our goals, I will specifically talk about what patterns we observed in our previous architectures and how we arrived at a list of practices to create an Evolutionary Architecture. If you are building a new set of services or considering a major re-architecture, this talk provides you with a framework to reason about your use-cases.
This talk is intended for engineers with experience in service-based architectures
Key Takeaways:
  • Three foundational principles that can guide you in designing a microservice architecture:
    • Why focussing on “identity” lays the foundation for your architectural design
    • Why it is important to identify Type1/Type2 decisions
    • Why evolvability is crucial, specifically when it comes to “known unknowns”!
  • Concrete lessons learned around:
    • Different types of coupling and its limitations
    • How we can choose a mix of async/sync architectures to unlock evolvability
    • Ensuring we are not building a Data Monolith
Question: 

QCon: What is the focus of your work today?

Answer: 

Suudhan: I work on the Playback API team whose identity is to deliver Playback Functionality 24/7. I, along with a team of stunning colleagues, own and operate the critical Play API service which orchestrates playback functions like deciding the best playback experience, authorize every playback and collect playback data for business intelligence. For the past 2 years, i lead the initiative to re-architect this service to significantly improve our scalability, availability and developer velocity.

Question: 

QCon: What’s the motivation for this talk?

Answer: 

Suudhan: Our API service have gone through 3 architectures. With Netflix scale, we often hit limitations of our architecture every 3 or so years. In this third iteration, we have architected a solution to optimize for evolvability. We fully expect things to change in 3-5 years time and we want an architecture in which each aspect of the architecture can be replaced with minimal overhead. My motivation for the talk is to share our learnings with this architecture and have an exchange of ideas with the qCon audience. I am particularly interested in hearing opinions about aspects which we might have overlooked or how some elements of our architecture can be applied to different domains.

Question: 

QCon: How you you describe the persona and level of the target audience?

Answer: 

Suudhan: Target audience is a senior software engineer who has had some experience with owning and operating services. The talk will appeal to persons who are thinking about re-architecting any part of their services; or starting afresh on a new architecture.

Question: 

What technology problem keeps you up at night?

Answer: 

Suudhan: These are some of the problems i am thinking about:

1) How to convert real-time data services to materialized views to significantly improve our throughput and reduce point of failures in our critical systems.

2) How to isolate request-response style APIs from fire and forget ones to optimize for different attributes (like availability for the former and consistency for the latter)

3) When we see an availability drop, how to shed load by smartly degrading our user experience till the issue is resolved

Speaker: Suudhan Rangarajan

Senior Software Engineer @Netflix

Suudhan Rangarajan works on the Playback API team at Netflix, responsible for ensuring that customers receive the best possible playback experience every time they click play. A few dozen playback microservices fill a key role in enabling Netflix to stream amazing content to 125M+ members on 1000+ devices. Prior to Netflix, Suudhan worked on the Audio/Video decoding pipeline in Adobe Flash and Adobe Primetime products helping many partners create a great video streaming client. Suudhan enjoys developing and running large scale distributed services, with a special focus on Netflix Playback API architecture and design. Suudhan has received a Masters in Computer Science from University of Texas at Austin.

Find Suudhan Rangarajan at

Tracks

  • Optimizing Yourself: Human Skills for Individuals

    Better teams start with a better self. Learn practical skills for IC.

  • Modern Data Architectures

    Today’s systems move huge volumes of data. Hear how places like LinkedIn, Facebook, Uber and more built their systems and learn from their mistakes.

  • Practices of DevOps & Lean Thinking

    Practical approaches using DevOps and a lean approach to delivering software.

  • Microservices Patterns & Practices

    What's the last mile for deploying your service? Learn techniques from the world's most innovative shops on managing and operating Microservices at scale.

  • Bare Knuckle Performance

    Killing latency and getting the most out of your hardware

  • Architectures You've Always Wondered About

    Next-gen architectures from the most admired companies in software, such as Netflix, Google, Facebook, Twitter, & more

  • Machine Learning for Developers

    AI/ML is more approachable than ever. Discover how deep learning and ML is being used in practice. Topics include: TensorFlow, TPUs, Keras, PyTorch & more. No PhD required.

  • Production Readiness: Building Resilient Systems

    Making systems resilient involves people and tech. Learn about strategies being used from chaos testing to distributed systems clustering.

  • Regulation, Risk and Compliance

    With so much uncertainty, how do you bulkhead your organization and technology choices? Learn strategies for dealing with uncertainty.

  • Languages of Infrastructure

    This track explores languages being used to code the infrastructure. Expect practices on toolkits and languages like Cloudformation, Terraform, Python, Go, Rust, Erlang.

  • Building & Scaling High-Performing Teams

    To have a high-performing team, everybody on it has to feel and act like an owner. Organizational health and psychological safety are foundational underpinnings to support ownership.

  • Evolving the JVM

    The JVM continues to evolve. We’ll look at how things are evolving. Covering Kotlin, Clojure, Java, OpenJDK, and Graal. Expect polyglot, multi-VM, performance, and more.

  • Trust, Safety & Security

    Privacy, confidentiality, safety and security: learning from the frontlines.

  • JavaScript & Transpiler/WebAssembly Track

    JavaScript is the language of the web. Latest practices for JavaScript development in and how transpilers are affecting the way we work. We’ll also look at the work being done with WebAssembly.

  • Living on the Edge: The World of Edge Compute From Device to Application Edge

    Applied, practical & real-world deep-dive into industry adoption of OS, containers and virtualization, including Linux on.

  • Software Supply Chain

    Securing the container image supply chain (containers + orchestration + security + DevOps).

  • Modern CS in the Real World

    Thoughts pushing software forward, including consensus, CRDT's, formal methods & probabilistic programming.

  • Tech Ethics: The Intersection of Human Welfare & STEM

    What does it mean to be ethical in software? Hear how the discussion is evolving and what is being said in ethics.