Chronon - Airbnb’s End-to-End Feature Platform

ML models typically use upwards of 100 features to generate a single prediction. As a result, the number of data pipelines explodes and request fanout during prediction is high. On top of that, feature schemas evolve with every model iteration, and managing them manually becomes cumbersome. Models can also fail silently when input data shifts, making observability challenging. To make matters worse, for use cases that involve ranking, per-inference latency budgets are extremely tight.

In this presentation we will talk about Airbnb’s Feature Platform, focusing on the recent efforts to solve the challenges mentioned above. Specifically, we will cover four areas in detail: core APIs, training data generation, feature serving, and feature observability.

Main Takeaways

In this presentation we will cover the API and architecture of Airbnb’s Feature Platform, with special focus on the following aspects:

  1. Training data generation - full support for the entire training data generation pipeline.
    1. Feature bootstrap, label computation, and training set generation at large scale. We covered point-in-time feature backfill in the past and will introduce it only briefly here, focusing on the other areas.
  2. Feature serving - full support for advanced feature computation.
    1. Feature derivations, feature chaining, and external and contextual feature support. We will describe how these work in batch, streaming, and application serving environments (a sketch of how such features are declared follows this list).
  3. Feature observability - pre-training and post-productionization monitoring for data failures.
    1. Online/offline consistency, training data health metrics, and feature & prediction drift.
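
To make these capabilities concrete, here is a minimal, hypothetical sketch of how a windowed aggregation feature and a point-in-time training set join might be declared, in the spirit of Chronon's open-source Python API. The module paths, class names (GroupBy, Aggregation, Join, JoinPart, EventSource, Query), table names, and parameter names below are assumptions based on the public project and may differ from what is shown in the talk.

```python
# Hypothetical sketch in the spirit of Chronon's Python API; import paths,
# table names, and parameter names are assumptions and may not match the
# library or the talk exactly.
from ai.chronon.api.ttypes import EventSource, Source, Window, TimeUnit
from ai.chronon.query import Query, select
from ai.chronon.group_by import GroupBy, Aggregation, Operation
from ai.chronon.join import Join, JoinPart

# Raw purchase events, keyed by user and timestamped for point-in-time correctness.
purchase_source = Source(
    events=EventSource(
        table="data.purchases",  # assumed event table name
        query=Query(
            selects=select("user_id", "purchase_price"),
            time_column="ts",
        ),
    )
)

# Windowed aggregations over the event stream: the same definition can be
# backfilled in batch and kept fresh via streaming for low-latency serving.
purchase_features = GroupBy(
    sources=[purchase_source],
    keys=["user_id"],
    aggregations=[
        Aggregation(
            input_column="purchase_price",
            operation=Operation.SUM,
            windows=[Window(length=7, timeUnit=TimeUnit.DAYS)],
        ),
        Aggregation(input_column="purchase_price", operation=Operation.COUNT),
    ],
)

# A join stitches feature values onto left-side events as of each event's
# timestamp, producing a point-in-time-correct training set.
training_set = Join(
    left=purchase_source,  # in practice, a labelled event source
    right_parts=[JoinPart(group_by=purchase_features)],
)
```

In Chronon's model, a single definition like this is meant to drive both the offline backfills that produce training data and the online store used at serving time, which is what makes online/offline consistency a measurable property.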

Speaker

Nikhil Simha

Author of "Chronon Feature Platform", Previously Built Stream Processing Infra @Meta and NLP Systems @Amazon & @Walmartlabs

Nikhil Simha is a Staff Software Engineer on the Machine Learning infrastructure team at Airbnb. He is currently working on Chronon, an end-to-end feature engineering platform. Prior to Airbnb, he was a founding engineer on the stream processing team at Facebook, where he built a scheduler (Turbine, ICDE '20) and a stream processing framework (RealTime Data @ FB, SIGMOD '16). He is interested in the intersection of compilers, machine learning, and real-time data processing systems. Nikhil got his Bachelor's degree in Computer Science from the Indian Institute of Technology, Bombay. When not working, he likes to walk with his dog Leela.


Date

Tuesday Oct 3 / 10:35AM PDT (50 minutes)

Location

Seacliff ABC

Topics

AI/ML, Architecture, Stream Processing, Big Data


From the same track

Session AI/ML

Defensible Moats: Unlocking Enterprise Value with Large Language Models

Tuesday Oct 3 / 11:45AM PDT

Building LLM-powered applications using APIs alone poses significant challenges for enterprises. These challenges include data fragmentation, the absence of a shared business vocabulary, privacy concerns regarding data, and diverse objectives among data and ML users.

Nischal HP

Vice President of Data Science @Scoutbee, Decade of Experience Building Enterprise AI

Session Distributed Computing

Modern Compute Stack for Scaling Large AI/ML/LLM Workloads

Tuesday Oct 3 / 01:35PM PDT

Advanced machine learning (ML) models, particularly large language models (LLMs), require scaling beyond a single machine.

Jules Damji

Lead Developer Advocate @Anyscale, MLflow Contributor, and Co-Author of "Learning Spark"

Session AI/ML

Generative Search: Practical Advice for Retrieval Augmented Generation (RAG)

Tuesday Oct 3 / 02:45PM PDT

In this presentation, we will delve into the world of Retrieval Augmented Generation (RAG) and its significance for Large Language Models (LLMs) like OpenAI's GPT4. With the rapid evolution of data, LLMs face the challenge of staying up-to-date and contextually relevant.

Sam Partee

Principal Engineer @Redis

Session AI/ML

Building Guardrails for Enterprise AI Applications W/ LLMs

Tuesday Oct 3 / 05:05PM PDT

Large Language Models (LLMs) such as ChatGPT have revolutionized AI applications, offering unprecedented potential for complex real-world scenarios. However, fully harnessing this potential comes with unique challenges such as model brittleness and the need for consistent, accurate outputs.

Shreya Rajpal

Founder @Guardrails AI, Experienced ML Practitioner with a Decade of Experience in ML Research, Applications and Infrastructure

Session

Unconference: Modern ML

Tuesday Oct 3 / 03:55PM PDT

What is an unconference? An unconference is a participant-driven meeting. Attendees come together, bringing their challenges and relying on the experience and know-how of their peers for solutions.