Ray: The Next Generation Compute Runtime for ML Applications

Ray is an open source project that makes it simple to scale any compute-intensive Python workload. Industry leaders like Uber, Shopify, Spotify are building their next generation ML platforms on top of Ray. Ray is equipped with a powerful distributed scheduling mechanism which launches stateful Actors and stateless Tasks in a much more granular and lightweight fashion than existing frameworks. Meanwhile it also has an embedded distributed in-memory object store to drastically reduce data exchange overhead. These architectural advantages make Ray the ideal compute substrate for cutting-edge ML use cases including Graph Neural Networks, Online Learning, Reinforcement Learning, and so forth.

This talk will introduce the basic API and architectural concepts of Ray, as well as diving deeper into some of its innovative ML use cases.


Speaker

Zhe Zhang

Head of Open Source Engineering @anyscalecompute, Previously Hadoop/Spark infra Team Manager @LinkedIn

Zhe is currently Head of Open Source Engineering (Ray.io project) at Anyscale. Before Anyscale, Zhe spent 4.5 years at LinkedIn where he managed the Hadoop/Spark infra team. Zhe has been working on open source for about 10 years; he's a committer and PMC member of the Apache Hadoop project, and a member of the Apache Software Foundation.

Read more
Find Zhe Zhang at:

Date

Monday Oct 24 / 10:35AM PDT ( 50 minutes )

Location

Pacific DEKJ

Track

MLOps

Topics

Machine Learning Open Source Graph Neural Networks Online Learning Reinforcement Learning

Share

From the same track

Session Machine Learning

Fabricator: End-to-End Declarative Feature Engineering Platform

Monday Oct 24 / 11:50AM PDT

At Doordash, the last year has seen a surge in applications of machine learning to various product verticals in our growing business. However, with this growth, our data scientists have had increasing bottlenecks in their development cycle because of our existing feature engineering process.

Speaker image - Kunal Shah

Kunal Shah

ML Platform Engineering Manager @DoorDash, Previously ML Platforms & Data Engineering frameworks @Airbnb & @YouTube

Session Machine Learning

An Open Source Infrastructure for PyTorch

Monday Oct 24 / 01:40PM PDT

In this talk we’ll go over tools and techniques to deploy PyTorch in production. The PyTorch organization maintains and supports open source tools for efficient inference like pytorch/serve, job management pytorch/torchx and streaming datasets like pytorch/data.

Speaker image - Mark Saroufim

Mark Saroufim

Applied AI Engineer @Meta

Session Machine Learning

Real-Time Machine Learning: Architecture and Challenges

Monday Oct 24 / 02:55PM PDT

Fresh data beats stale data for machine learning applications. This talk discusses the value of fresh data as well as different types of architecture and challenges of online prediction.  

Speaker image - Chip Huyen

Chip Huyen

Co-founder @Claypot AI, previously @Snorkel Ai & @NVIDIA

Session Machine Learning

Declarative Machine Learning: A Flexible, Modular and Scalable Approach for Building Production ML Models

Monday Oct 24 / 04:10PM PDT

Building ML solutions from scratch is challenging because of a variety of reasons: the long development cycles of writing low level machine learning code and the fast pace of state-of-the-art ML methods to name a few.

Speaker image - Shreya Rajpal

Shreya Rajpal

Founder @Guardrails AI, Experienced ML Practitioner with a Decade of Experience in ML Research, Applications and Infrastructure

Session

Unconference: MLOps

Monday Oct 24 / 05:25PM PDT

What is an unconference? At QCon SF, we’ll have unconferences in most of our tracks.

Speaker image - Shane Hastie

Shane Hastie

Global Delivery Lead for SoftEd and Lead Editor for Culture & Methods at InfoQ.com