Speaker: Cody Yu

Staff Software Engineer and Tech Lead @Anyscale, Ex-Amazonian, vLLM Committer, Apache TVM PMC

Cody Yu is a staff software engineer and a tech lead at Anyscale, working on LLM inference performance optimization. He is a community member of various popular open source projects such as vLLM, SGLang and Apache TVM. Before Anyscale, Cody was a founding engineer at BosonAI, as well as a senior applied scientist at AWS AI. His recent research is in hardware acceleration and performance optimization for LLM systems.

Session

Scale Out Batch Inference with Ray

As AI technologies continue to evolve, the demand for processing both structured and unstructured data across diverse industries is rapidly growing.
