Speaker: Cody Yu
Staff Software Engineer and Tech Lead @Anyscale, Ex-Amazonian, vLLM Committer, Apache TVM PMC
Cody Yu is a staff software engineer and a tech lead at Anyscale, working on LLM inference performance optimization. He is a community member of various popular open source projects such as vLLM, SGLang and Apache TVM. Before Anyscale, Cody was a founding engineer at BosonAI, as well as a senior applied scientist at AWS AI. His recent research is in hardware acceleration and performance optimization for LLM systems.
Find Cody Yu at:
Session
Scale Out Batch Inference with Ray
As AI technologies continue to evolve, the demand for processing both structured and unstructured data across diverse industries is rapidly growing.