Inference
Session
AI/ML
One Platform to Serve Them All: Autoscaling Multi-Model LLM Serving
Wednesday Nov 19 / 10:35AM PST
AI teams are moving away from hosted LLM APIs to self-hosted inference as fine-tuning drives model performance. The catch is scale: hundreds of model variants create long-tail traffic, cold starts, and duplicated serving stacks.
Meryem Arik
Co-Founder and CEO @Doubleword (Previously TitanML), Recognized as a Technology Leader in Forbes 30 Under 30, Recovering Physicist