LLM accuracy is a challenging topic to address and is much more multi dimensional than a simple accuracy score. In this talk we’ll dive deeper into how to measure LLM related metrics, going through examples, case studies and techniques beyond just a single accuracy and score. We’ll discuss how to create, track and revise micro LLM metrics to have granular direction for improving LLM models.
Speaker
![](https://qconsf.com/sites/qcon_sf/files/styles/medium/public/pictures/2024-07/unnamed%20%282%29.png?itok=cDq4Qnq8)
Denys Linkov
Head of ML @Voiceflow, Linkedin Learning Instructor
Denys leads Enterprise AI at Voiceflow, is a ML Startup Advisor and Linkedin Learning Course Instructor. He's worked with 50+ enterprises in their conversational AI journey, and his Gen AI courses have helped 150,000+ learners build key skills. He's worked across the AI product stack, being hands-on building key ML systems, managing product delivery teams, and working directly with customers on best practices.