Optimizing Custom Workloads with RISC-V

This talk will explore how RISC-V architecture can accelerate custom workloads, focusing on AI/ML applications. We’ll start by examining the RISC-V ecosystem and its increasing relevance in the software development landscape. By looking at OpenBLAS, a highly optimized linear algebra library, we'll demonstrate how it's been optimized for RISC-V hardware, leading to significant performance gains in AI/ML tasks.

We’ll also highlight the ongoing work at RISC-V International on the matrix multiply extension, which promises to further enhance performance for these critical operations. Through practical examples, the talk will highlight why RISC-V is a strong candidate for optimizing workloads requiring high computational efficiency.

This session will offer valuable insights into how RISC-V can meet demanding performance needs while providing flexibility for custom workload optimization.


Speaker

Ludovic Henry

Member of Technical Staff @Rivos, Performance-Minded Engineer, Hardware & Software, Previously @Xamarin, @Microsoft, @Datadog

Ludovic is currently working at Rivos, a RISC-V hardware company, focused on accelerating AI/ML and Data Analytics workloads. His focus is on ensuring software stacks like Java, Python, Go, and various System Librariers are optimal in their usage of the hardware, as well as making sure the right hardware gets developed to accelerate these workloads.

His passion for hardware has developed over the past decade where he had the chance to work on multiple runtime technologies: OpenJDK, Go, Mono, .NET, Ruby, and more. These experiences have taught him how important hardware is to the performance of software, and how to make best use of it.

Read more
Find Ludovic Henry at:

From the same track

Session

Evaluating and Deploying State-of-the-Art Hardware to Meet the Challenges of Modern Workloads

Wednesday Nov 20 / 01:35PM PST

At GEICO we are on a journey to entirely modernize our Infrastructure. We are building an open-source, cloud-agnostic hybrid stack to run across public and on prem private cloud infrastructure without having to expose vendor specific stacks to our application developers.

Speaker image - Rebecca Weekly

Rebecca Weekly

VP of Infrastructure @GEICO

Session

Maximizing Deep Learning Performance: Hardware and Software Innovations for Optimizing AI Workloads

Wednesday Nov 20 / 10:35AM PST

As deep learning continues to drive advancements across various industries, efficiently navigating the landscape of specialized AI hardware has huge impact in cost and speed of operation.

Speaker image - Bibek Bhattarai

Bibek Bhattarai

AI Technical Lead @Intel, Computer Scientist Invested in Hardware-Software Optimization, Building Scalable Data Analytics, Mining, and Learning Systems

Session

High-Resolution Platform Observability

Wednesday Nov 20 / 02:45PM PST

Many observability tools fail to provide us with the relevant insights for understanding hardware health and utilization.

Speaker image - Brian Martin

Brian Martin

Co-founder and Software Engineer @IOP Systems, Focused on High-Performance Software and Systems, Previously @Twitter

Session

Unleashing Llama's Potential: CPU-Based Fine-Tuning

Wednesday Nov 20 / 03:55PM PST

Details coming soon.

Speaker image - Anil Rajput

Anil Rajput

AMD Fellow, Software System Design Eng. Java Committee Chair @SPEC, Architected Industry Standard Benchmarks and Authored Best Practices Guides for Platform Engineering and Cloud

Speaker image - Dr. Rema Hariharan

Dr. Rema Hariharan

PRincipal Engineer @AMD, Seasoned Performance Engineer With a Base in Quantitative Sciences and a Penchant for Root-Causing