You are viewing content from a past/completed conference.

Optimizing Custom Workloads with RISC-V

Abstract

This talk will explore how RISC-V architecture can accelerate custom workloads, focusing on AI/ML applications. We’ll start by examining the RISC-V ecosystem and its increasing relevance in the software development landscape. By looking at OpenBLAS, a highly optimized linear algebra library, we'll demonstrate how it's been optimized for RISC-V hardware, leading to significant performance gains in AI/ML tasks.

We’ll also highlight the ongoing work at RISC-V International on the matrix multiply extension, which promises to further enhance performance for these critical operations. Through practical examples, the talk will highlight why RISC-V is a strong candidate for optimizing workloads requiring high computational efficiency.

This session will offer valuable insights into how RISC-V can meet demanding performance needs while providing flexibility for custom workload optimization.

Speaker

Ludovic Henry

Member of Technical Staff @Rivos, Performance-Minded Engineer, Hardware & Software, Previously @Xamarin, @Microsoft, @Datadog

Ludovic is currently working at Rivos, a RISC-V hardware company, focused on accelerating AI/ML and Data Analytics workloads. His focus is on ensuring software stacks like Java, Python, Go, and various System Librariers are optimal in their usage of the hardware, as well as making sure the right hardware gets developed to accelerate these workloads.

His passion for hardware has developed over the past decade where he had the chance to work on multiple runtime technologies: OpenJDK, Go, Mono, .NET, Ruby, and more. These experiences have taught him how important hardware is to the performance of software, and how to make best use of it.

Speaker

Ludovic Henry

Member of Technical Staff @Rivos, Performance-Minded Engineer, Hardware & Software, Previously @Xamarin, @Microsoft, @Datadog

From the same track

Session Hybrid cloud

Evaluating and Deploying State-of-the-Art Hardware to Meet the Challenges of Modern Workloads

Wednesday Nov 20 / 01:35PM PST

At GEICO we are on a journey to entirely modernize our Infrastructure. We are building an open-source, cloud-agnostic hybrid stack to run across public and on prem private cloud infrastructure without having to expose vendor specific stacks to our application developers.

Rebecca Weekly

VP of Infrastructure @GEICO

Session AI HW/SW optimization

Maximizing Deep Learning Performance on CPUs using Modern Architectures

Wednesday Nov 20 / 11:45AM PST

As deep learning continues to drive advancements across various industries, efficiently navigating the landscape of specialized AI hardware has a huge impact on cost and speed of operation.

Bibek Bhattarai

AI Technical Lead @Intel, Computer Scientist Invested in Hardware-Software Optimization, Building Scalable Data Analytics, Mining, and Learning Systems

Session

High-Resolution Platform Observability

Wednesday Nov 20 / 03:55PM PST

Many observability tools fail to provide us with the relevant insights for understanding hardware health and utilization.

Brian Martin

Co-founder and Software Engineer @IOP Systems, Focused on High-Performance Software and Systems, Previously @Twitter

Session AI/ML

Unleashing Llama's Potential: CPU-Based Fine-Tuning

Wednesday Nov 20 / 10:35AM PST

Generative AI landscape is rapidly changing as new models are appearing in horizon every few days. However, the hardware and software characteristics of these models have many similar patterns and execution phases.

Anil Rajput

AMD Fellow, Software System Design Eng. Java Committee Chair @SPEC, Architected Industry Standard Benchmarks and Authored Best Practices Guides for Platform Engineering and Cloud

Rema Hariharan

Principal Engineer @AMD, Seasoned Performance Engineer With a Base in Quantitative Sciences and a Penchant for Root-Causing

Optimizing Custom Workloads with RISC-V

Abstract

Speaker

Ludovic Henry

Find Ludovic Henry at:

Speaker

Ludovic Henry

Date

Location

Track

Topics

Share

From the same track

Evaluating and Deploying State-of-the-Art Hardware to Meet the Challenges of Modern Workloads

Maximizing Deep Learning Performance on CPUs using Modern Architectures

High-Resolution Platform Observability

Unleashing Llama's Potential: CPU-Based Fine-Tuning

Follow QCon

Contact

Menu

Conferences around the World