| Kite - Bootstrap Admin Template

Linaro Connect 2025

LIS25-107 Making AI Workloads on ARM Boring with RamaLama

Technical presentation - 30 minutes (including q&a)

Ai machine learning

  • Wednesday, 14 May 10:30 - 10:55
  • Room: Session room 3 | Opala III

Running AI workloads on ARM can be challenging due to the need for specialized hardware configurations, runtime optimizations, and dependency management. However, with the rise of tools like RamaLama, krunkit, libkrun, podman-machine, llama.cpp and vllm, developers can now deploy and manage AI models on ARM platforms with ease. In this talk, we’ll showcase how these tools come together to simplify AI development and deployment on ARM systems, focusing on practical workflows for AI environments. RamaLama, an open-source framework, streamlines AI model management by leveraging container technology (podman, docker), providing seamless integration with registries (ollama, hugging face, oci), and supporting AI runtimes optimized for ARM (llama.cpp and vllm). We’ll explore how podman-machine and Vulkan enable performant workloads on ARM GPUs. Through live demonstrations and examples, attendees will see how to: - Set up ARM-based systems for AI workloads using RamaLama. - Leverage containerized runtimes for predictable, repeatable deployment. - Optimize GPU performance using Vulkan and llama.cpp. - Deploy at scale with kubernetes YAML and podman quadlets for edge environments. Join us to discover how this ecosystem makes ARM a first-class citizen in the world of AI workloads, bridging the gap between experimentation and production with tools that prioritize simplicity and performance.

Download Slides

Add to My Agenda

Presented by

Profile Picture
Eric Curtin
Principal Software Engineer at Red Hat
Red Hat Engineer working with CentOS Automotive SIG. Upstream maintainer of inotify-tools, ostree, etc. View more
Profile Picture
Sergio Lopez Pascual
Senior Principal Software Engineer at Red Hat
Working in the Automotive Team at Red Hat with a focus on Virtualization. Lead developer of libkrun, maintainer of the "microvm" machine type in QEMU, co-developer of krunkit and muvm, and trying to p... View more