Andrew Wafaa
From Model to Metal: An Open Source Stack for Accelerating Intelligence
#1about 3 minutes
The challenge of a fragmented AI software stack
The proliferation of AI workloads across diverse hardware has created a fragmented ecosystem with vendor-specific libraries and duplicated optimization efforts.
#2about 3 minutes
Unifying AI development with the UXL Foundation
The UXL Foundation leverages Intel's oneAPI to create a unified, open-source library suite that ensures portability and performance across different hardware architectures.
#3about 3 minutes
Exploring core UXL components for compute and math
A breakdown of oneDNN for accelerating deep learning primitives and oneMKL for high-performance mathematical functions like BLAS and FFT.
#4about 2 minutes
Managing parallelism and data with UXL libraries
An overview of oneTBB for task scheduling, oneCCL for multi-node communication, and oneDAL for managing data analysis and machine learning pipelines.
#5about 2 minutes
Visualizing the end-to-end UXL software stack
The complete architecture shows how UXL libraries abstract hardware and accelerate operations for frameworks during training, inference, and serving.
#6about 3 minutes
The benefits of an open and unified AI stack
An open-source, unified stack increases developer productivity, fosters community innovation, and ensures software evolves in lockstep with new hardware releases.
#7about 1 minute
How to contribute to the UXL open source project
A call for developers to contribute kernels, join the UXL Foundation working groups, and share workloads to help grow the open ecosystem.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
1:00:16 MIN
Supporting a global developer community in the age of AI
WeAreDevelopers LIVE – Web Scraping, Agents, Actors and more
14:10 MIN
Leveraging open software and AI for code development
The Future of Computing: AI Technologies in the Exascale Era
03:35 MIN
How the Linux Foundation supports the end-to-end AI stack
The Open Future of AI: Beyond Open Weights
04:05 MIN
Why you should run AI workloads on the Arm CPU
Mobile AI Just Got Faster: What’s Coming for Developers on Arm
13:46 MIN
Enabling hybrid AI with an open software stack
Bringing AI Everywhere
23:20 MIN
The future of on-device AI hardware and APIs
From ML to LLM: On-device AI in the Browser
34:19 MIN
A final summary of Stack Overflow's AI journey
The Data Phoenix: The future of the Internet and the Open Web
04:25 MIN
Architecture of a unified data and GenAI platform
Beyond GPT: Building Unified GenAI Platforms for the Enterprise of Tomorrow
Featured Partners
Related Videos
Open Source AI, To Foundation Models and Beyond
Ankit Patel, Matt White, Philipp Schmid, Lucie-Aimée Kaffee & Andreas Blattmann
Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere
Andrew Waafa
How AI Models Get Smarter
Ankit Patel
Bringing AI Everywhere
Stephan Gillich
Mobile AI Just Got Faster: What’s Coming for Developers on Arm
Gian Marco Iodice
The AI-Ready Stack: Rethinking the Engineering Org of the Future
Jan Oberhauser, Mirko Novakovic, Alex Laubscher & Keno Dreßel
The Open Future of AI: Beyond Open Weights
Matt White
The State of GenAI & Machine Learning in 2025
Alejandro Saucedo
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

AI Systems and MLOps Engineer for Earth Observation
Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning


AI & Embedded ML Engineer (Real-Time Edge Optimization)
autonomous-teaming
Canton of Toulouse-5, France
Remote
C++
GIT
Linux
Python
+1

AI & Embedded ML Engineer (Real-Time Edge Optimization)
autonomous-teaming
München, Germany
Remote
C++
GIT
Linux
Python
+1





Principal Software Architect - Focus: AI Architecture
appliedAI
München, Germany
Remote
Senior
Docker
Terraform
Kubernetes
Software Architecture
+1