Gian Marco Iodice
Mobile AI Just Got Faster: What’s Coming for Developers on Arm
#1about 3 minutes
Exploring generative AI use cases on mobile devices
Generative AI on mobile enables powerful, local-first applications like group chat summarization and high-quality audio generation without an internet connection.
#2about 3 minutes
Why you should run AI workloads on the Arm CPU
The Arm CPU offers scalability, security, and an "optimize once, deploy everywhere" model, making it ideal for high-performance, low-latency AI applications.
#3about 2 minutes
Navigating the diverse mobile AI framework ecosystem
A wide range of open-source frameworks, each with unique strengths, are available for deploying AI models on Arm-powered mobile devices.
#4about 3 minutes
How the KleidiAI library unifies AI performance
The KleidiAI library provides highly optimized, low-level routines that integrate directly into popular AI frameworks to ensure the best performance on Arm CPUs.
#5about 3 minutes
A deep dive into the on-device AudioGen pipeline
The AudioGen pipeline runs locally by combining multiple models and processing steps, requiring data type flexibility like FP32 and FP16 for optimal quality.
#6about 2 minutes
Building a private, fully on-device smart assistant
Generative AI enables smart speakers to run entirely locally, combining speech-to-text, LLM, and text-to-speech models for a private user experience.
#7about 3 minutes
Introducing SME2 for next-generation AI acceleration
The Scalable Matrix Extension 2 (SME2) for Armv9 CPUs uses the Matrix Outer Product Accumulate (MPA) instruction to dramatically accelerate matrix multiplication.
#8about 1 minute
Measuring performance gains with SME2 acceleration
SME2 delivers over six times better performance for key generative AI models like Gemma and Whisper, enabling real-time text summarization and audio generation.
#9about 2 minutes
How Android developers can prepare for SME2
With SME2 support coming to Android, developers using AI frameworks with KleidiAI integration will automatically receive significant performance boosts without any code changes.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
Matching moments
00:48 MIN
The shift to on-device AI models in smartphones
Fake or News: Coding on a Phone, Emotional Support Toasters, ChatGPT Weddings and more - Anselm Hannemann
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
03:07 MIN
Final advice for developers adapting to AI
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
03:55 MIN
The hardware requirements for running LLMs locally
AI in the Open and in Browsers - Tarek Ziadé
01:02 MIN
AI lawsuits, code flagging, and self-driving subscriptions
Fake or News: Self-Driving Cars on Subscription, Crypto Attacks Rising and Working While You Sleep - Théodore Lefèvre
14:06 MIN
Exploring the role and ethics of AI in gaming
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
00:30 MIN
The feasibility of coding entirely on a mobile phone
Fake or News: Coding on a Phone, Emotional Support Toasters, ChatGPT Weddings and more - Anselm Hannemann
Featured Partners
Related Videos
Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere
Andrew Waafa
From Model to Metal: An Open Source Stack for Accelerating Intelligence
Andrew Wafaa
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
Sam Witteveen
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
How AI Models Get Smarter
Ankit Patel
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
Christian Liebel
From ML to LLM: On-device AI in the Browser
Nico Martin
Your Next AI Needs 10,000 GPUs. Now What?
Anshul Jindal & Martin Piercy
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Arm Limited
Cambridge, United Kingdom
£47K
Senior
API
Azure
Python
Amazon Web Services (AWS)


Scalable GmbH
Berlin, Germany
API
Data analysis
Microservices
Agile Methodologies

Scalable GmbH
München, Germany
API
Data analysis
Microservices
Agile Methodologies

AiMA Beyond Ai
Barcelona, Spain
€40K
Senior
iOS
Java
NoSQL
React
+11

Apple Inc.
Cambridge, United Kingdom
C++
Java
Bash
Perl
Python
+4


