Christian Liebel
Prompt API & WebNN: The AI Revolution Right in Your Browser
#1 about 3 minutes
The case for running AI models locally
Cloud-based AI has drawbacks: it does not work offline, capacity can be limited, user data leaves the device, and subscriptions cost money. This creates an opportunity for local, on-device models.
#2 about 2 minutes
Two primary approaches for browser-based AI
The W3C is exploring two main approaches for on-device AI. "Bring Your Own AI" covers libraries like WebLLM and low-level APIs like WebNN, where the application ships its own model. "Built-in AI" covers experimental APIs like the Prompt API, where the browser provides the model.
#3 about 3 minutes
Running large language models with WebLLM
The WebLLM library downloads open-weight large language models into the browser's cache storage and runs them directly in the browser using WebGPU, enabling offline chat and data processing.
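A minimal sketch of what talking to such a local model can look like. WebLLM exposes an OpenAI-style chat interface; the interfaces below only model the part used here, and the helper names `buildMessages` and `askLocalModel` are hypothetical. The engine is passed in as a parameter so the sketch does not depend on the library being installed.

```typescript
// Minimal shape of the OpenAI-style chat interface that WebLLM exposes.
// In a browser, the engine would come from the library, for example:
//   import { CreateMLCEngine } from "@mlc-ai/web-llm";
//   const engine = await CreateMLCEngine(modelId); // modelId: one of the library's prebuilt model IDs
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatEngineLike {
  chat: {
    completions: {
      create(request: { messages: ChatMessage[] }): Promise<{
        choices: { message: { content: string | null } }[];
      }>;
    };
  };
}

// Hypothetical helper: assemble the message list for a single question.
function buildMessages(systemPrompt: string, question: string): ChatMessage[] {
  return [
    { role: "system", content: systemPrompt },
    { role: "user", content: question },
  ];
}

// Hypothetical helper: send one question to the local model and return the reply text.
async function askLocalModel(engine: ChatEngineLike, question: string): Promise<string> {
  const reply = await engine.chat.completions.create({
    messages: buildMessages("You are a helpful assistant.", question),
  });
  return reply.choices[0]?.message.content ?? "";
}
```

Once the model files are cached, a call like `askLocalModel(engine, "Summarize this note")` needs no network connection.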
#4 about 1 minute
Solving the model size and storage problem
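Large AI models create a storage problem: because browser storage is isolated per origin, every website must download and store its own copy of a model, even if another site already has the same files. This has led to a proposal for a Cross Origin Storage API that would allow models to be shared across different websites.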
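The size of the problem can be illustrated with a little arithmetic. The sketch below does not use the proposed API; it only compares per-origin storage with a hypothetical shared store, assuming identical files can be recognized by a content hash. All origins, hashes, and sizes are made up for illustration.

```typescript
// One cached model file as a site would store it (all values hypothetical).
interface CachedModel {
  origin: string;      // site that downloaded the file
  contentHash: string; // stands in for a hash of the file's bytes
  bytes: number;       // file size in bytes
}

// With origin isolation, every origin keeps its own copy.
function isolatedStorageBytes(files: CachedModel[]): number {
  return files.reduce((total, file) => total + file.bytes, 0);
}

// With storage shared by content hash, identical files are kept only once.
function sharedStorageBytes(files: CachedModel[]): number {
  const seen = new Set<string>();
  let total = 0;
  for (const file of files) {
    if (!seen.has(file.contentHash)) {
      seen.add(file.contentHash);
      total += file.bytes;
    }
  }
  return total;
}

const GB = 1024 * 1024 * 1024;
const exampleFiles: CachedModel[] = [
  { origin: "https://chat.example", contentHash: "model-a", bytes: 2 * GB },
  { origin: "https://notes.example", contentHash: "model-a", bytes: 2 * GB },
  { origin: "https://mail.example", contentHash: "model-b", bytes: 1 * GB },
];

console.log(isolatedStorageBytes(exampleFiles) / GB); // 5
console.log(sharedStorageBytes(exampleFiles) / GB);   // 3
```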
#5 about 2 minutes
Exploring diverse ML workloads with Transformers.js
The Transformers.js library enables various on-device machine learning tasks beyond text generation, such as computer vision and audio processing, as shown in a sketch recognition game.
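As a rough sketch of how a sketch recognition game might evaluate a drawing: Transformers.js classification pipelines return a list of labels with scores, and the game only has to check whether the best guess matches the word the player was asked to draw. The helper `isSketchRecognized` and the default threshold of 0.5 are assumptions for illustration, not the code shown in the talk.

```typescript
// Result shape returned by Transformers.js classification pipelines.
// In a browser, the classifier would come from the library, for example:
//   import { pipeline } from "@huggingface/transformers";
//   const classify = await pipeline("image-classification", modelId);
//   const predictions = await classify(imageUrl);
interface Prediction {
  label: string;
  score: number;
}

// Hypothetical helper: true if the highest-scoring label matches the target
// word (case-insensitive) and its score reaches the minimum.
function isSketchRecognized(
  predictions: Prediction[],
  targetWord: string,
  minScore = 0.5,
): boolean {
  let best: Prediction | undefined;
  for (const prediction of predictions) {
    if (!best || prediction.score > best.score) {
      best = prediction;
    }
  }
  return (
    best !== undefined &&
    best.label.toLowerCase() === targetWord.toLowerCase() &&
    best.score >= minScore
  );
}
```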
#6 about 4 minutes
Accelerating performance with the WebNN API
The upcoming Web Neural Network (WebNN) API gives web applications access to specialized hardware such as NPUs, which can run ML workloads significantly faster than CPU or GPU processing.
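A hedged sketch of how an application might probe for WebNN and prefer an NPU. The entry point `navigator.ml.createContext()` comes from the WebNN specification; the `deviceType` option and its values are taken from spec drafts and are an assumption here, since the API is still changing. The helper names are hypothetical.

```typescript
// Minimal model of the WebNN entry point (navigator.ml).
// ASSUMPTION: the `deviceType` option follows spec drafts and may change.
interface MLLike {
  createContext(options?: { deviceType?: string }): Promise<unknown>;
}

// Hypothetical helper: return the WebNN entry point if the browser has one.
function getWebNN(nav: { ml?: MLLike } | undefined): MLLike | undefined {
  return nav?.ml;
}

// Hypothetical helper: try the preferred device types in order and return
// the first context that can be created, or undefined if none works.
async function createContextWithFallback(
  ml: MLLike,
  preferred: string[] = ["npu", "gpu", "cpu"],
): Promise<{ deviceType: string; context: unknown } | undefined> {
  for (const deviceType of preferred) {
    try {
      const context = await ml.createContext({ deviceType });
      return { deviceType, context };
    } catch {
      // This device type is not available here; try the next one.
    }
  }
  return undefined;
}

// Usage in a browser (not executed here):
//   const ml = getWebNN(navigator as { ml?: MLLike });
//   const result = ml ? await createContextWithFallback(ml) : undefined;
```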
#7 about 3 minutes
The alternative: Built-in AI and the Prompt API
Google Chrome's experimental built-in AI initiative addresses the model sharing and performance issues by providing standardized APIs that use a single, browser-managed model like Gemini Nano.
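A minimal sketch of prompting the browser-managed model. It assumes the API shape of recent Chrome versions, where a global `LanguageModel` object offers `availability()` and `create()`; earlier experimental builds exposed the API differently, so treat the interfaces below as assumptions. The API object is passed in as a parameter, and the helper names are hypothetical.

```typescript
// Availability states as reported by recent Chrome versions (assumption).
type Availability = "unavailable" | "downloadable" | "downloading" | "available";

interface PromptSessionLike {
  prompt(input: string): Promise<string>;
}

interface LanguageModelLike {
  availability(): Promise<Availability>;
  create(): Promise<PromptSessionLike>;
}

// Hypothetical helper: a session can be requested unless the model is unavailable.
// In the "downloadable" state, create() starts the model download first.
function canCreateSession(status: Availability): boolean {
  return status !== "unavailable";
}

// Hypothetical helper: prompt the built-in model, or return undefined when
// the API or the model is not available on this device.
async function promptBuiltInModel(
  api: LanguageModelLike | undefined,
  input: string,
): Promise<string | undefined> {
  if (!api) return undefined;
  const status = await api.availability();
  if (!canCreateSession(status)) return undefined;
  const session = await api.create();
  return session.prompt(input);
}

// Usage in a browser (not executed here):
//   const api = (globalThis as { LanguageModel?: LanguageModelLike }).LanguageModel;
//   const answer = await promptBuiltInModel(api, "Write a haiku about browsers.");
```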
#8 about 4 minutes
Exploring the built-in AI API suite
A demonstration of the built-in AI APIs shows how to use the summarizer, language detector, and Prompt API for general LLM tasks directly from JavaScript in the browser.
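The task-specific APIs can be combined. The sketch below detects the language of a text and summarizes it only when the language matches. It assumes the shapes used by recent Chrome versions: a detector whose `detect()` resolves to a list of `{ detectedLanguage, confidence }` entries and a summarizer with a `summarize()` method. Both objects are passed in, and the helper names and the 0.6 confidence threshold are hypothetical.

```typescript
// Result and object shapes as exposed by recent Chrome versions (assumption).
// In a browser, the objects would be created with, for example:
//   const detector = await LanguageDetector.create();
//   const summarizer = await Summarizer.create();
interface DetectionResult {
  detectedLanguage: string;
  confidence: number;
}

interface LanguageDetectorLike {
  detect(text: string): Promise<DetectionResult[]>;
}

interface SummarizerLike {
  summarize(text: string): Promise<string>;
}

// Hypothetical helper: pick the most confident language, or undefined if
// no candidate reaches the minimum confidence.
function mostLikelyLanguage(
  results: DetectionResult[],
  minConfidence = 0.6,
): string | undefined {
  let best: DetectionResult | undefined;
  for (const result of results) {
    if (!best || result.confidence > best.confidence) {
      best = result;
    }
  }
  return best !== undefined && best.confidence >= minConfidence
    ? best.detectedLanguage
    : undefined;
}

// Hypothetical helper: summarize the text only if it is in the expected language.
async function summarizeIfLanguage(
  detector: LanguageDetectorLike,
  summarizer: SummarizerLike,
  text: string,
  expectedLanguage: string,
): Promise<string | undefined> {
  const language = mostLikelyLanguage(await detector.detect(text));
  return language === expectedLanguage ? summarizer.summarize(text) : undefined;
}
```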
#9 about 4 minutes
Practical use cases for on-device AI
On-device AI can enhance web applications with features like an offline-capable chatbot in an Angular app or a smart form filler that automatically categorizes and inputs user data.
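One way a smart form filler could work is to ask the local model for a JSON object and copy only the known fields into the form. The following is a sketch under that assumption, not the implementation from the talk; `buildFormPrompt` and `parseFormValues` are hypothetical helpers, and the prompt wording is only an example.

```typescript
// Hypothetical helper: build a prompt that asks the model for JSON output.
function buildFormPrompt(fieldNames: string[], freeText: string): string {
  return [
    "Extract values for the following form fields from the text.",
    "Reply with a single JSON object whose keys are the field names.",
    "Omit fields that the text does not mention.",
    `Fields: ${fieldNames.join(", ")}`,
    `Text: ${freeText}`,
  ].join("\n");
}

// Hypothetical helper: pull the JSON object out of the model's reply and keep
// only string values for known fields. Returns an empty object if the reply
// contains no parsable JSON object.
function parseFormValues(
  modelReply: string,
  fieldNames: string[],
): Record<string, string> {
  const values: Record<string, string> = {};
  const start = modelReply.indexOf("{");
  const end = modelReply.lastIndexOf("}");
  if (start === -1 || end <= start) return values;

  let parsed: unknown;
  try {
    parsed = JSON.parse(modelReply.slice(start, end + 1));
  } catch {
    return values; // The model did not produce valid JSON.
  }
  if (typeof parsed !== "object" || parsed === null) return values;

  const candidate = parsed as Record<string, unknown>;
  for (const name of fieldNames) {
    const value = candidate[name];
    if (typeof value === "string") {
      values[name] = value;
    }
  }
  return values;
}
```

Filtering by field name keeps unexpected keys in the model's reply out of the form, and the fallback to an empty object leaves the form untouched when the reply cannot be parsed.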
#10 about 3 minutes
Building real-time conversational agents
Demonstrations of a multimodal insurance form assistant and a simple on-device conversational agent highlight the potential for creating interactive, real-time user experiences with local AI.
#11 about 1 minute
Weighing the pros and cons of on-device AI
On-device AI offers significant advantages in privacy, availability, and cost, but developers must consider the trade-offs in model capability, response quality, and system requirements compared to cloud solutions.