Christian Liebel

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

What if your web app could run generative AI without cloud costs or latency? Discover how WebGPU and the upcoming WebNN API make on-device AI a reality.

#1 about 1 minute

Generative AI use cases and cloud provider limitations

Cloud-based AI faces challenges like required internet connectivity, data privacy risks, and high costs, creating a need for local alternatives.

#2 about 13 minutes

Running large language models locally with Web LLM

Web LLM runs multi-gigabyte language models like Llama 3 directly in the browser and keeps them usable offline, at the cost of a long initial download and initialization.
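To give a feel for what "multi-gigabyte" means, the following is a rough back-of-the-envelope sketch rather than anything shown in the talk. It multiplies parameter count by bits per weight. The function names are hypothetical, and the figures used (an 8-billion-parameter model, 4-bit and 16-bit weights) are illustrative assumptions.

```typescript
// Rough download-size estimate for a model: parameters x bits per weight.
// Assumption: ignores tokenizer files, metadata, and runtime buffers,
// so real downloads will be somewhat larger.
function estimateModelBytes(parameterCount: number, bitsPerWeight: number): number {
  if (parameterCount < 0 || bitsPerWeight <= 0) {
    throw new RangeError("parameterCount must be >= 0 and bitsPerWeight > 0");
  }
  return (parameterCount * bitsPerWeight) / 8;
}

// Convert bytes to decimal gigabytes (1 GB = 1e9 bytes).
function toGigabytes(bytes: number): number {
  return bytes / 1e9;
}

// Illustrative figures only: 8 billion parameters at two precisions.
const quantizedGb = toGigabytes(estimateModelBytes(8e9, 4));
const halfPrecisionGb = toGigabytes(estimateModelBytes(8e9, 16));
console.log(`4-bit: ~${quantizedGb} GB, 16-bit: ~${halfPrecisionGb} GB`);
```

Under these assumptions, quantizing to 4 bits cuts the download to a quarter of the 16-bit size, which is why in-browser runtimes typically ship quantized weights.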

#3 about 2 minutes

The technology behind in-browser AI execution

In-browser AI performance is accelerated by combining WebAssembly for efficient computation and the new WebGPU API for direct access to the system's GPU.
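Before loading a model, an app would usually check that both building blocks are present. The sketch below is one possible way to do that, not code from the talk. The interfaces and the `detectCapabilities` helper are hypothetical, and the navigator and global scope are passed in as parameters so the logic does not depend on browser type definitions.

```typescript
// Minimal shapes so the sketch does not depend on browser type definitions.
interface NavigatorLike {
  gpu?: unknown; // WebGPU entry point (navigator.gpu) when the browser exposes it
}

interface ScopeLike {
  WebAssembly?: unknown; // global WebAssembly namespace when available
}

interface Capabilities {
  wasm: boolean;
  webgpu: boolean;
}

// Report which of the two technologies appear to be exposed.
function detectCapabilities(nav: NavigatorLike, scope: ScopeLike): Capabilities {
  return {
    wasm: typeof scope.WebAssembly === "object" && scope.WebAssembly !== null,
    webgpu: nav.gpu !== undefined && nav.gpu !== null,
  };
}
```

In a browser this would be called as `detectCapabilities(navigator, globalThis)`. Note that the presence of `navigator.gpu` is only a first hint: `navigator.gpu.requestAdapter()` can still resolve to `null` when no suitable GPU is available.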

#4 about 4 minutes

Boosting performance with the upcoming WebNN API

The Web Neural Network (WebNN) API provides access to dedicated Neural Processing Units (NPUs) for even faster, more efficient on-device model inference.
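Because WebNN is not yet widely available, an app that wants to benefit from it needs a fallback path. The following sketch shows one possible preference order (WebNN, then WebGPU, then plain WebAssembly). The `pickBackend` helper and its types are hypothetical, and the ordering is an assumption rather than a recommendation from the talk.

```typescript
type Backend = "webnn" | "webgpu" | "wasm";

// Minimal shape of the navigator properties this sketch looks at.
interface AcceleratorNavigator {
  ml?: unknown;  // WebNN entry point (navigator.ml) when implemented
  gpu?: unknown; // WebGPU entry point (navigator.gpu) when implemented
}

// Pick the most specialized backend the browser appears to expose.
// Assumption: WebAssembly is treated as the always-available baseline.
function pickBackend(nav: AcceleratorNavigator): Backend {
  if (nav.ml !== undefined && nav.ml !== null) return "webnn";
  if (nav.gpu !== undefined && nav.gpu !== null) return "webgpu";
  return "wasm";
}
```

Whether the WebNN path actually reaches an NPU depends on the browser, the operating system, and the hardware, so a production app would also handle context creation failing at runtime.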

#5 about 6 minutes

Solving model duplication with the new Prompt API

The experimental Prompt API addresses the issue of redundant model downloads by allowing websites to access a single, shared OS-level model like Gemini Nano.

#6 about 3 minutes

Using the Prompt API for on-device data extraction

A demonstration shows how the Prompt API can use a local model to accurately extract structured data from unstructured text, highlighting its practical application.
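The extraction pattern can be sketched independently of the model that answers the prompt. The code below is a hypothetical illustration, not the demo from the talk: `buildExtractionPrompt` and `parseExtraction` are invented helper names, and the call to the Prompt API itself is left out because its surface is experimental and has changed between browser versions.

```typescript
type Extracted = Record<string, string | null>;

// Build a prompt that asks the model for the given fields as JSON.
function buildExtractionPrompt(text: string, fields: string[]): string {
  return [
    "Extract the following fields from the text and answer with JSON only.",
    `Fields: ${fields.join(", ")}`,
    "Use null for any field that is not present.",
    `Text: ${text}`,
  ].join("\n");
}

// Parse the model's reply defensively: every requested field is returned,
// and anything missing or malformed becomes null.
function parseExtraction(raw: string, fields: string[]): Extracted {
  const result: Extracted = {};
  for (const field of fields) result[field] = null;

  // Replies may wrap the JSON in prose or code fences, so take the outermost braces.
  const start = raw.indexOf("{");
  const end = raw.lastIndexOf("}");
  if (start === -1 || end <= start) return result;

  let parsed: unknown;
  try {
    parsed = JSON.parse(raw.slice(start, end + 1));
  } catch {
    return result;
  }
  if (typeof parsed !== "object" || parsed === null) return result;

  const record = parsed as Record<string, unknown>;
  for (const field of fields) {
    const value = record[field];
    if (typeof value === "string") result[field] = value;
  }
  return result;
}
```

The string from `buildExtractionPrompt` would be sent to the local model, and its reply passed to `parseExtraction`. Keeping the parsing tolerant matters because small on-device models do not always return clean JSON.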

#7 about 2 minutes

Generating images in the browser with WebSD

WebSD brings text-to-image generation to the browser by running Stable Diffusion models locally using WebGPU, enabling creative AI tasks without cloud dependency.

#8 about 1 minute

Weighing the pros and cons of local AI models

Local AI models offer better privacy, offline availability, and low running costs, but come with trade-offs: lower output quality than large cloud models, high system requirements, and slower performance.

#9 about 1 minute

The future of on-device AI in web development

While cloud-based models are currently superior, the trend towards more compact open-source models and OS-integrated AI suggests a growing role for local AI in specialized web applications.
