Tobias Münch
Is the web ready for voice user interfaces?
#1about 3 minutes
Why voice user interfaces are important for accessibility
Voice interfaces can significantly improve web accessibility for users with disabilities and provide hands-free convenience for mobile professionals.
#2about 1 minute
Understanding the Web Speech API's core functions
The Web Speech API is a W3C standard divided into speech recognition for converting voice to text and speech synthesis for converting text to voice.
#3about 2 minutes
Reviewing VUI research and its current limitations
Research projects like the Conversational Web and a wheelchair VUI demonstrate potential but suffer from inconsistent accuracy, online-only functionality, and lack of wake words.
#4about 3 minutes
How to implement the Web Speech API in JavaScript
Learn the step-by-step process of implementing speech recognition, including loading the class, configuring grammar with JSGF, starting the listener, and processing the results.
#5about 2 minutes
Navigating the Web Speech API's result data structure
The API returns a nested data structure containing a list of results, each with alternatives that include the text transcript and a confidence score.
#6about 3 minutes
Key challenges limiting Web Speech API adoption
The API's adoption is hindered by significant issues including poor developer experience, privacy risks from cloud processing, no offline support, and inconsistent browser implementations.
#7about 3 minutes
A look inside the browser's implementation of speech recognition
An analysis of the Chromium source code reveals how the Web Speech API is implemented through layers that manage and dispatch recognition tasks to either remote cloud services or local OS-dependent engines.
#8about 5 minutes
The future of VUIs with Stanford's React Genie
Stanford's React Genie project offers a new paradigm by loosely coupling a voice agent with React state, allowing for complex voice commands that can manipulate off-screen content and application logic.
#9about 1 minute
Final verdict on the web's readiness for voice UIs
While the current Web Speech API is suitable for experimentation, it is not reliable enough for production use, but promising research indicates a more capable future for web-based voice interfaces.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Hubert Burda Media
München, Germany
€80-95K
Intermediate
Senior
JavaScript
Node.js
+1
Douglas GmbH
Düsseldorf, Germany
Intermediate
Senior
TypeScript
React
Matching moments
06:33 MIN
The security challenges of building AI browser agents
AI in the Open and in Browsers - Tarek Ziadé
03:31 MIN
The value of progressive enhancement and semantic HTML
WeAreDevelopers LIVE – You Don’t Need JavaScript, Modern CSS and More
08:40 MIN
Integrating AI into Firefox while respecting user privacy
AI in the Open and in Browsers - Tarek Ziadé
06:23 MIN
The trend of browsers depending on online services
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
03:16 MIN
Improving the developer feedback loop with specialized tools
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
04:30 MIN
Understanding browser APIs that rely on company services
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
02:33 MIN
Why you might not need JavaScript for everything
WeAreDevelopers LIVE – You Don’t Need JavaScript, Modern CSS and More
Featured Partners
Related Videos
Speak, Code, Deploy: Transforming Developer Experience with Voice Commands
Sami Ekblad
Hello JARVIS - Building Voice Interfaces for Your LLMS
Nathaniel Okenwa
What’s New and What’s Next in Web UI
Cleyra Uzcategui
Building a Browser-Based Karaoke Game with Web Speech API
Ana Rodrigues
From ML to LLM: On-device AI in the Browser
Nico Martin
Exploring the Future of Web AI with Google
Thomas Steiner
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
The State Of The Web
Jeremy Keith
Related Articles
View all articles


.webp?w=240&auto=compress,format)
From learning to earning
Jobs that call for the skills explored in this talk.

Visonum GmbH
Remote
Junior
Intermediate
React
Redux
TypeScript







