Chris Heilmann, Daniel Cranney, Marius Obert & Staff Developer Evangelist at Twilio
Minimal infrastructure for Real‑Time Phone Agents: transcripts in, responses out
#1about 4 minutes
Why voice is a powerful and natural AI interface
Voice interaction is significantly faster for input than typing and allows for hands-free operation, making it a natural fit for many AI use cases despite the challenges of audio parsing.
#2about 1 minute
The complexity of building traditional voice agents
Building a voice agent the traditional way requires managing separate services for speech recognition, text-to-speech, and interruption detection, which introduces significant latency and complexity.
#3about 4 minutes
Simplifying voice agent architecture with ConversationRelay
Twilio's ConversationRelay abstracts away the complexities of voice processing, allowing developers to receive text transcripts via a WebSocket and focus solely on their application logic.
#4about 6 minutes
Live coding a Deno server for a phone agent
A basic Deno server is set up to handle initial HTTP requests by returning TwiML instructions and to upgrade the connection to a WebSocket for real-time communication.
#5about 3 minutes
Configuring a Twilio number and testing the connection
A new phone number is purchased and configured in the Twilio console to point to the server's webhook, followed by a live call to test the transcription and hardcoded response.
#6about 5 minutes
Integrating OpenAI for streaming dynamic responses
The OpenAI API is integrated to generate dynamic responses, using streaming to send text chunks back as they are generated to minimize perceived latency for the caller.
#7about 2 minutes
Adding conversational memory for context-aware replies
A simple map is used to store the last message ID for each WebSocket connection, enabling the OpenAI API to maintain conversational history for follow-up questions.
#8about 2 minutes
Final demo with AI, history, and interruption
The final demonstration showcases the fully functional AI phone agent handling a multi-turn conversation, remembering context, and allowing the user to interrupt its response.
Related jobs
Jobs that call for the skills explored in this talk.
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Eltemate
Amsterdam, Netherlands
Intermediate
Senior
TypeScript
Continuous Integration
+1
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
04:17 MIN
Playing a game of real or fake tech headlines
WeAreDevelopers LIVE – You Don’t Need JavaScript, Modern CSS and More
05:03 MIN
Building and iterating on an LLM-powered product
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
00:30 MIN
The feasibility of coding entirely on a mobile phone
Fake or News: Coding on a Phone, Emotional Support Toasters, ChatGPT Weddings and more - Anselm Hannemann
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
04:06 MIN
Using AI to enable human connection in recruiting
Retention Over Attraction: A New Employer Branding Mindset
06:33 MIN
The security challenges of building AI browser agents
AI in the Open and in Browsers - Tarek Ziadé
Featured Partners
Related Videos
WeAreDevelopers LIVE – Real-Time Phone Agents, Unsafe VPNs & More
Chris Heilmann, Daniel Cranney & Marius Obert
Raise your voice!
Lee Boonstra
Hello JARVIS - Building Voice Interfaces for Your LLMS
Nathaniel Okenwa
WeAreDevelopers LIVE – AI vs the Web & AI in Browsers
Chris Heilmann, Daniel Cranney & Raymond Camden
From ML to LLM: On-device AI in the Browser
Nico Martin
WeAreDevelopers LIVE - Build a multi AI agents game master with Strands & our weekly web finds
Chris Heilmann, Daniel Cranney & Tiffany Souterre
Agents for the Sake of Happiness
Thomas Dohmke
From Syntax to Singularity: AI’s Impact on Developer Roles
Anna Fritsch-Weninger
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.




MUUUH! GmbH
Junior
JSON
REST
Data analysis
Microsoft Office
Amazon Web Services (AWS)



MUUUH! GmbH
Osnabrück, Germany
Junior
JSON
REST
Data analysis
Microsoft Office
Amazon Web Services (AWS)

MANGO
Palau-solità i Plegamans, Spain
API
Azure
Redis
Node.js
Salesforce
+6

Starion Group
Municipality of Madrid, Spain
API
CSS
Python
Docker
Machine Learning
+1