@AndrewYNg: New course: Add voice to your AI agents and applications, built with @VocalBridge (disclosure: an AI Fund portfolio com…
Summary
Andrew Ng announces a new course on adding voice to AI agents using VocalBridge, taught by its CEO. The course covers three integration patterns and evaluation techniques for building reliable and low-latency voice applications.
View Cached Full Text
Cached at: 06/18/26, 10:11 PM
New course: Add voice to your AI agents and applications, built with @VocalBridge (disclosure: an AI Fund portfolio company) and taught by its CEO @_ashwyn.
Voice applications historically required making a hard tradeoff: using fast voice-to-voice models that sacrifice reliability, or accurate speech-to-text pipelines that add latency. This course teaches you how to build voice agents that are both reliable and fast.
You’ll build three types of voice-enabled applications: a voice-interactive game where voice commands and mouse clicks work together over a single channel, an agent that gains a voice in about 10 lines of code without touching its prompts or tools, and an agent that places outbound phone calls using a make_phone_call function.
Skills you’ll gain:
- Add a voice layer to an existing agent without rewriting your prompts, RAG pipeline, or tools
- Give an agent the ability to place outbound calls and stream transcripts back live
- Set up voice evaluation to score calls, catch regressions, and improve quality before deployment
Join and add voice to your agents without overhauling your architecture: https://deeplearning.ai/courses/voice-for-ai-agents-and-applications…
Voice for AI Agents and Applications
Source: https://www.deeplearning.ai/courses/voice-for-ai-agents-and-applications
What you’ll learn
- Implement three types of voice-enabled AI applications: a voice-interactive game, a voice-layered agent, and an agent that places outbound phone calls.
- Add voice to an existing agent with minimal code, without rewriting your prompts, RAG pipeline, or tools.
- Use voice evaluation to score your agent’s calls, surface failure modes, and improve quality before reaching production.
About this course
Voice is one of the most natural human interfaces, but adding it to AI applications has historically forced a tradeoff: fast voice-to-voice models that sacrifice reliability, or accurate speech-to-text-to-LLM-to-speech pipelines that add latency.
This course teaches you how to get both, using Vocal Bridge’s architecture that pairs a real-time foreground agent with a reasoning background agent.
Taught by Ashwyn Sharma, CEO and Co-Founder of Vocal Bridge (an AI Fund portfolio company), this course covers three practical integration patterns that meet you where you are: voice embedded in an application, voice layered onto an existing agent without touching its logic, and voice as a tool your LLM can call when it decides a conversation is the right modality.
In detail, you’ll:
- Survey the traditional voice stack and its tradeoffs, then explore three live integration patterns to understand when each one applies.
- Build a voice-interactive tic-tac-toe game where voice commands and mouse clicks work together over a single synchronized channel, then add a voice layer to an existing agent with minimal code, leaving your prompts, RAG pipeline, and tools untouched.
- Give your agent a make_phone_call tool so it can dial a real number, hold a conversation with a demo agent, and stream the transcript back live.
- Set up evaluation-driven development using Vocal Bridge’s multimodal evaluator to score calls, catch regressions, and refine prompts before issues reach users.
- Hear from Scott Johnston, former CEO of Docker and Vocal Bridge board member, on what it actually takes to move voice agents from demos to production.
By the end of this course, you’ll have implemented three hands-on voice AI patterns: adding voice to an interactive app, layering voice onto a text-based agent, and giving an agent the ability to place outbound calls. You’ll also know how to evaluate and improve voice interactions.
Who should join?
Developers and AI builders who want to add voice to their agents or applications. Basic familiarity with Python is recommended. No prior experience with voice APIs is required.
Course Outline
8Lessons・5Code Examples
Instructor

Ashwyn Sharma
CEO and Co-Founder, Vocal Bridge
Voice for AI Agents and Applications
- Beginner
- 1h26m
- 8 Video Lessons
- 5 Code Examples
- 1 Graded AssignmentPRO
- Earnan accomplishmentwithPRO
- Instructor: Ashwyn Sharma
Vocal Bridge- Learn more aboutMembership PRO Plan
7-Day Voice AI Builder Challenge
Stop babysitting long runs. Join the challenge and teach your AI coding assistant to call you the moment it needs human intervention.
Similar Articles
Built my own voice AI platform after Vapi burned me. Wrote up everything I learned shopping for one.
The author shares lessons from building their own voice AI platform after dissatisfaction with Vapi, revealing hidden costs, real-world latency issues, and white-label shortcomings, and offers a free guide for agency owners evaluating platforms.
Join the new AI Agents Vibe Coding Course from Google and Kaggle
Google and Kaggle are hosting a free five-day AI Agents Vibe Coding course in June 2026, focusing on building production-ready agents using natural language workflows.
@AndrewYNg: New course: Transformers in Practice. You'll get a practical view of how transformer-based LLMs work, so you can reason…
New course 'Transformers in Practice' from deeplearning.ai and AMD teaches practical understanding of transformer-based LLMs, covering text generation, attention mechanisms, and inference optimization techniques like quantization and KV caching.
@AndrewYNg: New course on serving LLMs efficiently -- how do you serve models to many concurrent users at low latency and reasonabl…
Andrew Ng and DeepLearning.AI have launched a new short course on efficient LLM inference with vLLM, built in partnership with Red Hat, covering quantization, PagedAttention, continuous batching, and benchmarking for serving LLMs at scale.
Building voice AI agents that take turns like humans — the gotchas nobody warns you about
This article shares hard-won lessons from building real-time voice AI agents, highlighting the importance of proper turn-taking, VAD handling, billing awareness, and avoiding echo loops.