Open-sourced a full-stack starter for production voice agents (web + telephony on one worker)

Reddit r/AI_Agents Tools

Summary

Open-sourced a full-stack production starter for voice agents using LiveKit, FastAPI, and React, handling both web and telephony with a single code path, deployable via Docker Compose.

Most voice agent tutorials stop at "here's a script that talks back." The gap to production is everything around it: minting room tokens, a real client, telephony, deploy, tests. I kept rebuilding that, so I packaged it as an open starter (MIT).

It's a LiveKit-based voice agent in three parts: the voice worker (STT to LLM to TTS with turn detection), a FastAPI token server, and a React client with a live transcript and text chat.

The part I'm happiest with: web and SIP (phone) calls hit the same agent through one participant branch, so you don't maintain two code paths for "talk in the browser" and "call a number." You extend the agent by adding function tools or handing off to a task, and the providers are swappable. Docker Compose runs the whole stack with one command.

For folks who've shipped voice agents: where did the demo-to-production gap bite you hardest? I want the starter to cover the real pain, not just the happy path.
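To make the "one participant branch" idea concrete, here's a minimal sketch of the pattern in plain Python. It is not the starter's actual code and it doesn't use the LiveKit SDK: `ParticipantKind`, `Participant`, `SessionConfig`, `configure_session`, `handle_turn`, and the `phone_number` attribute key are all hypothetical names made up for illustration. The point is only that web and phone callers diverge in one small function, and everything after it is shared.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict


class ParticipantKind(Enum):
    """Hypothetical stand-in for the participant kind the media server reports."""
    WEB = "web"
    SIP = "sip"


@dataclass
class Participant:
    """Hypothetical view of whoever joined the room."""
    identity: str
    kind: ParticipantKind
    # Assumed attribute bag; a SIP caller might carry their number here.
    attributes: Dict[str, str] = field(default_factory=dict)


@dataclass
class SessionConfig:
    greeting: str
    # Browser clients can render a live transcript; phone callers have no UI.
    send_transcript_to_client: bool


def configure_session(participant: Participant) -> SessionConfig:
    """The single branch: the only place web and phone callers are treated differently."""
    if participant.kind is ParticipantKind.SIP:
        caller = participant.attributes.get("phone_number", "unknown caller")
        return SessionConfig(
            greeting=f"Thanks for calling, {caller}.",
            send_transcript_to_client=False,
        )
    return SessionConfig(
        greeting="Hi, you're connected from the browser.",
        send_transcript_to_client=True,
    )


def handle_turn(user_text: str) -> str:
    """Shared agent logic (placeholder for the STT -> LLM -> TTS pipeline)."""
    return f"You said: {user_text}"


if __name__ == "__main__":
    web = Participant(identity="alice", kind=ParticipantKind.WEB)
    phone = Participant(
        identity="sip_caller",
        kind=ParticipantKind.SIP,
        attributes={"phone_number": "+15550100"},
    )
    for p in (web, phone):
        cfg = configure_session(p)
        print(p.kind.value, "|", cfg.greeting, "|", handle_turn("hello"))
```

Keeping the branch this small is the design choice: a new tool or provider only touches the shared path, so browser and phone callers get it at the same time.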
