voice

#voice

Voice feels like the underrated output layer for AI agents

Reddit r/AI_Agents · 3d ago

The post argues that voice is underused as an output layer for AI agents, pointing to practical use cases and workflow challenges beyond simple text-to-speech.

#voice

Show HN: VoiceDraw – Talk system design out loud, the diagrams draw themselves

Hacker News Top · 5d ago

VoiceDraw is a tool that automatically draws system design diagrams as you speak, capturing reasoning and tradeoffs.

#voice

Juno

Product Hunt · 2026-06-11

Juno is a free, local voice layer for Mac that lets users interact with their computer by speaking instead of typing.

#voice

How We Moved Discord Voice to the Edge

Lobsters Hottest · 2026-06-11

Discord migrated over 80% of its voice and video traffic to Cloudflare's edge network spanning 300+ cities, significantly reducing latency and packet loss globally, with improvements like 34% lower ping in Frankfurt.

#voice

I wired a fully offline voice loop to Ollama + LM Studio — 100% CPU, no GPU, nothing leaves your machine (Silero VAD + Parakeet STT + Supertonic TTS 3)

Reddit r/LocalLLaMA · 2026-06-11

A fully offline, CPU-only voice loop for local LLMs using Silero VAD, Parakeet STT, and Supertonic TTS, integrated via a one-command installer. Works with Ollama, LM Studio, and various agent frameworks.

#voice

Krisp Voice Translation API

Product Hunt · 2026-06-05

Krisp launches a real-time speech-to-speech translation API designed for high accuracy.

#voice

Carbon Voice Speed Dial

Product Hunt · 2026-06-03

Carbon Voice launches a Speed Dial feature enabling quick access to both human team members and AI agents via voice communication.

#voice

twelve agents share one voice file. none of them remember each other.

Reddit r/AI_Agents · 2026-06-03

Describes a multi-agent system in which twelve agents share a single voice file but no memory. Each agent starts from zero and acts independently, so the identity is anchored in the document rather than in any agent.

#voice

Paperwork is better when you can just talk through it (1 minute read)

TLDR AI · 2026-05-25

A tool that lets users complete paperwork by talking through it, making the process more efficient and conversational.

#voice

@FinanceYF5: 3. Antigravity 2.0 is a brand new desktop app built for AI agents, voice, tasks, and Google apps.

X AI KOLs Following · 2026-05-21

Antigravity 2.0 is a brand new desktop app built for AI agents, voice, tasks, and Google apps.

#voice

@antigravity: Introducing Antigravity 2.0, a new standalone desktop application that delivers fully on that original glimpse of a tru…

X AI KOLs Following · 2026-05-19

Antigravity 2.0 is a new standalone desktop application rebuilt with multi-agent teams, scheduled tasks, native voice, and one-click integration with Google products.

#voice

@ycombinator: The $2T telecom industry was built for humans. @AgentPhoneHQ is rebuilding it for agents. One API gives every AI agent …

X AI KOLs Following · 2026-05-15

AgentPhone launches an API that provides AI agents with their own phone numbers and identity, enabling them to make calls and send messages across channels like iMessage, WhatsApp, RCS, and SMS.

#voice

Introducing the Realtime API

OpenAI Blog · 2024-10-01

OpenAI introduces the Realtime API, enabling developers to build low-latency multimodal speech-to-speech conversational experiences with natural voice interactions powered by GPT-4o. The API supports six preset voices and simplifies development by eliminating the need to integrate multiple models.
