Tag
ElevenLabs introduces the ability to call your Hermes Agent, enabling voice-based interaction with AI agents through their platform.
The author shares their experience of making a 50-second talking-head video using @leeoxiang's skill, ElevenLabs, and Feishu CLI, emphasizing that the vtake-cut skill allows more people to express their ideas with zero barriers.
Recommend Scribe2SRT, an open-source speech-to-subtitle tool based on PySide6 and ElevenLabs API, supporting multiple languages with optimized formatting for fast generation of high-quality SRT subtitles.
ElevenLabs launched Dubbing v2, an AI dubbing model that preserves the original speaker's emotion, tone, and performance across 90+ languages by conditioning on the original audio directly, offering broadcast-quality dubbing at a fraction of the cost.
ElevenLabs signs a deal with Stan Lee Universe to create an AI clone of Stan Lee's voice and likeness for digital cameos, audiobooks, and a book club series, sparking ethical debates about consent and exploitation.
Supertonic 3 is a 99M parameter open-source TTS model that runs entirely on-device, beating ElevenLabs on a Raspberry Pi with 167x faster than real-time performance on a laptop CPU.
Nimit Sohoni left a high-paying Citadel quant role to build next-generation voice AI at Cartesia, competing with ElevenLabs, highlighting the trade-offs between quant finance and AI research.
Spotify announces a new ElevenLabs-powered tool for self-publishing audiobooks within its Spotify for Authors platform, launching in beta this June with English support.
Prajwal Tomar demonstrates how to quickly build an AI-powered storytelling app using ElevenLabs for text-to-speech and Lovable for app development, enabling personalized stories with narration and sound effects.
The author developed a portable user preference profile system that integrates with ElevenLabs and Pipecat agents, allowing voice assistants to remember user styles and interests across different platforms to skip redundant onboarding.