Tag
ElevenLabs introduces Avatars in ElevenCreative, a dedicated entry point for generating talking-head videos, enabling users to create AI-driven avatar videos with realistic speech and lip-sync.
The author argues that AI tools have made software coding trivial, shifting hackathon focus to hardware integration. He advocates for embracing ridiculous, retro hardware projects combined with AI for future hackathons.
90210 is a production-grade local app that uses AI models like Google Veo 3.1, Gemini 2.5 Pro, and ElevenLabs Music to turn screenplays into finished short films with synchronized video, audio, dialogue, music, and subtitles, featuring a quality oracle for auto re-rolls.
ElevenLabs introduces the ability to call your Hermes Agent, enabling voice-based interaction with AI agents through their platform.
The author shares their experience of making a 50-second talking-head video using @leeoxiang's skill, ElevenLabs, and Feishu CLI, emphasizing that the vtake-cut skill allows more people to express their ideas with zero barriers.
A recommendation for Scribe2SRT, an open-source speech-to-subtitle tool built on PySide6 and the ElevenLabs API. It supports multiple languages and optimizes formatting to quickly generate high-quality SRT subtitles.
ElevenLabs launched Dubbing v2, an AI dubbing model that preserves the original speaker's emotion, tone, and performance across 90+ languages by conditioning on the original audio directly, offering broadcast-quality dubbing at a fraction of the cost.
ElevenLabs signs a deal with Stan Lee Universe to create an AI clone of Stan Lee's voice and likeness for digital cameos, audiobooks, and a book club series, sparking ethical debates about consent and exploitation.
Supertonic 3 is a 99M-parameter open-source TTS model that runs entirely on-device, beating ElevenLabs while running on a Raspberry Pi and reaching 167x faster than real-time performance on a laptop CPU.
Nimit Sohoni left a high-paying Citadel quant role to build next-generation voice AI at Cartesia, competing with ElevenLabs, highlighting the trade-offs between quant finance and AI research.
Spotify announces a new ElevenLabs-powered tool for self-publishing audiobooks within its Spotify for Authors platform, launching in beta this June with English support.
Prajwal Tomar demonstrates how to quickly build an AI-powered storytelling app using ElevenLabs for text-to-speech and Lovable for app development, enabling personalized stories with narration and sound effects.
The author developed a portable user preference profile system that integrates with ElevenLabs and Pipecat agents, allowing voice assistants to remember user styles and interests across different platforms to skip redundant onboarding.