Tag
Hasbro's Peppa Pig is facing backlash after asking child actors to sign over their voices for AI use, prompting an open letter from the Agents of Young Performers Association condemning the practice.
A Twitter thread lists the 10 fastest-growing repositories on GitHub in June 2025, covering AI workspaces, token compression, agent prompt optimization, video generation, voice cloning, stock analysis, research agents, and more.
MOSS-TTS is an open-source voice cloning model introduced by Mosi Company. Users can clone a voice by reading a small amount of text, and then use the cloned voice to generate any speech with realistic results.
NetEase Youdao open-sourced the 1.3B parameter Confucius4-TTS model, supporting zero-shot voice cloning and cross-lingual speech synthesis in 14 languages, fast and with excellent results.
MOSS-TTS-Local Transformer v1.5 is an open-source 48 kHz stereo TTS model with zero-shot voice cloning, native streaming, and support for 31 languages, built on a Qwen3-4B backbone and served via SGLang-Omni.
MosiAI has released MOSS-TTS Local Transformer v1.5, a text-to-speech model that supports voice cloning, over 30 languages, and high-quality 48 kHz output.
VoxCPM2 is an open-source speech synthesis model from OpenBMB, using a tokenizer-free diffusion autoregressive architecture, supporting 30 languages, voice design, and controllable voice cloning. It can clone a voice with just one sentence, or create a brand new voice using text, outputting 48kHz high-quality audio, and is commercially usable.
Zyphra released ZONOS2, an open-source MoE text-to-speech model trained on over 6 million hours of multilingual speech, supporting voice cloning and high-quality synthesis across many languages.
Zyphra releases ZONOS2, an open-source real-time TTS model with high-fidelity voice cloning, under Apache 2.0, available on Zyphra Cloud on AMD.
ZONOS2 is a new text-to-speech model from Zyphra trained on over 6 million hours of multilingual speech, offering high-quality voice cloning and low latency using a mixture-of-experts architecture. It supports 30+ languages and includes a high-performance inference server.
This article from Sumsub examines the rising threat of AI-powered deepfake fraud targeting the creator economy, and offers a guide on detection and protection methods to combat impersonation and voice cloning scams.
Introducing an open-source voice synthesis model with 2 billion parameters and 2 million hours of training. It supports 30 languages and 9 Chinese dialects, allows voice description via natural language, can clone voices from a 3-second recording, delivers 48kHz studio-quality audio, and is free for commercial use under the Apache-2.0 license.
RedNote releases dots.tts, a 2B parameter open-source text-to-speech model with zero-shot voice cloning and 48 kHz synthesis.
Dots.tts is a new TTS model from Xiaohongshu (RedNote) with 2B parameters, Apache 2.0 license, fully continuous architecture without codec tokens, 48kHz synthesis, and zero-shot voice cloning.
Introducing VoxCPM2, a completely free for commercial use, open-source multilingual voice synthesis model supporting voice design, cloning, and 48kHz high-quality output, ranked #1 on GitHub trending.
Microsoft has released MAI-Voice-2, an expressive text-to-speech system supporting voice cloning in 15 languages.
Technical breakdown and benchmarks of VoxCPM2, an open-source TTS model featuring Ultimate Cloning Mode for capturing breathing and accents, tested locally with low VRAM footprint and cross-lingual accent retention.
KrillinAI is an open-source tool that integrates the entire workflow of video downloading, subtitle translation, AI dubbing, and video compositing. It supports context-aware translation, voice cloning, auto layout, and cover generation, and is compatible with multiple AI models, suitable for multilingual audio/video content creation and distribution.
A curated list of 10 open-source projects that provide powerful AI and developer capabilities, often replacing costly commercial software—including AI trading agents, multi-model chat interfaces, video generation, voice cloning, and more.
MOSS-TTS-v1.5, a multilingual controllable TTS model with voice cloning and long-form generation, reached #1 on Hugging Face Trending with 20.6K downloads.