Microsoft MAI-Voice-2
Summary
Microsoft has released MAI-Voice-2, an expressive text-to-speech system supporting voice cloning in 15 languages.
Similar Articles
k2-fsa/OmniVoice
OmniVoice is a massively multilingual zero-shot text-to-speech model supporting over 600 languages, built on a diffusion language model architecture with fast inference and voice cloning capabilities.
Voiser AI
Voiser AI offers human-like AI voiceovers in over 140 languages.
@tom_doerr: Zero-shot voice cloning for 30 languages https://github.com/sunnyxrxrx/X-Voice…
X-Voice is a flow-matching-based multilingual text-to-speech system that enables zero-shot voice cloning across 30 languages, with open-source code, model, and demo available.
OpenMOSS-Team/MOSS-TTS-v1.5 · Hugging Face
MOSS-TTS v1.5 is an updated open-source text-to-speech model with improved multilingual synthesis (supporting 31 languages), more stable zero-shot voice cloning, and explicit inline pause control.
@HowToAI_: ElevenLabs just lost its moat Someone has open-sourced a single app that replaces ElevenLabs AND WisprFlow and runs 100…
An open-source app called Voicebox replaces ElevenLabs and WisprFlow with local voice cloning, multiple TTS engines, and MCP server support, running on various hardware with MIT license.