voice-cloning

Tag

Cards List
#voice-cloning

@Prince_Canuma: mlx-audio v0.4.3 is here A massive release across models, server, and DX → 6 new TTS models: Higgs Audio v2 (voice clon…

X AI KOLs Timeline · 7h ago Cached

mlx-audio v0.4.3 releases with 6 new TTS models including Higgs Audio v2 and OmniVoice (646+ languages), plus server improvements like concurrent requests and continuous batching, ~3x faster Voxtral Realtime on 4-bit, and slimmer dependencies for Apple Silicon.

1 favorites 1 likes
#voice-cloning

Built a JARVIS-style assistant with wake word, vision mode, local voice cloning, and LLM-generated system commands

Reddit r/ArtificialInteligence · 23h ago

A developer built a JARVIS-style personal assistant called CYBER with wake word activation, local voice cloning via XTTS v2, vision mode, and LLM-generated system commands, all running locally without cloud dependencies.

0 favorites 0 likes
#voice-cloning

@billtheinvestor: Shanghai Jiao Tong University open-sources F5-TTS speech generation model. The model is trained on 100,000 hours of data and supports bilingual synthesis in Chinese and English. Technical features include zero-shot voice cloning, total-duration-based speed control, emotion expression control, and long text synthesis. Commercial use is allowed.

X AI KOLs Timeline · yesterday Cached

Shanghai Jiao Tong University has open-sourced the F5-TTS speech generation model, trained on 100,000 hours of data, supporting bilingual synthesis in Chinese and English and zero-shot voice cloning, and allowing commercial use.

1 favorites 1 likes
#voice-cloning

Nostalgia for just 3 years ago…

Reddit r/LocalLLaMA · 2026-04-22

A personal reflection on the rapid evolution of AI over the past three years, from early ChatGPT and GPT-4 quotas to BabyAGI, DALL·E, and voice cloning.

0 favorites 0 likes
#voice-cloning

k2-fsa/OmniVoice

Hugging Face Models Trending · 2026-03-30 Cached

OmniVoice is a massively multilingual zero-shot text-to-speech model supporting over 600 languages, built on a diffusion language model architecture with fast inference and voice cloning capabilities.

0 favorites 0 likes
← Back to home

Submit Feedback