@GitTrend0x: Holy cow, guys! Run voice cloning and cinematic video dubbing locally, supporting 646 languages, fully offline, no API key, no internet needed. ElevenLabs is crushed! https://github.com/debpalash/OmniVoice-Studio… This open-source marvel is insane...
Summary
OmniVoice Studio is an open-source desktop app that enables local voice cloning and cinematic video dubbing across 646 languages, fully offline with no API keys, positioning itself as a privacy-focused alternative to ElevenLabs.
View Cached Full Text
Cached at: 05/14/26, 02:29 AM
OmniVoice Studio
The open-source ElevenLabs alternative.
Real-time dictation, zero-shot voice cloning, and cinematic video dubbing — all on your desktop. Open-source, no API keys, fully local. 646 languages.
Quickstart · Features · Why OmniVoice Studio? · TTS Engines · Contributing · Discord
🎙️ Voice Cloning
3-second clip → mirror any voice. 646 languages, zero-shot.
🎨 Voice Design
Gender, age, accent, pitch, speed, emotion, dialect — dial it in.
🎬 Video Dubbing
YouTube URL or file → transcribe → translate → re-voice → MP4.
⌨️ Dictation Widget
⌘+⇧+Space from any app. Transcribes, auto-pastes, disappears.
🔊 Vocal Isolation
Demucs-powered. Splits speech from music, keeps the background.
👥 Speaker Diarization
Pyannote + WhisperX. Auto-identifies who said what.
📦 Batch Queue
Drop 50 videos, walk away. Progress bars per job.
🤖 MCP Server
Use OmniVoice from Claude, Cursor, or any MCP client.
🛡️ AI Watermark
AudioSeal (Meta). Invisible, survives compression.
🔐 100% Local
No keys, no cloud, no accounts. Your machine only.
⚡ GPU Auto-Detect
CUDA · MPS · ROCm · CPU. ≤8 GB? Auto-offloads.
🧩 Extensible
Subclass TTSBackend, add any engine in ~50 lines.
🖥️ Desktop App
🐳 Docker
⚡ From Source
Similar Articles
@Fluyeporlaweb: ElevenLabs costs $700 a year. HeyGen another $700. Someone just posted the local dubbing study that eliminates both sub…
OmniVoice Studio is a free, open-source tool that locally dubs MP4 videos into 600 languages using Whisper for transcription, voice cloning from 3 seconds of audio, and Demucs for background separation, eliminating the need for paid subscriptions like ElevenLabs and HeyGen.
@noahduck283: A tool that can download any YouTube video, cleanly remove vocals, transcribe, translate into 100+ languages, clone the original voice, and perform fully automatic dubbing. It takes less than 2 minutes. 100% runs locally. Free. Sews six top open-source models into a web page for "one-click download, vocal removal, transcription, translation, dubbing"...
Voice-Pro is a web tool that integrates six top open-source models (Whisper, Demucs, CosyVoice, F5-TTS, etc.), supporting YouTube video downloading, vocal removal, transcription, translation, voice cloning, and fully automatic dubbing. It takes less than 2 minutes, runs 100% locally, and is free.
@FakeMaidenMaker: Explosive! This open-source project converts text to human-like voice for free, can clone anyone's voice, and adjust timbre with text! GitHub has garnered 30K stars, from Mianbao Intelligent OpenBMB, VoxCPM previously topped both GitHub and HuggingFace charts. Do...
VoxCPM2 is an open-source speech synthesis model from OpenBMB, using a tokenizer-free diffusion autoregressive architecture, supporting 30 languages, voice design, and controllable voice cloning. It can clone a voice with just one sentence, or create a brand new voice using text, outputting 48kHz high-quality audio, and is commercially usable.
@GoJun315: Open-source TTS that runs locally and beats ElevenLabs. Supertonic, a speech synthesis model that runs entirely on-device, no internet required, zero API costs. - Only 99M parameters, 167x faster than real-time on M4 Pro, runs on Raspberry Pi - Supports 31 languages, covering…
Supertonic is a lightning-fast, on-device TTS model with 99M parameters, supporting 31 languages. It runs locally with no API costs, outperforms cloud TTS on accuracy for numbers, phone numbers, and technical terms, and can be installed via Python, Node.js, Rust, Go, and more.
@yhslgg: Bro, sharing another open-source video translation tool—pyVideoTrans, with 17,700 stars on GitHub, a must-have for video repurposing and localization! In a nutshell: drop a video in, and it automatically runs through the entire pipeline of speech recognition → subtitle translation → AI dubbing → video synthesis, outputting a complete video in another language. Core...
pyVideoTrans is an open-source video translation tool that supports automatic speech recognition, subtitle translation, AI dubbing, and video synthesis. It integrates multiple ASR, translation, and TTS engines, making it suitable for cross-language video production and localization.