@KevinQHLin: Introducing Violin: an Open-source Video Translation Skill. Video is the dominant medium on the internet, yet most high…

X AI KOLs Timeline 05/14/26, 08:31 PM Tools

video-translation open-source speech-recognition llm tts multilingual agent

Summary

Violin is an open-source video translation skill that combines speech recognition, LLM translation, and speech synthesis into a seamless pipeline, supporting multilingual ASR, personalized translation, and interactive chat with video content.

Introducing Violin: an Open-source Video Translation Skill.

Video is the dominant medium on the internet, yet most high-quality content (lectures, talks, podcasts) is locked behind a single language, leaving global audiences behind. So we built Violin: a video skill that combines speech recognition, LLM translation, and speech synthesis into one seamless pipeline.

Demo: https://violin-ai.com

Blog: https://together.ai/blog/violin-open-source-translation-skill…

GitHub: https://github.com/shang-zhu/violin…

Key Features:

High-quality multilingual ASR, translation, and TTS.

Personalize translation and voice (for example, turn an academic talk into something children can follow).

Chat with the video: ask any question and get answers grounded in the video.

Supports Web app, CLI, and Agent skill.

Fully open-source under MIT.

Built with the wonderful @ShangZhu18 and advised by @james_y_zou! All features powered by @togethercompute. Try it and let us know what you think!
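The three-stage pipeline described in the post (speech recognition, then LLM translation, then speech synthesis) can be pictured roughly as below. This is only a minimal sketch, not Violin's actual code: the `Segment` type, the `translate_video` function, and the three stub stages are hypothetical names introduced for illustration, and the stubs stand in for real ASR, LLM, and TTS calls so the example runs without any model or network access.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Segment:
    """One timed piece of speech (hypothetical type, not from Violin)."""
    start: float  # seconds
    end: float    # seconds
    text: str


# Hypothetical stage signatures, assumed for this sketch only.
Transcriber = Callable[[str], List[Segment]]  # video path -> timed transcript
Translator = Callable[[str, str, str], str]   # (text, target language, style) -> text
Synthesizer = Callable[[str, str], bytes]     # (text, target language) -> audio bytes


def translate_video(
    video_path: str,
    target_lang: str,
    style: str,
    transcribe: Transcriber,
    translate: Translator,
    synthesize: Synthesizer,
) -> List[Tuple[Segment, bytes]]:
    """Chain ASR -> translation -> TTS, keeping the original timestamps."""
    dubbed = []
    for seg in transcribe(video_path):
        translated = translate(seg.text, target_lang, style)
        audio = synthesize(translated, target_lang)
        dubbed.append((Segment(seg.start, seg.end, translated), audio))
    return dubbed


# Stub stages: placeholders for real ASR, LLM, and TTS backends.
def fake_transcribe(path: str) -> List[Segment]:
    return [
        Segment(0.0, 2.5, "hello world"),
        Segment(2.5, 5.0, "thanks for watching"),
    ]


def fake_translate(text: str, lang: str, style: str) -> str:
    return f"[{lang}/{style}] {text}"


def fake_synthesize(text: str, lang: str) -> bytes:
    return text.encode("utf-8")


result = translate_video(
    "talk.mp4", "es", "for children",
    fake_transcribe, fake_translate, fake_synthesize,
)
for seg, audio in result:
    print(f"{seg.start:.1f}-{seg.end:.1f}s: {seg.text} ({len(audio)} bytes)")
```

Passing the stages in as plain callables keeps the sketch independent of any particular model provider; the `style` argument is where a personalization instruction such as "for children" would plausibly enter the translation step.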

Original Article

Cached at: 05/15/26, 02:55 AM

Violin — Video Narrator

Source: https://www.violin-ai.com/

Vimeo, X/Twitter, and 1000+ sites · max 2 hours · YouTube may not work from cloud servers

Only use URLs you have rights to download — Creative Commons, public domain, or your own content.

Similar Articles

@berryxia: Guys, this is awesome! Install it right away! Kevin Lin, postdoc at Oxford, former Meta and Microsoft researcher, just released Violin, an open-source video translation Skill. Video is already the absolute dominant content form on the internet. Yet most high-quality lectures, speeches, and podcasts are locked by a single language…

X AI KOLs Timeline

Violin is an open-source video translation tool that integrates speech recognition, large language model translation, and text-to-speech. It supports over 30 languages and offers three usage modes: CLI, web app, and Claude Code.

@aigclink: An open-source end-to-end video translation + video Q&A Skill: violin. The highlight is not just literal translation, but the idea of content re-creation. It integrates ASR, LLM translation, and TTS into a seamless pipeline video Skill. The three modules are automatically chained: input a video and get a dubbed translated video. Translation style is adjustable, for example...

X AI KOLs Timeline

Violin is an open-source end-to-end video translation and video Q&A tool, integrating ASR, LLM translation, and TTS. It supports style adjustment and content re-creation, and can answer questions about video content.

@rwayne: Video translation has been cracked by a single Oxford postdoc. Kevin Lin, a postdoc at Oxford University, open-sourced Violin, a video translation tool that integrates speech recognition, LLM translation, and speech synthesis into an automated pipeline. It supports multilingual translation, personalized translation styles, and all-in-one video dialogue; it can turn academic reports into children's...

X AI KOLs Timeline

Kevin Lin, a postdoctoral fellow at Oxford University, open-sourced Violin, a video translation tool that integrates speech recognition, LLM translation, and speech synthesis into an automated pipeline. It supports multilingual translation and personalized styles, and provides three usage modes: Web, CLI, and Agent.

@XAMTO_AI: If you don't bookmark this open-source tool now, you'll regret it later — automatic video dubbing and translation, supports 33 languages at once, and can even answer questions about video content. Found a gem on GitHub called Violin, fully open-source, what it does is a bit unbelievable: you drop a video in, it automatically recognizes speech, …

X AI KOLs Timeline

Violin is an open-source automatic video dubbing and translation tool that supports 33 languages, integrates models like Whisper and DeepSeek, and provides one-click speech recognition, translation, dubbing synthesis, and in-video Q&A functionality.

@yhslgg: Bro, sharing another open-source video translation tool—pyVideoTrans, with 17,700 stars on GitHub, a must-have for video repurposing and localization! In a nutshell: drop a video in, and it automatically runs through the entire pipeline of speech recognition → subtitle translation → AI dubbing → video synthesis, outputting a complete video in another language. Core...

X AI KOLs Timeline

pyVideoTrans is an open-source video translation tool that supports automatic speech recognition, subtitle translation, AI dubbing, and video synthesis. It integrates multiple ASR, translation, and TTS engines, making it suitable for cross-language video production and localization.
