@aigclink: An open-source end-to-end video translation + video Q&A Skill: violin. The highlight is not just literal translation, but the idea of content re-creation. It integrates ASR, LLM translation, and TTS into a seamless pipeline video Skill. The three modules are automatically chained: input a video and get a dubbed translated video. Translation style is adjustable, for example...
Summary
Violin is an open-source end-to-end video translation and video Q&A tool, integrating ASR, LLM translation, and TTS. It supports style adjustment and content re-creation, and can answer questions about video content.
Similar Articles
@KevinQHLin: IntroducingViolin — an Open-source Video Translation Skill. Video is the dominant medium on the internet, yet most high…
Violin is an open-source video translation skill that combines speech recognition, LLM translation, and speech synthesis into a seamless pipeline, supporting multilingual ASR, personalized translation, and interactive chat with video content.
@berryxia: Guys, this is awesome! Install it right away! Kevin Lin, postdoc at Oxford, former Meta and Microsoft researcher, just released Violin, an open-source video translation Skill. Video is already the absolute dominant content form on the internet. Yet most high-quality lectures, speeches, and podcasts are locked by a single language…
Violin is an open-source video translation tool that integrates speech recognition, large language model translation, and text-to-speech. It supports over 30 languages and offers three usage modes: CLI, web app, and Claude Code.
@Russell3402: Alibaba International's open-source AI-powered fully automated short video engine, Pixelle-Video. Simply input a topic, and it automatically generates a complete short video. From copywriting and voiceovers to image selection and editing, everything is handled by AI. GitHub:
Alibaba International has open-sourced the AI-powered fully automated short video engine Pixelle-Video, allowing users to generate complete short videos—including copywriting, voiceovers, images, and editing—by simply inputting a topic.
@shachepi: 天下苦沉浸式翻译久矣。 除了昨天的陪读蛙,KISS Translator 也是个顶级平替。 纯粹,完全开源。界面清爽。除了网页翻译,它同样自建接口支持非常全(Claude、Gemini 等各类AI都能接)。不想被商业插件割韭菜,用这种自接…
天下苦沉浸式翻译久矣。 除了昨天的陪读蛙,KISS Translator 也是个顶级平替。 纯粹,完全开源。界面清爽。除了网页翻译,它同样自建接口支持非常全(Claude、Gemini 等各类AI都能接)。不想被商业插件割韭菜,用这种自接 API 的最稳,加载也快,效果还好! 还没找到顺手平替的,再试试这个。 项目地址:https://github点com/fishjar/kiss-translator
@VincentLogic: Now this is what real Harness Engineering looks like! A clear breakdown of the full article-to-video pipeline: article -> script -> web development -> voice recording -> screen capture. Skip the Sora hype; coding webpages for video generation offers much better control and is completely open source.
This post outlines a complete open-source text-to-video workflow spanning script generation, frontend development, voiceover recording, and screen capture, highlighting how a code-driven approach delivers superior control and higher content production efficiency.