@GitTrend0x: Claude Code 现在也可以编辑视频了! 这个 SKILL 100% 免费且开源。 ✓ 自动创建动画 ✓ 生成不同风格的字幕 ✓ 消除静音、错误和填充词 https://github.com/browser-use/video-us…
摘要
video-use is a 100% open-source tool that lets Claude Code edit videos automatically by removing filler words, adding subtitles, color grading, and generating animations. It integrates with Claude Code via a skill and uses ElevenLabs for transcription.
查看缓存全文
缓存时间: 2026/06/27 17:58
Claude Code 现在也可以编辑视频了! 这个 SKILL 100% 免费且开源。
✓ 自动创建动画 ✓ 生成不同风格的字幕 ✓ 消除静音、错误和填充词
https://t.co/JPE1gNwAu5 https://t.co/WVaMtphtXo
browser-use/video-use
Source: https://github.com/browser-use/video-use
video-use
Introducing video-use — edit videos with Claude Code. 100% open source.
Drop raw footage in a folder, chat with Claude Code, get final.mp4 back. Works for any content — talking heads, montages, tutorials, travel, interviews — without presets or menus.
What it does
- Cuts out filler words (
umm,uh, false starts) and dead space between takes - Auto color grades every segment (warm cinematic, neutral punch, or any custom ffmpeg chain)
- 30ms audio fades at every cut so you never hear a pop
- Burns subtitles in your style — 2-word UPPERCASE chunks by default, fully customizable
- Generates animation overlays via HyperFrames, Remotion, Manim, or PIL — spawned in parallel sub-agents, one per animation
- Self-evaluates the rendered output at every cut boundary before showing you anything
- Persists session memory in
project.mdso next week’s session picks up where you left off
Setup prompt
Paste into Claude Code, Codex, Hermes, Openclaw, or any agent with shell access:
Set up https://github.com/browser-use/video-use for me.
Read install.md first to install this repo, wire up ffmpeg, register the skill with whichever agent you're running under, and set up the ElevenLabs API key — ask me to paste it when you need it. Then read SKILL.md for daily usage, and always read helpers/ because that's where the editing scripts live. After install, don't transcribe anything on your own — just tell me it's ready and wait for me to drop footage into a folder.
The agent handles the clone, dependencies, skill registration, and prompts you once for your ElevenLabs API key (grab one at elevenlabs.io/app/settings/api-keys).
Then point your agent at a folder of raw takes:
cd /path/to/your/videos
claude # or codex, hermes, etc.
For always-on editing from your own VPS or Telegram, run the agent through Browser Use Box. Watch the 15-second demo.
And in the session:
edit these into a launch video
It inventories the sources, proposes a strategy, waits for your OK, then produces edit/final.mp4 next to your sources. All outputs live in <videos_dir>/edit/ — the skill directory stays clean.
Manual install
If you’d rather do it by hand:
# 1. Clone and symlink into your agent's skills directory
git clone https://github.com/browser-use/video-use ~/Developer/video-use
ln -sfn ~/Developer/video-use ~/.claude/skills/video-use # Claude Code
# ln -sfn ~/Developer/video-use ~/.codex/skills/video-use # Codex
# 2. Install deps
cd ~/Developer/video-use
uv sync # or: pip install -e .
brew install ffmpeg # required
brew install yt-dlp # optional, for downloading online sources
# 3. Add your ElevenLabs API key
cp .env.example .env
$EDITOR .env # ELEVENLABS_API_KEY=...
How it works
The LLM never watches the video. It reads it — through two layers that together give it everything it needs to cut with word-boundary precision.
Layer 1 — Audio transcript (always loaded). One ElevenLabs Scribe call per source gives word-level timestamps, speaker diarization, and audio events ((laughter), (applause), (sigh)). All takes pack into a single ~12KB takes_packed.md — the LLM’s primary reading view.
## C0103 (duration: 43.0s, 8 phrases)
[002.52-005.36] S0 Ninety percent of what a web agent does is completely wasted.
[006.08-006.74] S0 We fixed this.
Layer 2 — Visual composite (on demand). timeline_view produces a filmstrip + waveform + word labels PNG for any time range. Called only at decision points — ambiguous pauses, retake comparisons, cut-point sanity checks.
Naive approach: 30,000 frames × 1,500 tokens = 45M tokens of noise. Video Use: 12KB text + a handful of PNGs.
Same idea as browser-use giving an LLM a structured DOM instead of a screenshot — but for video.
Pipeline
Transcribe ──> Pack ──> LLM Reasons ──> EDL ──> Render ──> Self-Eval
│
└─ issue? fix + re-render (max 3)
The self-eval loop runs timeline_view on the rendered output at every cut boundary — catches visual jumps, audio pops, hidden subtitles. You see the preview only after it passes.
Design principles
- Text + on-demand visuals. No frame-dumping. The transcript is the surface.
- Audio is primary, visuals follow. Cuts come from speech boundaries and silence gaps.
- Ask → confirm → execute → self-eval → persist. Never touch the cut without strategy approval.
- Zero assumptions about content type. Look, ask, then edit.
- 12 hard rules, artistic freedom elsewhere. Production-correctness is non-negotiable. Taste isn’t.
See SKILL.md for the full production rules and editing craft.
相似文章
@0xluffy_eth: 有人为Claude Code开发了免费视频编辑工具...太疯狂了。 只需把原始素材和资源放入文件夹。 就这样。 它会处理一切: - 剪辑片段 - 移除冗余词 - 添加字幕 - 应用色彩分级和滤镜 - 处理动画 - 渲染最终视频 无时间线。…
A free, open-source video editing tool built for Claude Code that fully automates editing from raw footage—clipping, filler word removal, subtitles, color grading, animation, and final rendering—all without a timeline or manual edits.
@Easycompany333: 整理了 6 个可以直接试的视频类 Claude Skills: 1. HyperFrames 一句话生成动效视频,文章、推文、产品介绍都能变成 MP4。 适合产品宣发、教程开场、社交短视频。 https://github.com/heyg…
整理了6个可直接使用的视频类Claude Skills,涵盖自动生成动效视频、AI辅助粗剪、React组件渲染视频、多媒体生成工具箱、中文剪辑Agent和视频提示词编写等开源工具。
@Mng64218162: 你可以免费在本地完成。Claude自行编写动画HTML,免费的Edge TTS处理语音,ffmpeg渲染…
一项免费开源的AI技能,可本地生成完整动画并配有旁白的解说视频,使用Claude生成动画代码,Edge TTS生成语音,ffmpeg进行渲染——无需订阅或API密钥。
@499317906: ai-video-generator-claude(即梦视频 skill · 起号/无脸号专用) 一个跑 28 万粉账号的老外,把自己在用的 10 个即梦视频 skill 开源了,`git clone` 进 `~/.claude/skil…
一个拥有28万粉的账号运营者开源了10个用于Claude的AI视频生成技能,可生成精美的视频提示,直接粘贴到Higgsfield的Seedance 2.0中使用。
@XAMTO_AI: 有个开源项目叫 narrator-ai-cli-skill,塞进 Claude Code 这类 Agent 里,你就说一句“帮我做《肖申克的救赎》解说视频”,AI 自己把所有活全包了: 自动生成解说脚本,不用你动脑 精准匹配对应电影片段,…
介绍一个名为 narrator-ai-cli-skill 的开源项目,它可以集成到 Claude Code 等 AI Agent 中,让用户只需一句话就能自动完成电影解说视频的脚本、配音、剪辑、配乐等全流程制作,极大地降低了影视解说号的制作成本。