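@GitTrend0x: Claude Code can now edit video too! This SKILL is 100% free and open source. ✓ Creates animations automatically ✓ Generates subtitles in different styles ✓ Removes silence, mistakes, and filler words https://github.com/browser-use/video-us…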

X AI KOLs Timeline 2026/06/27 10:30 Tools

video-editing open-source claude-code ai-tool ffmpeg subtitles animation

Summary

video-use is a 100% open-source tool that lets Claude Code edit videos automatically by removing filler words, adding subtitles, color grading, and generating animations. It integrates with Claude Code via a skill and uses ElevenLabs for transcription.


Cached at: 2026/06/27 17:58

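Claude Code can now edit video too! This SKILL is 100% free and open source.

✓ Creates animations automatically ✓ Generates subtitles in different styles ✓ Removes silence, mistakes, and filler words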

https://t.co/JPE1gNwAu5 https://t.co/WVaMtphtXo

browser-use/video-use

Source: https://github.com/browser-use/video-use

video-use

Introducing video-use — edit videos with Claude Code. 100% open source.

Drop raw footage in a folder, chat with Claude Code, get final.mp4 back. Works for any content — talking heads, montages, tutorials, travel, interviews — without presets or menus.

What it does

Cuts out filler words (umm, uh), false starts, and dead space between takes
Auto color grades every segment (warm cinematic, neutral punch, or any custom ffmpeg chain)
30ms audio fades at every cut so you never hear a pop
Burns subtitles in your style — 2-word UPPERCASE chunks by default, fully customizable
Generates animation overlays via HyperFrames, Remotion, Manim, or PIL — spawned in parallel sub-agents, one per animation
Self-evaluates the rendered output at every cut boundary before showing you anything
Persists session memory in project.md so next week’s session picks up where you left off

Setup prompt

Paste into Claude Code, Codex, Hermes, Openclaw, or any agent with shell access:

Set up https://github.com/browser-use/video-use for me.

Read install.md first to install this repo, wire up ffmpeg, register the skill with whichever agent you're running under, and set up the ElevenLabs API key — ask me to paste it when you need it. Then read SKILL.md for daily usage, and always read helpers/ because that's where the editing scripts live. After install, don't transcribe anything on your own — just tell me it's ready and wait for me to drop footage into a folder.

The agent handles the clone, dependencies, skill registration, and prompts you once for your ElevenLabs API key (grab one at elevenlabs.io/app/settings/api-keys).

Then point your agent at a folder of raw takes:

cd /path/to/your/videos
claude    # or codex, hermes, etc.

For always-on editing from your own VPS or Telegram, run the agent through Browser Use Box. Watch the 15-second demo.

And in the session:

edit these into a launch video

It inventories the sources, proposes a strategy, waits for your OK, then produces edit/final.mp4 next to your sources. All outputs live in <videos_dir>/edit/ — the skill directory stays clean.

Manual install

If you’d rather do it by hand:

# 1. Clone and symlink into your agent's skills directory
git clone https://github.com/browser-use/video-use ~/Developer/video-use
ln -sfn ~/Developer/video-use ~/.claude/skills/video-use        # Claude Code
# ln -sfn ~/Developer/video-use ~/.codex/skills/video-use       # Codex

# 2. Install deps
cd ~/Developer/video-use
uv sync                         # or: pip install -e .
brew install ffmpeg             # required (Homebrew shown; use your platform's package manager otherwise)
brew install yt-dlp             # optional, for downloading online sources

# 3. Add your ElevenLabs API key
cp .env.example .env
$EDITOR .env                    # ELEVENLABS_API_KEY=...

How it works

The LLM never watches the video. It reads it — through two layers that together give it everything it needs to cut with word-boundary precision.

[Figure: timeline_view composite showing filmstrip, speaker track, waveform, word labels, and silence-gap cut candidates]

Layer 1 — Audio transcript (always loaded). One ElevenLabs Scribe call per source gives word-level timestamps, speaker diarization, and audio events ((laughter), (applause), (sigh)). All takes pack into a single ~12KB takes_packed.md — the LLM’s primary reading view.

## C0103  (duration: 43.0s, 8 phrases)
  [002.52-005.36] S0 Ninety percent of what a web agent does is completely wasted.
  [006.08-006.74] S0 We fixed this.
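To show why this packed view is easy for a program (or an LLM) to reason over, here is a small Python sketch that parses the excerpt above into structured phrases. The line format is inferred only from that two-line sample, so the regular expressions and the `parse_packed` name are assumptions, not the repo's actual parser.

```python
import re

# Patterns inferred from the sample excerpt only; the real file may differ.
HEADER = re.compile(r"^## (\S+)\s+\(duration: ([\d.]+)s, (\d+) phrases\)$")
PHRASE = re.compile(r"^\s*\[([\d.]+)-([\d.]+)\] (S\d+) (.*)$")


def parse_packed(text):
    """Parse packed-transcript text into {take_id: {"duration", "phrases"}}."""
    takes = {}
    current = None
    for line in text.splitlines():
        header = HEADER.match(line)
        if header:
            current = header.group(1)
            takes[current] = {"duration": float(header.group(2)), "phrases": []}
            continue
        phrase = PHRASE.match(line)
        if phrase and current is not None:
            takes[current]["phrases"].append({
                "start": float(phrase.group(1)),
                "end": float(phrase.group(2)),
                "speaker": phrase.group(3),
                "text": phrase.group(4),
            })
    return takes


sample = """## C0103  (duration: 43.0s, 8 phrases)
  [002.52-005.36] S0 Ninety percent of what a web agent does is completely wasted.
  [006.08-006.74] S0 We fixed this.
"""
takes = parse_packed(sample)
print(takes["C0103"]["phrases"][1])
```

Every phrase carries its own start and end time, so a cut decision can be expressed as plain timestamps without looking at a single frame.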

Layer 2 — Visual composite (on demand). timeline_view produces a filmstrip + waveform + word labels PNG for any time range. Called only at decision points — ambiguous pauses, retake comparisons, cut-point sanity checks.

Naive approach: 30,000 frames × 1,500 tokens = 45M tokens of noise. Video Use: 12KB text + a handful of PNGs.
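The arithmetic behind that comparison can be checked in a few lines of Python. The frame count and per-frame token cost come from the text above; the bytes-per-token figure is a rough rule of thumb assumed here for illustration, and the handful of PNGs is left out of the estimate.

```python
# Figures quoted in the text above.
frames = 30_000
tokens_per_frame = 1_500
naive_tokens = frames * tokens_per_frame

# Assumption (not from the source): about 4 bytes of text per token.
BYTES_PER_TOKEN = 4
packed_bytes = 12 * 1024
packed_tokens = packed_bytes // BYTES_PER_TOKEN

print(f"naive:  {naive_tokens:,} tokens")
print(f"packed: about {packed_tokens:,} tokens (text only)")
print(f"ratio:  about {naive_tokens / packed_tokens:,.0f}x")
```

Under that assumption the text view is on the order of ten thousand times smaller than dumping every frame, before counting the on-demand PNGs.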

Same idea as browser-use giving an LLM a structured DOM instead of a screenshot — but for video.

Pipeline

Transcribe ──> Pack ──> LLM Reasons ──> EDL ──> Render ──> Self-Eval
                                                              │
                                                              └─ issue? fix + re-render (max 3)

The self-eval loop runs timeline_view on the rendered output at every cut boundary — catches visual jumps, audio pops, hidden subtitles. You see the preview only after it passes.
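The control flow of that loop can be sketched in Python as follows. This is only one reading of "max 3" (at most three re-renders after the first render); `render`, `evaluate`, and `fix` are hypothetical callables standing in for the real helpers, and the dictionary-based EDL is invented for the demo.

```python
def render_with_self_eval(edl, render, evaluate, fix, max_rerenders=3):
    """Render, inspect the result, and re-render until clean or the cap is hit.

    Returns (output, number_of_rerenders). Raises if issues remain after
    `max_rerenders` re-renders.
    """
    output = render(edl)
    rerenders = 0
    while True:
        issues = evaluate(output)
        if not issues:
            return output, rerenders
        if rerenders == max_rerenders:
            raise RuntimeError(f"issues remain after {max_rerenders} re-renders: {issues}")
        edl = fix(edl, issues)
        output = render(edl)
        rerenders += 1


# Stub helpers for the demo: a cut is flagged if it has no audio fade.
def render(edl):
    return {"cuts": [dict(cut) for cut in edl]}


def evaluate(output):
    return [i for i, cut in enumerate(output["cuts"]) if not cut["fade"]]


def fix(edl, issues):
    return [dict(cut, fade=True) if i in issues else cut for i, cut in enumerate(edl)]


edl = [{"at": 5.36, "fade": False}, {"at": 6.74, "fade": True}]
final, rerenders = render_with_self_eval(edl, render, evaluate, fix)
print(rerenders, final)
```

Capping the retries keeps a stubborn defect from looping forever; what the real skill does when the cap is reached is not stated in the source.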

Design principles

Text + on-demand visuals. No frame-dumping. The transcript is the surface.
Audio is primary, visuals follow. Cuts come from speech boundaries and silence gaps.
Ask → confirm → execute → self-eval → persist. Never touch the cut without strategy approval.
Zero assumptions about content type. Look, ask, then edit.
12 hard rules, artistic freedom elsewhere. Production-correctness is non-negotiable. Taste isn’t.
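The "audio is primary" principle can be illustrated with a short Python sketch that derives cut candidates from silence gaps between word timestamps. The tuple layout, the `silence_gaps` name, and the 0.4-second threshold are assumptions made for this example; the source does not say how the skill picks its thresholds.

```python
def silence_gaps(words, min_gap=0.4):
    """Return (gap_start, gap_end) pairs between consecutive words.

    `words` is a list of (text, start, end) tuples in seconds, sorted by time.
    A gap qualifies when the pause between two words is at least `min_gap`.
    """
    gaps = []
    for (_, _, prev_end), (_, next_start, _) in zip(words, words[1:]):
        if next_start - prev_end >= min_gap:
            gaps.append((prev_end, next_start))
    return gaps


# Illustrative word timings, loosely based on the phrase boundaries shown earlier.
words = [
    ("wasted.", 5.00, 5.36),
    ("We", 6.08, 6.30),
    ("fixed", 6.30, 6.55),
]
gaps = silence_gaps(words)
print(gaps)
```

Each returned pair marks a stretch with no speech, which is where a cut can land without clipping a word.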

See SKILL.md for the full production rules and editing craft.

