@0xluffy_eth: 有人为Claude Code开发了免费视频编辑工具...太疯狂了。 只需把原始素材和资源放入文件夹。 就这样。 它会处理一切: - 剪辑片段 - 移除冗余词 - 添加字幕 - 应用色彩分级和滤镜 - 处理动画 - 渲染最终视频 无时间线。…
摘要
A free, open-source video editing tool built for Claude Code that fully automates editing from raw footage—clipping, filler word removal, subtitles, color grading, animation, and final rendering—all without a timeline or manual edits.
查看缓存全文
缓存时间: 2026/05/08 15:35
有人为Claude Code开发了免费视频编辑工具…太疯狂了。 只需把原始素材和资源放入文件夹。 就这样。 它会处理一切: - 剪辑片段 - 移除冗余词 - 添加字幕 - 应用色彩分级和滤镜 - 处理动画 - 渲染最终视频 无时间线。无手动编辑。无往复修改。 说实话,这超越了Remotion等工具。 它不仅帮助你制作视频…它替你做编辑。 该项目在GitHub上开源。 https://github.com/browser-use/video-use…
browser-use/video-use
Source: https://github.com/browser-use/video-use
video-use
Introducing video-use — edit videos with Claude Code. 100% open source.
Drop raw footage in a folder, chat with Claude Code, get final.mp4 back. Works for any content — talking heads, montages, tutorials, travel, interviews — without presets or menus.
What it does
- Cuts out filler words (
umm,uh, false starts) and dead space between takes - Auto color grades every segment (warm cinematic, neutral punch, or any custom ffmpeg chain)
- 30ms audio fades at every cut so you never hear a pop
- Burns subtitles in your style — 2-word UPPERCASE chunks by default, fully customizable
- Generates animation overlays via HyperFrames, Remotion, Manim, or PIL — spawned in parallel sub-agents, one per animation
- Self-evaluates the rendered output at every cut boundary before showing you anything
- Persists session memory in
project.mdso next week’s session picks up where you left off
Setup prompt
Paste into Claude Code, Codex, Hermes, Openclaw, or any agent with shell access:
Set up https://github.com/browser-use/video-use for me.
Read install.md first to install this repo, wire up ffmpeg, register the skill with whichever agent you're running under, and set up the ElevenLabs API key — ask me to paste it when you need it. Then read SKILL.md for daily usage, and always read helpers/ because that's where the editing scripts live. After install, don't transcribe anything on your own — just tell me it's ready and wait for me to drop footage into a folder.
The agent handles the clone, dependencies, skill registration, and prompts you once for your ElevenLabs API key (grab one at elevenlabs.io/app/settings/api-keys).
Then point your agent at a folder of raw takes:
cd /path/to/your/videos
claude # or codex, hermes, etc.
And in the session:
edit these into a launch video
It inventories the sources, proposes a strategy, waits for your OK, then produces edit/final.mp4 next to your sources. All outputs live in <videos_dir>/edit/ — the skill directory stays clean.
Manual install
If you’d rather do it by hand:
# 1. Clone and symlink into your agent's skills directory
git clone https://github.com/browser-use/video-use ~/Developer/video-use
ln -sfn ~/Developer/video-use ~/.claude/skills/video-use # Claude Code
# ln -sfn ~/Developer/video-use ~/.codex/skills/video-use # Codex
# 2. Install deps
cd ~/Developer/video-use
uv sync # or: pip install -e .
brew install ffmpeg # required
brew install yt-dlp # optional, for downloading online sources
# 3. Add your ElevenLabs API key
cp .env.example .env
$EDITOR .env # ELEVENLABS_API_KEY=...
How it works
The LLM never watches the video. It reads it — through two layers that together give it everything it needs to cut with word-boundary precision.
Layer 1 — Audio transcript (always loaded). One ElevenLabs Scribe call per source gives word-level timestamps, speaker diarization, and audio events ((laughter), (applause), (sigh)). All takes pack into a single ~12KB takes_packed.md — the LLM’s primary reading view.
## C0103 (duration: 43.0s, 8 phrases)
[002.52-005.36] S0 Ninety percent of what a web agent does is completely wasted.
[006.08-006.74] S0 We fixed this.
Layer 2 — Visual composite (on demand). timeline_view produces a filmstrip + waveform + word labels PNG for any time range. Called only at decision points — ambiguous pauses, retake comparisons, cut-point sanity checks.
Naive approach: 30,000 frames × 1,500 tokens = 45M tokens of noise. Video Use: 12KB text + a handful of PNGs.
Same idea as browser-use giving an LLM a structured DOM instead of a screenshot — but for video.
Pipeline
Transcribe ──> Pack ──> LLM Reasons ──> EDL ──> Render ──> Self-Eval
│
└─ issue? fix + re-render (max 3)
The self-eval loop runs timeline_view on the rendered output at every cut boundary — catches visual jumps, audio pops, hidden subtitles. You see the preview only after it passes.
Design principles
- Text + on-demand visuals. No frame-dumping. The transcript is the surface.
- Audio is primary, visuals follow. Cuts come from speech boundaries and silence gaps.
- Ask → confirm → execute → self-eval → persist. Never touch the cut without strategy approval.
- Zero assumptions about content type. Look, ask, then edit.
- 12 hard rules, artistic freedom elsewhere. Production-correctness is non-negotiable. Taste isn’t.
See SKILL.md for the full production rules and editing craft.
相似文章
@Easycompany333: 整理了 6 个可以直接试的视频类 Claude Skills: 1. HyperFrames 一句话生成动效视频,文章、推文、产品介绍都能变成 MP4。 适合产品宣发、教程开场、社交短视频。 https://github.com/heyg…
整理了6个可直接使用的视频类Claude Skills,涵盖自动生成动效视频、AI辅助粗剪、React组件渲染视频、多媒体生成工具箱、中文剪辑Agent和视频提示词编写等开源工具。
@Smartpigai: 每次有人问我“做内容 / 视频 / 素材管理用啥工具”,我都懒得再解释了,直接一次性整理好,你自己存 1、视频剪辑(用代码做视频) https://github.com/remotion-dev/remotion… 2、语音转字幕 / 会…
一个帖子整理了多个用于内容创作的开源工具,包括视频剪辑、语音转字幕、AI绘图、媒体处理等,强调免费开源且可拼成自己的系统。
@hank_aibtc: Palmier Pro:让 Claude / Cursor 直接在时间线里剪视频的开源宝物来了! Mac 原生 AI 视频编辑器,完全开源(核心编辑器 + MCP 服务开源,GPLv3),把 AI 代理从只会聊天升级成真能上手剪片。 核心…
Palmier Pro 是一款开源的 Mac 原生 AI 视频编辑器,支持通过 MCP 连接 Claude、Cursor 等 AI 代理直接在时间线上进行剪辑、重组和素材整理,并可导出到专业软件。核心编辑器及 MCP 服务已开源(GPLv3),免费核心版可用。
@leeoxiang: 尝试了下用 claude code 来剪口播视频,效果比春节当时的实验好了很多,真的可以做到一键剪出不错的口播视频。 使用 claude code + hyperframes.
尝试用 Claude Code 和 Hyperframes 剪口播视频,效果比春节时更好,可以一键完成。
@Saccc_c: 卧槽,对不起兄弟们,Claude Code+ Hyperframe 才是剪辑的最佳实践 事情的起因是,我最近在寻找剪辑更通用的提示词,希望是一次就可以剪出很好的视频 心血来潮,我就用相同提示词(Tesla宣传片)同时给了 cc 和 cod…
An article comparing Claude Code and Codex for video editing tasks, claiming Claude Code with Hyperframe is superior for creating video edits like a Tesla commercial.