@vikingmute: 发现这个 Ponytail 和 Codex 是绝配了https://github.com/DietrichGebert/ponytail… GPT 写兜底代码上瘾了,没有明确指示总是写一大堆防御性代码,看的想哭。 它的核心理念是 “The…
摘要
Ponytail是一个AI代理技能,通过强制代理先检查是否需要写新代码来大幅减少过度工程,声称能减少80-94%的代码量和42-75%的成本。作者推荐与Codex搭配使用,并已在GitHub上开源。
查看缓存全文
缓存时间: 2026/06/17 16:02
发现这个 Ponytail 和 Codex 是绝配了https://github.com/DietrichGebert/ponytail…
GPT 写兜底代码上瘾了,没有明确指示总是写一大堆防御性代码,看的想哭。
它的核心理念是 “The best code is the code you never wrote.”通过强制 agent 先检查是否真的需要写新代码,来大幅减少 over-engineering。
官方数据说:80-94% less code, 42-75% less cost, and 3-6× faster than a no-skill agent,
今天安装上试试看是否好用,stars 数已经很多了。
DietrichGebert/ponytail
Source: https://github.com/DietrichGebert/ponytail
Ponytail
He says nothing. He writes one line. It works.
80-94% less code · 3-6× faster · 42-75% cheaper
Per-task code, latency, and cost on the Claude API, not your plan's quota. Median across Haiku, Sonnet, and Opus (10 runs for code and latency, 30 for the re-verified cost). Results vary by model and prompt: the ruleset re-injects each turn, so on a short prompt or a terse reasoning model that overhead can outweigh the savings. Reproduce it yourself.
You know him. Long ponytail. Oval glasses. Has been at the company longer than the version control. You show him fifty lines; he looks at them, says nothing, and replaces them with one.
Ponytail puts him inside your AI agent.
Before / after
You ask for a date picker. Your agent installs flatpickr, writes a wrapper component, adds a stylesheet, and starts a discussion about timezones.
With ponytail:
<!-- ponytail: browser has one -->
<input type="date">
More survivors in examples/.
Numbers
Five everyday tasks (email validator, debounce, CSV sum, countdown timer, rate limiter), three models, three arms: no skill, the caveman skill, and ponytail. Ten runs per cell, median reported.
80-94% less code, 42-75% less cost, and 3-6× faster than a no-skill agent, on every Claude model. Every shortcut ponytail takes is marked in the code with a ponytail: comment naming its upgrade path. Reproduce it yourself: npx promptfoo eval -c benchmarks/promptfooconfig.yaml. Method and raw numbers: benchmarks/. Production-grade tasks, where an unconstrained agent bloats far more, are written up in benchmarks/results/.
That is the byproduct, not the pitch. These are Claude numbers, and they vary by model. Capable instruction-following models follow the ladder and write far less, cheaper and faster. Terse reasoning models can go the other way: the ladder is a deliberation step, so the model spends thinking tokens working through the rungs before it saves any output, and together with the always-on ruleset that can cost more than the shorter code saves. On GPT-5.5 it does. And all of this is single-shot, one prompt in and one answer out: a real agent session re-injects the ruleset and runs the ladder every turn, which this benchmark does not measure, so per-session cost can land either way. The rule was never “fewest tokens.” It is: write only what the task needs, and never cut validation, error handling, security, or accessibility. The code ends up small because it is necessary, not golfed, and that is the part that stays maintainable. Lower cost and latency are a side effect on the models that follow it.
How it works
Before writing code, the agent stops at the first rung that holds:
1. Does this need to exist? → no: skip it (YAGNI)
2. Stdlib does it? → use it
3. Native platform feature? → use it
4. Installed dependency? → use it
5. One line? → one line
6. Only then: the minimum that works
Lazy, not negligent: trust-boundary validation, data-loss handling, security, and accessibility are never on the chopping block.
Install
The most effort ponytail will ever ask of you:
The Claude Code and Codex plugins run two tiny Node.js lifecycle hooks, so node needs to be on your PATH (note for Nix/nvm users: it must be on the non-interactive shell’s PATH). If it isn’t, the skills still work, the always-on activation just stays quiet instead of erroring on every prompt.
Claude Code
/plugin marketplace add DietrichGebert/ponytail
/plugin install ponytail@ponytail
Codex
codex plugin marketplace add DietrichGebert/ponytail
codex
Open /plugins, select the Ponytail marketplace, and install Ponytail. Then
open /hooks, review and trust its two lifecycle hooks, and start a new thread.
This same install also covers the Codex desktop app: restart the app after installing and it picks up the plugin.
GitHub Copilot CLI
copilot plugin marketplace add DietrichGebert/ponytail
copilot plugin install ponytail@ponytail
In an interactive Copilot CLI session, use the slash equivalents:
/plugin marketplace add DietrichGebert/ponytail
/plugin install ponytail@ponytail
Copilot CLI namespaces plugin commands by plugin name. For example:
/ponytail:ponytail ultra
/ponytail:ponytail-review
Pi agent harness
pi install git:github.com/DietrichGebert/ponytail
OpenCode
Run OpenCode from a checkout of this repo (the plugin reuses its hooks/ and skills/), and add to opencode.json:
{ "plugin": ["./.opencode/plugins/ponytail.mjs"] }
Injects the ruleset every turn at the active level; adds the /ponytail commands (see Commands). OpenCode also auto-loads this repo’s AGENTS.md, so the rules hold even without the plugin. The plugin adds the lite/full/ultra/off levels.
The ./ path resolves against your project’s opencode.json; to share one checkout across projects, point it at the absolute path of the .mjs instead (it finds its hooks/ and skills/ relative to its own file).
Gemini CLI
gemini extensions install https://github.com/DietrichGebert/ponytail
Loads the ruleset as always-on context every session and registers the /ponytail commands; the skills/ ship too, activated when a task needs them.
Antigravity CLI
Google is renaming Gemini CLI to Antigravity CLI (the agy binary); the same extension installs there:
agy plugin install https://github.com/DietrichGebert/ponytail
It reuses this repo’s gemini-extension.json. One difference: Antigravity converts the /ponytail commands into skills, so you type them into the chat (e.g. /ponytail-review as a message) instead of picking them from a slash menu. Until the migration completes (around June 18, 2026), gemini extensions install still works too. To run it as an always-on rule instead, drop the ruleset into .agents/rules/.
OpenClaw
clawhub install ponytail
Installs ponytail as an OpenClaw skill from ClawHub; the review, audit, debt, and help skills install the same way (clawhub install ponytail-review, and so on). OpenClaw applies it on coding tasks and also exposes it as a /ponytail command. Without ClawHub, copy .openclaw/skills/ponytail into ~/.openclaw/skills/.
That was it. He’d be proud. He won’t say it.
Active every session, with a handful of commands (see Commands). /ponytail ultra exists for when the codebase has wronged you personally. Startup and mode-change text shows the current mode.
Set the level for every new session with the PONYTAIL_DEFAULT_MODE env var (lite/full/ultra/off), or a defaultMode field in ~/.config/ponytail/config.json (%APPDATA%\ponytail\config.json on Windows). The default is full.
Cursor, Windsurf, Cline, GitHub Copilot (editor), Aider, Kiro: copy the matching rules file from this repo (.cursor/rules/, .windsurf/rules/, .clinerules/, .github/copilot-instructions.md, AGENTS.md, .kiro/steering/).
Kiro: copy .kiro/steering/ponytail.md to ~/.kiro/steering/ (global) or .kiro/steering/ in your project.
GitHub Copilot CLI fallback (instruction-only mode): it reads AGENTS.md and .github/copilot-instructions.md in a project, or copy the rules into ~/.copilot/copilot-instructions.md to run ponytail in every project. This path keeps always-on guidance, but does not add plugin mode switches or hooks.
VS Code with the Codex extension reads AGENTS.md, which this repo ships, so it works from the repo root with no setup (~/.codex/AGENTS.md makes Codex global).
Which files map to which agent: Agent portability.
Commands
| Command | What it does |
|---|---|
/ponytail [lite | full | ultra | off] | Set the intensity, or turn it off. No argument reports the current level. |
/ponytail-review | Review the current diff for over-engineering, hands back a delete-list. |
/ponytail-audit | Audit the whole repo for over-engineering, not just the diff. |
/ponytail-debt | Harvest the ponytail: shortcuts you’ve deferred into a ledger, so “later” doesn’t become “never”. |
/ponytail-help | Quick reference for the commands above. |
Commands need a skill-capable host (Claude Code, Codex, OpenCode, Gemini, pi). In Codex they’re skills, invoke with @ (@ponytail-review). The instruction-only adapters (Cursor, Windsurf, Cline, Copilot, Kiro, Antigravity) load the always-on ruleset without the commands.
Development
When changing the compact rule text, keep the agent copies aligned:
node scripts/check-rule-copies.js
npm test
The OpenClaw skill package (.openclaw/skills/) is generated from skills/; rerun node scripts/build-openclaw-skills.js after changing a skill, the test suite fails if it is stale.
The correctness benchmark spawns Python for email and CSV checks; python3 is tried before python. CSV checks need pandas installed locally.
FAQ
Does it need a config file?
No. An optional ~/.config/ponytail/config.json or PONYTAIL_DEFAULT_MODE env var can set the default level, but nothing is required.
What if I really need the 120-line cache class? You don’t. Insist anyway and he’ll build it. Slowly. Correctly. While looking at you.
Does it scale? The code you never wrote scales infinitely. Zero bugs, zero CVEs, 100% uptime since forever.
Why “ponytail”? You know exactly why.
License
MIT. The shortest license that works.
相似文章
@Easycompany333: Open AI 曾官方点名的Codex 视频 值得收藏,不到 2 小时,把 Codex + GPT-5.5 这套组合讲得很实: - 技能怎么配。 - 插件怎么用。 - 自动化工作流怎么搭。 - 多任务怎么并行跑。 - 以及最关键的,怎么把…
推荐一个OpenAI官方点名的Codex视频教程,详细讲解如何配置技能、使用插件、搭建自动化工作流,将Codex变成持续工作的开发系统。
@AYi_AInotes: https://x.com/AYi_AInotes/status/2066865618104586525
这是一篇对OpenAI Codex的全景分析文章,详细介绍了其架构(五个入口)、三层可扩展能力(MCP、Skills、Plugins)、与Claude Code、Cursor、Devin的横评对比,以及七条可直接采纳的最佳实践。
@GitTrend0x: Claude Code 代码库智能大脑 27×省 Token 杀手级开源神器 https://github.com/repowise-dev/repowise… 这就是 Repowise,专为 AI 辅助工程团队打造的代码库智能平台!它把…
Repowise is an open-source tool that indexes codebases into four intelligence layers (dependency graph, git history, auto-documentation, architectural decisions) and exposes them via seven MCP tools to AI coding agents like Claude Code, achieving up to 27× token savings while maintaining answer quality.
@ai_super_niko: 1/5 用 Claude Code 写代码,切到 Codex 做 review,再切回来改。 每次切换都是一次思路谋杀。 你不是在写代码,你在疲于奔命。 直到 OpenAI 给竞争对手 Claude Code,出了一个官方插件:`code…
OpenAI 为竞争对手 Claude Code 推出了官方插件 codex-plugin-cc,用于在 Claude Code 中集成 Codex 进行代码审查,引发了关于工作流切换效率的讨论。
@billtheinvestor: 给 Claude Code 和 Codex 无限记忆,编程效率提升92%!Agentmemory 工具已在 GitHub 上火速获得4000+个星标,完全免费。 它通过智能压缩保存你编程会话的所有信息,未来会话中自动提取相关上下文,避免重…
Agentmemory 是一个开源工具,为 Claude Code 和 Codex 提供无限记忆功能,通过智能压缩减少 token 使用量,提升编程效率,已在 GitHub 获得 4000+ 星标。