@GoSailGlobal: 一个韩国开发者发了个 Claude Code 插件叫 Harness,已经 3.6k stars 它解决的是这个时代最被忽略的问题:"我有 Claude Code,但我不知道怎么把它组织成一个团队。" 你说一句 "build a harn…
摘要
A Korean developer released Harness, a Claude Code plugin that automatically generates agent team definitions and skills based on a single prompt. It includes six team architecture patterns and reports impressive A/B test results (60% quality improvement, n=15).
查看缓存全文
缓存时间: 2026/06/02 05:56
一个韩国开发者发了个 Claude Code 插件叫 Harness,已经 3.6k stars
它解决的是这个时代最被忽略的问题:“我有 Claude Code,但我不知道怎么把它组织成一个团队。”
你说一句 “build a harness for this project”, 它自动生成:
一个完整的 agent 团队定义(.claude/agents/) 这些 agent 用的 skills(.claude/skills/) 6 种 team 架构模式之一(Pipeline / Fan-out / Expert Pool / Producer-Reviewer / Supervisor / Hierarchical Delegation)
但最让我意外的是它直接公开了 A/B 实验数据: · 15 个软件工程任务 · 平均质量分:49.5 → 79.3(+60%) · 胜率:15/15(100%) · 输出方差:-32% · 难度越高,提升越大(Basic +23.8 / Advanced +29.6 / Expert +36.2)
更让我意外的是它的学术诚实: 每次引用 +60% 这个数字,都老老实实附上 “n=15, author-measured, third-party replications pending”,AI 圈这种克制太罕见了
https://github.com/revfactory/harness…
revfactory/harness
Source: https://github.com/revfactory/harness
Harness — The Team-Architecture Factory for Claude Code
Harness is a team-architecture factory for Claude Code. Say “build a harness for this project” (English) or “하네스 구성해줘” (한국어) or “ハーネスを構成して” (日本語), and the plugin turns your domain description into an agent team and the skills they use — picked from six pre-defined team-architecture patterns.
Overview
Harness leverages Claude Code’s agent team system to decompose complex tasks into coordinated teams of specialized agents. Say “build a harness for this project” and it automatically generates agent definitions (.claude/agents/) and skills (.claude/skills/) tailored to your domain.
Category — Where Harness Sits
Harness lives at the L3 Meta-Factory layer of the Claude Code ecosystem — the layer that generates other harnesses rather than being one. Inside L3, we pick a specific sub-layer: Team-Architecture Factory.
| Layer | What it does | Neighbors we coexist with |
|---|---|---|
| L3 — Meta-Factory / Team-Architecture Factory (us) | Domain sentence → agent team + skills, via 6 pre-defined team patterns | — |
| L3 — Meta-Factory / Runtime-Configuration Factory | Deterministic, repeatable runtime configurations | coleam00/Archon |
| L3 — Meta-Factory / Codex Runtime Port | Same concept, Codex runtime | SaehwanPark/meta-harness |
| L2 — Cross-Harness Workflow | Standardize skills/rules/hooks across multiple harnesses | affaan-m/ECC |
Archon generates deterministic runtime configurations. Harness generates team architectures (pipeline, fan-out/fan-in, expert pool, producer-reviewer, supervisor, hierarchical delegation) plus the skills agents use. Different sub-layers of the same L3. Pick Archon for runtime determinism, Harness for team architecture, or combine them.
Star History
Key Features
- Agent Team Design — 6 architectural patterns: Pipeline, Fan-out/Fan-in, Expert Pool, Producer-Reviewer, Supervisor, and Hierarchical Delegation
- Skill Generation — Auto-generates skills with Progressive Disclosure for efficient context management
- Orchestration — Inter-agent data passing, error handling, and team coordination protocols
- Validation — Trigger verification, dry-run testing, and with-skill vs without-skill comparison tests
Workflow
Phase 1: Domain Analysis
↓
Phase 2: Team Architecture Design (Agent Teams vs Subagents)
↓
Phase 3: Agent Definition Generation (.claude/agents/)
↓
Phase 4: Skill Generation (.claude/skills/)
↓
Phase 5: Integration & Orchestration
↓
Phase 6: Validation & Testing
Installation
Via Marketplace
Add the marketplace
/plugin marketplace add revfactory/harness
Install the plugin
/plugin install harness@harness-marketplace
Direct Installation as Global Skill
# Copy the skills directory to ~/.claude/skills/harness/
cp -r skills/harness ~/.claude/skills/harness
Plugin Structure
harness/
├── .claude-plugin/
│ └── plugin.json # Plugin manifest
├── skills/
│ └── harness/
│ ├── SKILL.md # Main skill definition (6-Phase workflow)
│ └── references/
│ ├── agent-design-patterns.md # 6 architectural patterns
│ ├── orchestrator-template.md # Team/subagent orchestrator templates
│ ├── team-examples.md # 5 real-world team configurations
│ ├── skill-writing-guide.md # Skill authoring guide
│ ├── skill-testing-guide.md # Testing & evaluation methodology
│ └── qa-agent-guide.md # QA agent integration guide
└── README.md
Usage
Trigger in Claude Code with prompts like:
Build a harness for this project
Design an agent team for this domain
Set up a harness
Execution Modes
| Mode | Description | Recommended For |
|---|---|---|
| Agent Teams (default) | TeamCreate + SendMessage + TaskCreate | 2+ agents requiring collaboration |
| Subagents | Direct Agent tool invocation | One-off tasks, no inter-agent communication needed |
Architecture Patterns
| Pattern | Description |
|---|---|
| Pipeline | Sequential dependent tasks |
| Fan-out/Fan-in | Parallel independent tasks |
| Expert Pool | Context-dependent selective invocation |
| Producer-Reviewer | Generation followed by quality review |
| Supervisor | Central agent with dynamic task distribution |
| Hierarchical Delegation | Top-down recursive delegation |
Output
Files generated by Harness:
your-project/
├── .claude/
│ ├── agents/ # Agent definition files
│ │ ├── analyst.md
│ │ ├── builder.md
│ │ └── qa.md
│ └── skills/ # Skill files
│ ├── analyze/
│ │ └── SKILL.md
│ └── build/
│ ├── SKILL.md
│ └── references/
Use Cases — Try These Prompts
Copy any prompt below into Claude Code after installing Harness:
Deep Research
Build a harness for deep research. I need an agent team that can investigate
any topic from multiple angles — web search, academic sources, community
sentiment — then cross-validate findings and produce a comprehensive report.
Website Development
Build a harness for full-stack website development. The team should handle
design, frontend (React/Next.js), backend (API), and QA testing in a
coordinated pipeline from wireframe to deployment.
Webtoon / Comic Production
Build a harness for webtoon episode production. I need agents for story
writing, character design prompts, panel layout planning, and dialogue
editing. They should review each other's work for style consistency.
YouTube Content Planning
Build a harness for YouTube content creation. The team should research
trending topics, write scripts, optimize titles/tags for SEO, and plan
thumbnail concepts — all coordinated by a supervisor agent.
Code Review & Refactoring
Build a harness for comprehensive code review. I want parallel agents
checking architecture, security vulnerabilities, performance bottlenecks,
and code style — then merging all findings into a single report.
Technical Documentation
Build a harness that generates API documentation from this codebase.
Agents should analyze endpoints, write descriptions, generate usage
examples, and review for completeness.
Data Pipeline Design
Build a harness for designing data pipelines. I need agents for schema
design, ETL logic, data validation rules, and monitoring setup that
delegate sub-tasks hierarchically.
Marketing Campaign
Build a harness for marketing campaign creation. The team should research
the target market, write ad copy, design visual concepts, and set up
A/B test plans with iterative quality review.
Coexistence — Harness and Neighbors
Harness is not alone in the Claude Code / agent-framework ecosystem. The following repos live in adjacent layers; each is described in a parallel “X is …, Harness is …” form so you can pick the one that fits your need or combine several.
| Repo | Their position | Relationship to Harness |
|---|---|---|
| coleam00/Archon | “harness builder” — deterministic, repeatable runtime configurations | Same L3, neighbor sub-layer. Archon is a Runtime-Configuration Factory, Harness is a Team-Architecture Factory. Pick Archon for runtime determinism, Harness for team architecture, or combine them. |
| SaehwanPark/meta-harness | Codex port of the same concept | Same L3, different runtime. Use Harness on Claude Code, meta-harness on Codex. |
| affaan-m/ECC | “Agent harness performance & workflow layer” (sits on top of existing harnesses) | Different layer. ECC is a standardization layer across harnesses; Harness is a factory that generates harnesses. Serial combination possible. |
| wshobson/agents | Subagent / skill catalog (182 agents, 149 skills) | Factory ↔ parts supply. wshobson is a catalog to shop from; Harness designs the team. Absorb wshobson entries as parts inside a Harness-generated team. |
| LangGraph | State-graph orchestration, LLM-agnostic | Different track. LangGraph is for long-running, state-recoverable orchestration; Harness is for fast Claude-Code-native team design. |
Built with Harness
Harness 100
revfactory/harness-100 — 100 production-ready agent team harnesses across 10 domains, available in both English and Korean (200 packages total). Each harness ships with 4-5 specialist agents, an orchestrator skill, and domain-specific skills — all generated by this plugin. 1,808 markdown files covering content creation, software development, data/AI, business strategy, education, legal, health, and more.
Research: A/B Testing Harness Effectiveness
revfactory/claude-code-harness — A controlled experiment across 15 software engineering tasks measuring the impact of structured pre-configuration on LLM code agent output quality.
| Metric | Without Harness | With Harness | Improvement |
|---|---|---|---|
| Average Quality Score | 49.5 | 79.3 | +60% |
| Win Rate | — | — | 100% (15/15) |
| Output Variance | — | — | -32% |
Key finding: effectiveness scales with task complexity — the harder the task, the greater the improvement (+23.8 Basic, +29.6 Advanced, +36.2 Expert).
Exact phrasing to use everywhere: +60% avg quality (49.5 → 79.3), 15/15 win-rate, −32% variance (n=15, author-measured A/B, third-party replications pending).
Full paper: Hwang, M. (2026). Harness: Structured Pre-Configuration for Enhancing LLM Code Agent Output Quality.
Requirements
- Agent Teams enabled:
CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1
FAQ
Q1. Isn't "+60%" oversold?
A. The +60% figure comes from an author-measured A/B (n=15, 15 tasks, measured on the sister repo claude-code-harness). Every citation in this repo pairs the number with the disclosure “n=15, author-measured, third-party replications pending” in the same sentence. For adoption decisions, we recommend running a 2–4 week internal pilot and measuring your own numbers.
Evidence:
- Author A/B: revfactory/claude-code-harness
- Paper: Hwang, M. (2026). Harness: Structured Pre-Configuration for Enhancing LLM Code Agent Output Quality
Q2. Why "harness factory" and not "harness builder"? Isn't this competing with Archon?
A. Archon generates deterministic runtime configurations — it’s a Runtime-Configuration Factory. Harness generates agent team architectures (team structure, message protocols, review gates) — it’s a Team-Architecture Factory. They are neighbor sub-layers of the same L3 Meta-Factory and serve different needs. Pick Archon for runtime determinism, Harness for team-architecture patterns, or combine them (design architecture with Harness → deploy runtime with Archon).
Evidence:
- Archon self-definition: clawfit docs/reference-levels.md
- Sub-layer declaration: see the Category — Where Harness Sits section above
- Archon repo: github.com/coleam00/Archon
Q3. Isn't "Claude Code only" too narrow? What about Gemini/Codex?
A. Currently the official runtime is Claude Code only. A Codex port of the same concept — SaehwanPark/meta-harness — is already public, so Codex teams can start there. Harness chose “Claude-Code-native, deep” over “multi-runtime, shallow”; cross-runtime collaboration with sibling repos (meta-harness, harness-init, OpenRig) is on the roadmap.
Evidence:
- Codex port: github.com/SaehwanPark/meta-harness
- Cross-runtime scaffolder: github.com/Gizele1/harness-init
License
Apache 2.0
相似文章
@GitHub_Daily: 用 Claude Code 做复杂项目,单个 Agent 能力有限,想让多个 Agent 协作分工,但手动配置团队结构和技能文件太繁琐。 最近找到 Harness 这个 Claude Code 插件,一句话描述你的项目,它就能自动生成一整…
Harness 是一个 Claude Code 插件,能根据一句话描述自动生成多 Agent 团队架构,内置 6 种协作模式和 100 套现成配置,帮助 Claude Code 从单兵作战变为团队协作。
@GoSailGlobal: Claude Code 插件这周出了一个 14.7K star 的爆款 Understand-Anything · 它的钩子很狠 「你刚加入一个新团队,代码 200,000 行,从哪开始?」 · 它做的事 把整个 codebase 跑成一…
Understand-Anything 是一个 Claude Code 插件,通过多 Agent 流水线将代码库转化为可交互的知识图谱,帮助开发者快速理解大型项目,支持依赖分析、Diff Impact Analysis 等功能。
@sitinme: 一个给 Claude Code 装了一套“增强插件包”的开源项目——oh-my-claudecode,把原本单兵作战的 Claude Code,升级成更像一个有分工、有流程、有自动化能力的 AI 开发团队。 很多人用 Claude Cod…
oh-my-claudecode 是一个开源项目,为 Claude Code 提供增强插件包,将其升级为具备任务分工、自动化流程和团队协作能力的 AI 开发团队,适合重度用户和复杂项目。
@chenggeshuo: 装上这个官方插件,Claude Code直接脱胎换骨! Anthropic悄悄放了个大招:claude-code-setup插件,把原生版从"能用"直接拉满到"神级开发环境"。自动扫描你的项目,一键配置hooks、skills、MCP服务…
Anthropic发布了claude-code-setup官方插件,可以自动扫描项目并配置hooks、skills、MCP服务等,将Claude Code从基础可用提升为强大的开发环境。
@_vmlops: 这是让 Claude Code 快 25 倍的 harness。大多数开发者直接使用 claude code。这个仓库将它包装成一个完整的计划→…
一个名为 claude-code-harness 的新开源工具将 Claude Code 封装在结构化的计划-工作-审查-提交循环中,使用 Go 原生引擎实现 25 倍更快的性能,并包含防止破坏性操作的防护措施。