@vista8: Highly recommend installing this Skill, it's much more powerful than the official Skill-creator. If you don't know how to write a skill, using Teacher Yao's skill can produce a 90-point skill. This skill originates from the Anthropic official leaked Claude code source code, as well as other models across the web...

X AI KOLs Timeline Tools

Summary

Recommending an open-source meta-skill tool yao-meta-skill, more powerful than the official Skill-creator, built from Anthropic's leaked Claude code source code and other integrations.

Highly recommend installing this Skill, it's much more powerful than the official Skill-creator. If you don't know how to write a skill, using Teacher Yao's skill can produce a 90-point skill. This skill originates from the Anthropic official leaked Claude code source code, combined with skills from other models across the web to form a meta-skill. After a month of refinement by Teacher Yao, this is the best Meta Skill I've ever used. Github: https://github.com/yaojingang/yao-meta-skill...
Original Article
View Cached Full Text

Cached at: 06/20/26, 08:23 PM

Highly recommend installing this Skill, much more powerful than the official Skill-creator. If you don’t know how to write a skill, using Teacher Yao’s skill can produce a 90-point skill. This skill originates from the leaked Claude code source from Anthropic, combined with meta-skills from other models across the web. After a month of refinement by Teacher Yao, this is the best Meta Skill I’ve ever used. Github:https://github.com/yaojingang/yao-meta-skill…


yaojingang/yao-meta-skill

Source: https://github.com/yaojingang/yao-meta-skill

Yao Meta Skill

CI (https://github.com/yaojingang/yao-meta-skill/actions/workflows/test.yml)

License: MIT

English Chinese Japanese French Russian

YAO stands for Yielding AI Outcomes — the goal is not to generate more prompt text, but to produce reusable AI assets and real operational outcomes.

yao-meta-skill creates, evaluates, packages, and governs reusable agent skills. The 1.0 line focused on turning repeated workflows into installable, readable, cross-platform skill packages. The 2.0 line expands that factory into a Skill OS: a governed system for modeling a skill once, compiling it for multiple targets, testing its behavior, reviewing its release evidence, and tracking the next iteration.

Quick Start · Skill OS 2.0 · 1.0 vs 2.0 · Operator UX · Benchmark · Examples · Evals · Failure Library · Method Doctrine

Skill OS 2.0 Upgrade

Skill OS 2.0 keeps the original promise of yao-meta-skill, but makes the package lifecycle more explicit. Instead of stopping at SKILL.md, it adds a semantic contract, target compilers, evaluation evidence, release gates, and operation reports around the skill.

  • Skill IR: a platform-neutral intermediate representation for intent, triggers, inputs, outputs, boundaries, references, and expected artifacts.
  • Target compilers and adapters: generated surfaces for OpenAI, Claude, generic agent skills, Agent Skills compatible packages, and VS Code-oriented workflows.
  • Output Eval Lab: trigger checks, output assertions, execution evidence, timing and token evidence, benchmark reproducibility, blind-review packs, answer keys, and adjudication reports.
  • Review Studio 2.0: a single HTML gate page for intent, triggers, output eval, context cost, runtime checks, trust, Skill Atlas signals, adoption drift, waivers, annotations, release evidence, warnings, blockers, and fix actions.
  • Evidence and release governance: evidence consistency checks, package verification, install simulation, runtime permission probes, world-class evidence intake, world-class ledger, operator runbook, and public claim guard.
  • SkillOps loop: metadata-only adoption drift, telemetry hooks, adaptive proposals, daily and weekly curator reports, and portfolio-level drift signals.

Current posture: the repository is ready for beta and external testing, while stronger public “world-class” claims remain evidence-gated. Provider-backed production evidence, human blind-review evidence, native permission execution, and real-client telemetry are tracked as separate evidence tasks instead of being treated as completed work.

See the companion artifacts:

From 1.0 to 2.0

Dimension1.0 focus2.0 upgrade
Product roleCreate, refactor, evaluate, and package reusable skills.Govern the full lifecycle of a skill: creation, compilation, evaluation, review, release, telemetry, and iteration.
ArchitectureSKILL.md, agents/interface.yaml, manifest files, and report artifacts.Skill IR, target compilers, adapters, gate contracts, evidence ledgers, release locks, and action-oriented review pages.
Cross-platform deliveryOpenAI, Claude, and generic package targets.Adds broader Agent Skills and VS Code-oriented compatibility, with registry-readable compatibility records.
Quality modelTrigger and structure checks plus report-based review.Output eval, benchmark reproducibility, execution evidence, failure disclosure, blind-review packs, and evidence consistency checks.
Report experienceOverview HTML and first-pass review pages.Bilingual Skill Overview v2, Review Studio 2.0, reviewer annotations, action cards, charts, and audit-oriented report contracts.
Release boundaryPackage output with basic validation.Package verification, install simulation, runtime permission probes, release locks, public claim guard, and operator runbooks.
Operating loopManual feedback and local iteration.Adoption drift, metadata telemetry, SkillOps reports, adaptive proposals, and portfolio-level drift detection.

2.0 Use Cases

  • Create a new skill from repeated work: start with a workflow note, prompt set, transcript, runbook, or document pattern, then generate a package with a lean entrypoint, explicit inputs and outputs, references, reports, and the lightest justified gates.
  • Upgrade a personal skill into a team asset: add interface contracts, manifests, target adapters, trust checks, output evals, reviewer waivers, release notes, and Review Studio evidence before other people depend on the skill.
  • Prepare a skill for beta release: run package verification, install simulation, compatibility checks, runtime permission probes, and evidence consistency checks, then separate beta readiness from stronger public claims.
  • Keep a skill useful after release: use metadata-only telemetry, adoption drift, feedback logs, SkillOps reports, and adaptive proposals to decide whether the next move should be documentation, an eval, a skill patch, or a governance update.
  • Compare with other meta-skill approaches: keep Anthropic/OpenAI-style conversational creation and lean instruction writing where they fit, then use yao-meta-skill when the package needs evidence, portability, release gates, and repeatable maintenance.

Operator UX Commands

These read-only helper commands turn common maintainer questions into repeatable diagnostics:

python3 scripts/yao.py install-status --expected-source .
python3 scripts/yao.py localized-doc-sync-check
python3 scripts/yao.py pr-review-report 4 --repo yaojingang/yao-meta-skill
  • install-status explains whether the active skill is coming from .codex/skills, .agents/skills, or the disabled mirror, and flags duplicate active installs.
  • localized-doc-sync-check verifies that the Chinese README carries the public homepage sections that were added to the English README.
  • pr-review-report reads GitHub PR metadata, changed files, status checks, and suggested local commands without merging or mutating the PR.

Capability Surface

It turns rough workflows, transcripts, prompts, notes, and runbooks into reusable skill packages with:

  • a clear trigger surface
  • a lean SKILL.md
  • optional references, scripts, and evals
  • a front-loaded intent dialogue with an intent confidence gate, so the system keeps clarifying when the true job, outputs, exclusions, or standards are still fuzzy
  • a silent-by-default GitHub benchmark scan plus reference synthesis that studies top public repositories and world-class pattern tracks, then surfaces only real conflicts or uncertainty to the user
  • a generated visual HTML overview for each newly initialized skill
  • a Review Studio 2.0 HTML gate page that combines intent, trigger, output eval, context, runtime, trust, atlas, adoption drift, reviewer waivers, reviewer annotations, release evidence, and per-warning fix actions
  • a Skill OS 2.0 audit that maps each world-class requirement to current evidence, human-required gaps, and external-required gaps
  • a Skill OS 2.0 blueprint coverage report that maps the upgrade plan’s core modules and recommended PRs to concrete artifacts, commands, and tests
  • a world-class evidence plan that turns remaining provider, human, native-permission, and real-client telemetry gaps into executable evidence tasks
  • a world-class evidence ledger that records which external and human evidence is accepted or still pending without treating planned work as proof
  • a world-class evidence intake contract that validates external and human evidence packets for provenance, privacy, artifact refs, and anti-overclaim rules before ledger review
  • a redacted world-class preflight report that checks local files, environment readiness, human/external prerequisites, and source blockers before operators collect evidence
  • a world-class submission review queue that compares evidence packets, intake validation, source artifacts, and ledger state without accepting evidence
  • a world-class operator runbook that gives reviewers the exact commands, artifacts, and collection checklist needed to close remaining evidence gaps
  • a world-class claim guard that scans public claim surfaces and blocks premature completed/true claims while the evidence ledger still has pending external or human evidence
  • a benchmark reproducibility manifest that checks methodology sections, required artifacts, failure disclosure, and reproduction commands
  • an evidence consistency gate that compares generated reports against each other so benchmark, overview, interpretation, adoption, world-class ledger, coverage, and Review Studio facts do not drift silently
  • Output Eval Lab evidence with assertion grading, execution/timing/token evidence, a blind A/B review pack, a separate answer key, and reviewer adjudication reports
  • a runtime permission probe report that checks packaged target adapters for explicit permission metadata, native-enforcement flags, metadata fallback notes, and residual risks
  • a Python compatibility gate that catches supported-runtime syntax hazards before they reach GitHub Actions or packaged distribution
  • a side-by-side HTML review studio for first-pass human review
  • an artifact design profile that defines visual direction, layout patterns, and quality gates for reports, tutorials, dashboards, screenshots, and review pages
  • a prompt quality profile that abstracts need modeling, RTF mapping, complexity, and quality checks into reviewer-visible evidence instead of bloating SKILL.md
  • a systems-thinking model that maps boundaries, feedback loops, drift risks, recurring failure patterns, and highest-leverage quality moves
  • three high-value next iteration directions after the first package is created
  • a lightweight feedback log that does not require a full promotion cycle
  • a local-first metadata-only adoption and drift report that turns real usage signals into next iteration candidates, with optional yao.py CLI run capture, external client event emit hooks, hook recipes, and JSONL import that record command names and outcomes without arguments or raw content
  • an explicit-source adaptive proposal loop that summarizes redacted repeated user preferences and generates approval-gated adaptation proposals without scanning private logs or writing source files
  • a SkillOps opportunity scorer and decision policy that ranks redacted repeated signals, maps them to report-only, AGENTS update, existing-skill patch, or eval-addition actions, and keeps every durable write approval-gated
  • a weekly SkillOps curator report that aggregates daily opportunities, Skill Atlas portfolio signals, release lock state, and world-class evidence gaps into a proposal-only maintenance queue
  • a Browser/Chrome Native Messaging telemetry host that can receive length-prefixed metadata-only client events and generate a local launcher plus manifest without storing raw content
  • a Skill Atlas drift layer that reads aggregate adoption reports and surfaces portfolio-level drift signals without packaging raw telemetry logs
  • a baseline compare report for with-skill vs baseline review
  • a conversation-style, archetype-aware quickstart that steers new packages toward scaffold, production, library, or governed fits
  • Skill IR as the platform-neutral semantic contract, plus compiler reports and client-specific adapters
  • Registry audit metadata with package version, owner, license, checksum, and compatibility matrix
  • governance, promotion, and portability checks built into the default flow

Architecture

Hero view: Skill OS 2.0 turns messy operational input into a governed, reusable skill package through a model, compile, evaluate, release, and operate loop.

flowchart LR
    A["Inputs<br/>workflow / prompt / transcript / docs / notes"] --> B["Intent model<br/>job / outputs / exclusions / standards"]
    B --> C["Skill IR<br/>trigger / contracts / resources / evidence"]
    C --> D["Skill package<br/>SKILL.md / references / scripts / reports"]
    C --> E["Target compilers<br/>OpenAI / Claude / generic / Agent Skills / VS Code"]
    D --> F["Eval Lab<br/>trigger / output / benchmark / runtime"]
    E --> F
    F --> G["Review Studio<br/>gates / warnings / actions / waivers"]
    G --> H["Release boundary<br/>package verification / install simulation / claim guard"]
    H --> I["SkillOps loop<br/>feedback / adoption drift / next iteration"]
    I --> B

Read it in 10 seconds:

  • Inputs: start from rough operational material instead of a polished spec.
  • Intent model: make the job, outputs, exclusions, constraints, and standards explicit before generating files.
  • Skill IR: keep the semantic contract separate from any single platform format.
  • Package and compile: generate the lean skill package and the target-specific adapters from the same source model.
  • Evaluate and review: turn trigger behavior, output quality, runtime checks, and trust signals into reviewable evidence.
  • Release and operate: publish only within the current evidence boundary, then feed adoption drift and reviewer feedback into the next iteration.

Weighted Quality Benchmark

This benchmark is a project-level engineering review, scored from 0-10 per dimension and weighted to 100. GitHub stars are intentionally excluded because they measure ecosystem heat, not meta-skill engineering quality.

Weighted score formula: sum(score / 10 * weight).

Meta SkillMethod Depth 15Context Discipline 10Toolchain 15Eval/Test Rigor 20Governance 15Portability 10Onboarding/Review 5Local Reliability 10Weighted Score
Yao Meta Skill9.58.09.59.59.59.06.59.591.5
Anthropic Skill Creator9.06.58.57.54.05.07.55.067.5
OpenAI Skill Creator8.59.55.02.03.04.08.54.050.5
RankMeta SkillScoreCore Positioning
1Yao Meta Skill91.5A complete engineering, evaluation, governance, and portability system for reusable skills.
2Anthropic Skill Creator67.5Strong methodology and iteration loop, with weaker local execution reliability and governance coverage.
3OpenAI Skill Creator50.5Best treated as a concise skill-writing method guide rather than a full engineering system.

Best-Fit Scenarios

  • Choose Yao Meta Skill when the target is a reusable team asset with explicit boundaries, trigger evaluation, governance, packaging, portability, and local execution checks.
  • Choose Anthropic Skill Creator when the target is a conversation-first creation loop and the priority is human-guided iteration over repository-level governance.
  • Choose OpenAI Skill Creator when the target is a compact reference for writing lean skill instructions and keeping context small.
  • A practical hybrid pattern is still useful: draft conversationally, then use yao-meta-skill to harden the package, add evidence, and make it team-ready.

Quick Start

  1. Describe the workflow, prompt set, or repeated task you want to turn into a skill.
  2. Start with

Similar Articles

@FakeMaidenMaker: Have you installed the best Skills for Codex? These 6 skills (can be installed for Codex, Claude Code, and Cursor) will make it far more powerful than you imagine. Their reputation has been proven by major communities: 1. Superpowers (ob…

X AI KOLs Timeline

Recommends six Skill plugins to enhance AI coding assistants like Codex, Claude Code, and Cursor, including Superpowers, OpenAI official plugins, claude-mem, Agent-Reach, GitNexus, and Humanizer-zh, covering working methodologies, long-term memory, internet access, code understanding, etc.

@max_ai_max: https://x.com/max_ai_max/status/2060221653259547069

X AI KOLs Timeline

This article shares a practical guide to writing a truly usable Claude Skill, covering the operating mechanism, directory skeleton, frontmatter writing, iteration methods, etc., to help developers efficiently build and debug custom skills.

@vista8: Without good Skills, AI Agents cannot realize their potential. Another pain point is that with millions of Skills online, which one best matches my current task? Can it run? Is it safe? Vercel's find skill can search and install Skills, but only includes 90,000 Skills scattered across vertical domains…

X AI KOLs Timeline

SkillsVote is an AI Agent Skill management tool that filters 790K+ Skills from GitHub and extracts functional descriptions, environment requirements, and permission information. It supports precise recommendation, execution attribution, and iterative optimization, and can also provide workflow combination suggestions.

@nini_incrypto_: Want to freeload the official Agent building solution from a big company? Anthropic has open-sourced the underlying secrets of how Claude learns new skills! This official repository called anthropic/skills went viral as soon as it was released. Now you don't need to write complex prompts or fine-tune the model...

X AI KOLs Timeline

Anthropic has open-sourced the skills repository, allowing users to define Claude's behavior through structured Markdown instructions without complex prompts or fine-tuning. It covers a variety of skill sets including creative, technical, and enterprise.

@wanerfu: A Japanese developer discovered a little-known trick for Claude Code. He set up a skill called 'Find Skill'. It embeds into Claude Code like a small MCP package. Write a task description once. Claude searches hundreds of skills in the human ecosystem and automatically assembles the most suitable workflow. Video script (Notion…)

X AI KOLs Timeline

A Japanese developer discovered a trick for Claude Code: by setting a 'Find Skill' skill, Claude automatically searches and combines hundreds of skills to create workflows, enabling automated video content generation.