@tom_doerr: Automates empirical research with 23,000 AI agent skills https://github.com/brycewang-stanford/Auto-Empirical-Research-…

X AI KOLs Timeline 06/02/26, 06:39 PM Tools

Summary

Auto-Empirical-Research-Skills (AERS) is an open-source toolkit that automates the entire empirical research pipeline using 23,000+ AI agent skills, from data cleaning to submission-ready drafts.

Automates empirical research with 23,000 AI agent skills https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills…

Original Article

View Cached Full Text

Cached at: 06/02/26, 09:38 PM

Automates empirical research with 23,000 AI agent skills https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills…

brycewang-stanford/Auto-Empirical-Research-Skills

Source: https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills

Auto-Empirical Research Skills (AERS, 23K+ Skills)

Repository renamed → now “Auto”. This project started life as Awesome Agent Skills for Empirical Research and has been renamed to Auto-Empirical-Research-Skills (AERS). The new name reflects the core idea: not just a collection of skills, but an agent that automatically runs the full empirical-research pipeline end to end — from raw data cleaning → identification & estimation → robustness checks → tables, figures, and a submission-ready draft — with minimal human hand-holding.

GitHub automatically redirects the old URL, but please update your bookmarks and local remote:
git remote set-url origin https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills.git

🌐 Language / 语言: English | 中文

Stanford REAP × CoPaper.AI · An academic-industrial AI toolkit for empirical research
_{Crafted by Stanford’s empirical methodology team — covering the full pipeline from data cleaning to top-journal submission}

The Definitive Collection of AI Agent Skills for Empirical Research — 119 GitHub Repos / 23,000+ Skills

A curated, opinionated list of 119 GitHub repositories and 23,000+ AI Agent Skills for empirical research in economics, political science, sociology, psychology, public health, education, management, finance, and public policy — organized by research workflow, from topic selection to journal submission.

In 2026, the way we do empirical research is being redefined.

CoPaper.AI — an empirical research AI assistant incubated by researchers at Stanford REAP / SCCEI (Stanford Center on China’s Economy and Institutions) — can complete a publication-quality empirical paper in 20 minutes: from data import, descriptive statistics, causal inference models, and robustness checks to formatted result tables, all in one go. The secret isn’t a more powerful model — it’s Skills: encoding senior researchers’ methodological expertise into structured workflows, so the AI knows “what a complete DID analysis should include” instead of waiting for you to remind it step by step.

This repository is the Agent Skills landscape we compiled while building CoPaper.AI. We organized hundreds of Skills repos and tens of thousands of Skills scattered across GitHub, communities, and academia by research workflow stages, so you can pick what you need.

🎓 Three Layers of Trust · Why It’s Us Building This

Layer	Anchor	Lever
🏛️ Academic lineage	Stanford REAP / SCCEI — Stanford Center on China’s Economy and Institutions	A research center with a sustained publication record in empirical economics methodology and a deep tradition in applied causal inference
🔧 Engineering delivery	CoPaper.AI empirical research AI assistant	Ships with 20 econometric methodology Skills (DID/IV/RDD/PSM/DML, etc.), Supervisor + 4 sub-agent multi-agent architecture, one-sentence triggers, automatic result output
⚙️ Open-source engine	StatsPAI — the causal-inference engine that powers CoPaper.AI	900+ functions · one `import statspai as sp` · JOSS in submission · MIT-licensed. Every DID/IV/RD/SCM estimate CoPaper.AI produces is driven by StatsPAI; this Skills collection is itself part of the StatsPAI ecosystem

🔒 Use with confidence: every one of the 52 Skills / 2,940+ files in this repo passed our systematic security audit — 52/52 CLEAN, zero FLAGGED, zero exfiltration, zero reverse shells, zero prompt injection.

💡 Want it out of the box? Skip the Skills assembly — try → copaper.ai and let the Stanford methodology team run the empirical pipeline end-to-end for you.

Start Here

Search the local index: docs/search.html
Browse the generated local catalog: docs/SKILL_CATALOG.md
Copy a ready-to-run empirical workflow: docs/GOLDEN_WORKFLOWS.md
See flagship demos: docs/demos/
Run flagship regression prompts: docs/EVALS.md
Install or copy skills into an agent runtime: docs/INSTALL.md
Use the machine-readable index: catalog/skills.json
Coordinate parallel agent work: docs/AGENT_COORDINATION.md
Check provenance and license risk: docs/LICENSE_AUDIT.md
Check contribution and validation rules: docs/QUALITY_GATE.md · docs/SKILL_SUBMISSION_GUIDE.md
See the repo audit and improvement plan: docs/REPO_AUDIT_2026-05-31.md · docs/ROADMAP.md
Rebuild and validate locally:

make catalog
make validate

Pick a Workflow in 30 Seconds

Goal	Start with
Run a complete empirical pipeline	`StatsPAI_skill`
Audit a top-5 economics identification strategy	`aer-identification`
Prepare AER / AEJ submission	`aer-workflow`
Build a replication package	`aer-replication`
Lower Chinese academic AI-writing signal	`chinese-de-aigc`

🆕 Changelog

2026-05-25: 📕 AER-skills vendored — Top-5 economics (AER / AER:Insights / AEJ) submission stack (9 skills, skills/50) + weekly auto-sync workflow

📕 skills/50-brycewang-aer-skills: This repo’s sister project brycewang-stanford/AER-skills vendored in whole, with the same StatsPAI-style sync loop (scripts/sync-aer-skills.sh + .github/workflows/sync-aer-skills.yml — Monday 06:00 UTC weekly diff, PR on drift). Positioning: a top-5 economics submission skill stack (AER / AER:Insights / AEJ family), extending the StatsPAI / 00.x “analysis” line to the “manuscript + submission” line.
- 🧱 Nine skills covering the full submission pipeline: aer-topic-selection (AER vs Insights vs AEJ routing) → aer-identification (identification audit: modern DiD / weak IV / boundary RDD pitfalls) → aer-robustness (referee-anticipating robustness matrix) → aer-introduction (Keith Head five-paragraph intro) → aer-tables-figures (AER booktabs typesetting) → aer-replication (AEA Data and Code Availability Policy package, openICPSR-ready) → aer-submission (preflight: 100-word abstract, disclosure, cover letter) → aer-rebuttal (R&R letters written against the revised manuscript, not the old draft) → aer-workflow (orchestrator that tells you which skill to use next).
- 🆚 Differentiation from existing skills: StatsPAI / 00.x solve “how to run the analysis correctly”; AER-skills solves “how to write the paper to top-5 acceptance threshold” — the AER 100-word abstract / AER:Insights 7000-word limit / 45% desk-rejection rate / AEA mandatory replication are top-5-specific constraints that generic scientific-writing skills (Nature-Paper-Skills etc.) do not cover. Identification-first: if your design is fragile, no prose will save it.
- 🔁 Vendor-sync loop: git clone --depth=1 upstream → rsync -a --delete --exclude='.git' mirror the whole tree → diff content hashes before/after, exit 0 on no drift, exit 1 on drift to trigger peter-evans/create-pull-request@v6 on chore/sync-aer-skills branch. Supports manual workflow_dispatch for on-demand sync.
- License: MIT — consistent with StatsPAI / 00.x; commercial and academic use both allowed.
- First upstream commit: 7e9c44d (2026-05-25, includes modern-aer-exemplars.md with 30+ subfield-organized papers).

2026-04-28: 🛡️ Repository-wide security scan complete — all 52 Skills CLEAN, zero FLAGGED

🛡️ SECURITY-SCAN-REPORT.md: We ran a systematic security audit across all 52 Skills / 2,940+ files in this repository. No malicious prompts, viruses, trojans, reverse shells, or other malicious content were found. Bottom line: every Skill in this repo is safe to use.
- 🔍 Six-phase defense-in-depth methodology: (1) automated grep across 13 risk categories (pipe-to-shell, reverse shell, credential exfil, decode-and-run, mining/RAT signatures, prompt injection, etc.) → (2) 100% manual review of all 6 hook-bearing Skills and their 40+ hook scripts → (3) three parallel agents auditing SKILL.md prose, agent definitions, and reference docs separately → (4) supplemental integrity checks (hidden Unicode, encoding anomalies, ultra-long lines, HTML injection, network-related imports).
- 📊 Result distribution: every “sensitive” hit verified as one of three legitimate categories — defensive security rules (deny rules, bash-safety hooks, credential detectors), legitimate academic API calls (arXiv / CrossRef / PubMed / FRED / World Bank / OECD / BLS), or standard Claude Code workflow hooks (scaffolding / state save / context monitor — all local file operations, zero network IO).
- 🔑 Key insight: 17-DAAF is actually the strongest “security-aware” reference template in this batch (14 defensive hooks + 32 deny rules + active credential scanning). Largest size ≠ highest risk.
- 📈 Visual infographics: 5 zhihu-style information graphics embedded in the report (overview / methodology / threat matrix / Top 5 size distribution / supplemental scan) — readable in 3 seconds.
- See the full security scan report for details.

2026-04-24: 📗 Full Empirical Analysis Skill (R) shipped — tidyverse + fixest, 8-step Quarto-friendly loop (skills/00.3)

📗 Full Empirical Analysis Skill — R: Same-day fourth member of the family, vendored at skills/00.3-Full-empirical-analysis-skill_R/ — slot #0.3, the R / Quarto edition.
- 🧱 Modern tidyverse + fixest stack: dplyr / tidyr / haven for data; fixest::feols/feglm/fepois as the panel/IV/DID workhorse (one line for HD FE + multi-way cluster + IV); did::att_gt + fixest::sunab + didimputation::did_imputation + synthdid + DIDmultiplegtDYN + bacondecomp + HonestDiD for modern DID; rdrobust / rddensity / rdmulti / rdlocrand for RD; Synth / gsynth / tidysynth / synthdid for synthetic control; MatchIt / WeightIt / cobalt / ebal for matching; grf::causal_forest + DoubleML for ML causal; mediation::mediate + lavaan::sem for mediation; marginaleffects::avg_slopes / plot_slopes for post-estimation; modelsummary / kableExtra / gt / flextable for publication tables; ggplot2 + iplot + binsreg + cowplot + patchwork for figures; Quarto to render PDF/HTML/Word in one command.
- 🔁 8-step R closed loop (mirrors 00.1 / 00.2): (1) Import & cleaning (read_dta + clean_names + naniar::vis_miss + mice + validate / assertr) → (2) Variable construction (mutate + across + DescTools::Winsorize + scale + arrange %>% group_by %>% lag/lead) → (3) Descriptives (gtsummary::tbl_summary + modelsummary::datasummary_balance + psych::corr.test + corrplot / ggcorrplot) → (4) Diagnostics (12 classes: shapiro.test / tseries::jarque.bera.test / lmtest::bptest / dwtest / bgtest / car::vif / tseries::adf.test / kpss.test / plm::pbgtest / pcdtest / phtest / lmtest::resettest) → (5) Estimation (12 classes: feols + AER::ivreg + did::att_gt + fixest::sunab + didimputation + synthdid + rdrobust + tidysynth + gsynth + MatchIt + WeightIt + ebal + grf::causal_forest + DoubleML + sampleSelection::heckit + quantreg::rq + lavaan::sem) → (6) Robustness (modelsummary for M1–M6 + clubSandwich + fwildclusterboot::boottest + ri2::conduct_ri + bacondecomp::bacon + HonestDiD::createSensitivityResults + robomit::o_test/o_beta) → (7) Further analysis (formula interactions + marginaleffects::plot_slopes + mediation::mediate + medsens + lavaan::sem multi-group + grf::causal_forest CATE + splines::ns dose-response) → (8) Publication output (modelsummary to LaTeX/Word/HTML/Markdown in one call + fixest::iplot + marginaleffects::plot_slopes/predictions + cowplot::plot_grid + patchwork + Quarto rendering).
- 📚 Progressive disclosure + Quarto-native: SKILL.md 893-line spine (with full install.packages list, project skeleton, Quarto YAML template); 8 references/NN-*.md totalling 3700+ lines. The Quarto template makes “narrative + code + tables + figures” render to a single self-contained report from a single .qmd source.
- 🆚 Four-skill positioning: StatsPAI = Python one-shot DSL; 00.1 = explicit Python; 00.2 = explicit Stata; 00.3 = R + tidyverse + Quarto. Four parallel implementations of the same 8 steps, none replacing the others. The Quarto-rendered reproducibility report is unique to 00.3.
- Use cases: Quarto-rendered replication reports, academic blogs (distill / quarto blog), graduate R courses, rigorous projects needing marginaleffects + mediation + grf post-estimation, anything R-flavoured outside of pure Bayesian work.

2026-04-24: 📊 Full Empirical Analysis Skill (Stata) shipped — traditional Stata ecosystem, 8-step .do loop (skills/00.2)

📊 Full Empirical Analysis Skill — Stata: Same-day Stata sibling of StatsPAI / 00.1, vendored at skills/00.2-Full-empirical-analysis-skill_Stata/ — slot #0.2, for Stata users.
- 🧱 Traditional Stata ecosystem, de-facto standard command chain: every step calls community-standard commands reghdfe / ivreg2 / ivreghdfe / csdid / did_imputation / eventstudyinteract / sdid / did_multiplegt_dyn / bacondecomp / honestdid / rdrobust / rddensity / synth / synth_runner / psmatch2 / teffects / ebalance / ppmlhdfe / boottest / ritest / rwolf / psacalc / coefplot / esttab / outreg2 / asdoc / binscatter — referee-level Stata replication packs, one ssc install block installs 30+ packages.
- 🔁 8-step .do loop (same structure as 00.1, Stata-native rewrite): (1) Import & cleaning (use/import excel/import sas/destring/misstable/mdesc/duplicates report/merge m:1 ... assert(match using)/xtset/xtdescribe/mi impute chained) → (2) Variable construction (winsor2 by(industry year)/egen std/xtile/xtset + L./F./D./S./CPI deflation/first_treat+rel_time+gvar) → (3) Descriptives (tabstat/balancetable/asdoc sum/pwcorr, sig star(.05)/heatplot/twoway kdensity/xtdescribe) → (4) Diagnostics (12 classes: swilk/sktest/estat hettest/estat imtest, white/xtserial/xttest3/xtcsd, pesaran/estat vif/dfuller/kpss/xtunitroot ips/llc/hausman fe re/estat ovtest/linktest) → (5) Estimation (12 classes: reghdfe+areg+xtreg, fe/re/ivreg2+ivreghdfe+ivregress liml/gmm/csdid+eventstudyinteract+did_imputation+sdid+did_multiplegt_dyn/rdrobust+rdmc+rddensity/synth+synth_runner/psmatch2+teffects psmatch/ipwra/aipw+ebalance+cem/heckman+heckprob/qreg+sqreg/ppmlhdfe/sem+gsem) → (6) Robustness (eststo+esttab M1–M6, multi-cluster, boottest, ritest, rwolf, bacondecomp, honestdid, psacalc delta) → (7) Further analysis (factor-var interactions+margins+marginsplot/suest cross-eq Wald/DDD/outcome ladder coefplot/medsem+khb+sem estat teffects/dose-response via xtile or bspline/Stata-Python bridge to econml for CATE/spillover) → (8) Publication output (esttab+outreg2+asdoc to .tex/.rtf/.docx/.xlsx; coefplot+marginsplot+binscatter+rdplot+graph combine to .pdf).
- 📚 Progressive disclosure: SKILL.md 801-line spine (full ssc install list + complete .do skeleton + library cheat-sheet); 8 references/NN-*.md totalling 3500+ lines, loaded on demand.
- 🆚 Triple positioning (now extended to 4 with 00.3): StatsPAI = Python DSL one-shot; 00.1 = explicit Python; 00.2 = explicit Stata — the only choice when a referee or co-author insists on Stata replication.
- Use cases: referee-level Stata replication packs, graduate Stata courses, AER/QJE/JPE/ReStud-style standard .do pipelines, rigorous research needing the full modern DID toolkit (bacondecomp + honestdid + psacalc).

2026-04-24: 📘 Full Empirical Analysis Skill shipped — traditional Python econometric stack, explicit 8-step loop (skills/00.1)

📘 Full Empirical Analysis Skill: Same-day sibling to StatsPAI, vendored at skills/00.1-Full-empirical-analysis-skill_Python/ — slot #0.1, the explicit / auditable counterpart.
- 🧱 Traditional Python econometrics stack, no DSL wrapper: every step directly calls pandas / numpy / scipy / statsmodels / linearmodels / pyfixest / rdrobust / econml / causalml / matplotlib / seaborn — every line of agent-written code is inspectable and swappable.
- 🔁 8-step closed loop (finer granularity than StatsPAI’s 6 steps): (1) Data cleaning (MCAR/MAR/MNAR handling, IQR/z/Mahalanobis outliers, validate= on every merge, panel-structure checks) → (2) Variable construction (log/IHS/Box–Cox, 1/99 winsorization, z/MinMax/Robust scaling, interactions/lags/diffs, CPI deflation, staggered-DID timing vars) → (3) Descriptive statistics (stratified Table 1 with SMDs+t-tests, starred correlation heatmap, 4-panel distribution figure, DID motivation plot, panel-coverage heatmap) → (4) Diagnostic tests (12 classes: normality / heteroskedasticity / autocorrelation / multicollinearity / stationarity / cointegration / endogeneity / weak-IV / overid / panel Hausman / RESET / Cook’s D) → (5) Baseline modeling (12 classes of estimators: OLS / panel FE-RE-FD / GLM / IV-2SLS-LIML-GMM / DID×5-2×2/TWFE/event-study/CS/SA/BJS/SDiD / RD-Sharp/Fuzzy/Kink/multi-cutoff / SC / PSM-IPW-EB / DML / Causal Forest / Heckman / Quantile) → (6) Robustness battery (M1–M6 progressive specs, cluster-level sensitivity, wild cluster bootstrap, placebo timing+permutation, specification curve, Oster δ*, LOO, Rosenbaum) → (7) Further analysis (heterogeneity × 4, outcome-ladder mechanism, Baron–Kenny + Imai mediation, moderated mediation, dose-response, spillover) → (8) Publication tables & figures (stargazer / pyfixest.etable / coefplot / event-study / binscatter / forest / RD plot / CATE heatmap / love plot, full LaTeX/Word/Excel export).
- 📚 Progressive-disclosure architecture: SKILL.md holds only the one canonical call per step (610 lines of spine); variants are offloaded to 8 references/NN-*.md deep manuals (3000+ lines total), loaded by agents only when needed.
- 🆚 Relationship to StatsPAI: StatsPAI = agent-native one-shot DSL (one sp.causal(...) runs everything); this skill = explicit traditional stack (every line swappable, every diagnostic by hand). They coexist and complement — reach for StatsPAI when you trust the DSL; reach for this skill when teaching, auditing, or requiring full control.
- Use cases: replicating applied-economics papers, referee-level line-by-line audit, graduate teaching, any project that insists on hanging every diagnostic and robustness check into the explicit pipeline.

2026-04-24: 🔥 StatsPAI Skill officially shipped — end-to-end automated empirical analysis (skills/00)

🔥🔥 StatsPAI Skill: Our agent-native, one-stop empirical-analysis Skill is now officially vendored in this repo at skills/00-Full-empirical-analysis-skill_StatsPAI/ — slot #0, the repository’s flagship.
- 🚀 End-to-end automation for the entire empirical pipeline: data cleaning (pandas pre-step) → EDA & descriptives (sp.sumstats / sp.balance_table) → pre-flight diagnostics (sp.diagnose / sp.balance_panel / overlap / missingness) → research-question DSL (sp.causal_question(...).identify()) → LLM-assisted DAG discovery (sp.llm_dag_propose / validate / constrained) → one-call estimation (sp.causal(...)) → robustness (sp.spec_curve / sp.honest_did / sp.evalue). 6-step closed loop, no tool switching — the agent runs the whole thing from a single instruction.
- 900+ functions, one import statspai as sp: more than doubled from the 390+ version on 2026-04-12. Covers OLS, IV, panel, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD (Sharp / Fuzzy / multi-cutoff / Kink), PSM, SCM, SDID, DML, Causal Forest, Meta-Learners, TMLE, AIPW, neural causal models (TARNet / CFRNet / DragonNet), text causal (sp.causal_text), Heckman, structural estimation (BLP).
- Agent-native self-describing API: sp.list_functions() / sp.describe_function() / sp.function_schema() — agents discover and understand functions without doc lookup. Every estimator returns a unified CausalResult with .summary() / .plot() / .to_latex() / .to_word() / .to_excel() / .cite() and a structured .diagnostics dict — purpose-built for LLM-driven workflows.
- Estimand-first decisions: sp.causal_question makes the “DID vs RD vs IV?” choice explicit and defensible — no more guesswork.
- Submitted to JOSS, MIT-licensed. → PyPI | → GitHub | → Local Skill
🔁 Weekly upstream sync: new GitHub Action auto-pulls the latest SKILL.md / README.md from the StatsPAI main repo into skills/00-Full-empirical-analysis-skill_StatsPAI/ every week — users always get the latest version.
Corrected several sp.* signatures in Skill code examples; Step 0–6 code blocks are now explicitly flagged as illustrative (so agents don’t copy them verbatim).

2026-04-13: 🇨🇳 Original Chinese De-AIGC Skill Launched (skills/48)

🇨🇳🔥 chinese-de-aigc: CoPaper.AI team’s original Chinese academic de-AIGC skill. Currently the only humanizer on GitHub dedicated to Chinese academic empirical papers and targeting China’s CNKI AMLC / Wanfang / VIP / Turnitin Chinese detectors.
- 17-pattern library of Chinese AI tells (4-character clichés / hollow connectives / explicit transitions / absolutist claims / total-part-total symmetry / sentence-length uniformity)
- 5-step closed-loop workflow: Locate → Diagnose → Differential Rewrite → 5-Dim Self-Score → Second-Pass Review
- Per-section strategy: Abstract / Introduction / Literature Review / Methods / Results / Discussion / Conclusion each has different rewrite intensity
- 5-dimension scoring rubric: Concreteness / Rhythm / Caution / Implicit Cohesion / Researcher Voice (weighted max 50)
- 12 before/after case comparisons covering 7 main chapters of empirical papers
- Architecture inspired by English humanizers (humanizer_academic / skill-deslop / stop-slop / avoid-ai-writing), but fully re-designed for Chinese language context

2026-04-12: Added StatsPAI Agent-Native Econometrics Package + Anti-AIGC Detection Skills

🔥 StatsPAI: Our own agent-native causal inference & econometrics Python package. 390+ functions, one import, self-describing API (list_functions() / describe_function() / function_schema()). Covers OLS, IV, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD, PSM, SCM, DML, Causal Forest, Meta-Learners, TMLE, neural causal models (TARNet/CFRNet/DragonNet), and more. Published in JOSS, MIT license. → PyPI | → GitHub
📝 Anti-AIGC Detection Skills (4 new, → dedicated section):
- humanizer_academic — Academic paper specialist, 23 AI writing pattern detectors (skills/44)
- skill-deslop — Scientific writing de-AI, respects discipline conventions (skills/45)
- stop-slop — 3-layer detection + 5-dimension scoring (skills/46)
- avoid-ai-writing — Structured audit + rewrite + second-pass audit (skills/47)
🛡️ revision-guard: Prevents AI over-refinement, limits revision rounds + 7-point homogenization checklist (community PR contribution)

2026-04-11: Expanded from 43 collections to 119 repos, covering 23,000+ Skills

Added 76 GitHub repositories across 8 social science disciplines (economics, political science, sociology, psychology, education, public health, management, finance)
Added skill suites for finance, law, marketing, product management, education, public health
Added 13 academic data MCP servers (OpenAlex, Semantic Scholar, FRED, World Bank, etc.)
Added 11 multi-agent collaboration systems (Agent Laboratory, AI-Scientist-v2, etc.)
Added bilingual Chinese/English README

What Can This List Do for You?

If you do empirical research, you’ve probably experienced these scenarios:

You ask AI to run a DID, and it gives you the baseline regression and stops. You say “parallel trends?” — it adds one. “Placebo test?” — another one. “Event study plot?” — yet another. Every time, it’s like squeezing toothpaste.
You finally finish a draft, but citations are a mess, with a few hallucinated references mixed in.
You want to replicate an identification strategy from a top journal, but the gap between understanding it and implementing it feels like a mountain.

The problem isn’t that AI can’t do it — it doesn’t know what a complete workflow should include.

A Skill solves this: it’s a methodological playbook for AI. With a Skill, AI knows “running DID means first testing parallel trends, then baseline regression, then 4 robustness checks, then heterogeneity analysis, then mechanism analysis, with specific output formats at each step.” You just say “run a DID analysis” and it follows the complete workflow.

This list helps you find the best Skills for every stage of the empirical research workflow.

Quick Lookup by Research Stage

Not sure which Skill to use? Start from your current research stage:

Topic Ideation → Lit Search → Deep Reading → Research Design → Data Collection
      │              │             │              │                │
      ▼              ▼             ▼              ▼                ▼
     01             02            03             01               04

Data Cleaning → Statistical Analysis → First Draft → Revision → Typesetting
      │              │                    │            │            │
      ▼              ▼                    ▼            ▼            ▼
     04             05                   06           07           08

Replication → Submission → Peer Review Response → Defense
      │           │              │                   │
      ▼           ▼              ▼                   ▼
     09          10             10                  10

One-Stop Solutions

If you don’t want to pick Skills one by one, these solutions cover the full workflow:

Solution	Coverage	Highlights	Link
CoPaper.AI	Data Analysis → Paper Writing	20 built-in methodology Skills, multi-agent architecture, complete publication-quality empirical paper in 20 minutes	copaper.ai
StatsPAI Skill 🔥🔥	End-to-end automated empirical analysis	900+ functions, one `import statspai as sp`. A single agent instruction runs the full 6-step loop: EDA → pre-flight → research-question DSL → DAG discovery → estimation → robustness. Agent-native self-describing API, covers OLS/IV/DID (incl. Callaway-Sant’Anna, Sun-Abraham, HonestDID, continuous DID)/RDD/PSM/SCM/DML/Causal Forest/neural causal/text causal, publication-ready output (Word/Excel/LaTeX)	Local Skill · GitHub
Full Empirical Analysis Skill 📘	Traditional Python stack, explicit 8-step loop (StatsPAI’s philosophical counterpart)	No DSL wrapper — the agent drives `pandas` / `statsmodels` / `linearmodels` / `pyfixest` / `rdrobust` / `econml` / `causalml` / `matplotlib` directly. Covers data cleaning → variable construction → descriptive statistics → diagnostic tests → modeling → robustness → mechanism/heterogeneity/mediation/moderation → publication-ready tables & figures, one deep reference per step. Ideal for teaching, referee-level audit, and strict replication work that needs line-by-line control	Local Skill
Full Empirical Analysis Skill — Stata 📊	Traditional Stata `.do` 8-step loop (the Stata sibling of 00.1)	`reghdfe` + `ivreg2` + `csdid` + `did_imputation` + `eventstudyinteract` + `sdid` + `rdrobust` + `synth` + `psmatch2` + `teffects` + `ebalance` + `boottest` + `ritest` + `rwolf` + `bacondecomp` + `honestdid` + `coefplot` + `esttab` + `outreg2` + `asdoc` + `binscatter`. Same 8 steps, from `use` / `import` all the way to `.tex` / `.rtf` tables + `.pdf` figures. The first choice for referee-level Stata replication packs	Local Skill
Full Empirical Analysis Skill — R 📗	tidyverse + fixest 8-step loop + Quarto rendering (R sibling of 00.1 / 00.2)	`dplyr` + `haven` + `fixest` + `did` + `synthdid` + `bacondecomp` + `HonestDiD` + `rdrobust` + `tidysynth` + `gsynth` + `MatchIt` + `WeightIt` + `cobalt` + `ebal` + `grf` + `DoubleML` + `mediation` + `lavaan` + `marginaleffects` + `modelsummary` + `kableExtra` + `gt` + `flextable` + `ggplot2` + `iplot` + `binsreg` + `cowplot`. All 8 steps fit in a single `.qmd`; `quarto render` produces a unified PDF/HTML/Word reproducibility report.	Local Skill
Claude Scholar	Ideation → Submission	25+ Skills covering the full research lifecycle, Zotero MCP integration	GitHub
K-Dense Scientific Skills	Cross-disciplinary Science	140+ Skills, 28+ scientific databases, 55+ Python packages	GitHub
AI-Research-SKILLs	AI/ML Research	22 categories, 87 skills, full research cycle	GitHub
OpenClaw Medical Skills	Biomedical/Public Health	869 Skills, epidemiology, clinical research, drug safety, biostatistics	GitHub
Agent Laboratory	Fully Autonomous Research	Lit review → Experiments → Report, 84% reduction in research costs	GitHub

Comprehensive Skill Suites

These repositories contain multiple Skills and typically cover several research stages:

Academic Research

Suite	Stars	# Skills	Key Features	Social Science Fit
K-Dense-AI/claude-scientific-skills	8,799	140+	28+ scientific databases (OpenAlex, PubMed), scientific-writing + literature-review + statistical-analysis	⭐⭐⭐⭐
Orchestra-Research/AI-Research-SKILLs	3,637	87	22 categories, ML paper writing, LaTeX templates, citation verification	⭐⭐⭐
Imbad0202/academic-research-skills	~1,790	Multiple	Full paper pipeline (research → write → review → revise → finalize), style calibration, hallucination detection	⭐⭐⭐⭐
Galaxy-Dawn/claude-scholar	-	25+	Full research lifecycle: ideation → review → experiments → writing → peer review response, Zotero MCP	⭐⭐⭐⭐⭐
luwill/research-skills	209	3	Research proposal generation, medical review writing, paper-to-slides, bilingual	⭐⭐⭐⭐⭐
lishix520/academic-paper-skills	22	2	Strategist (7-dimension reviewer simulation) + Composer (systematic writing)	⭐⭐⭐⭐
Data-Wise/claude-plugins	-	17	Statistical research: arXiv search, DOI lookup, BibTeX management, methodology writing, referee response	⭐⭐⭐⭐⭐

Economics / Causal Inference

Suite	Key Features	Use Case
CoPaper.AI	20 methodology Skills (OLS, DID, staggered DID, IV, RDD, PSM, SCM, DML, causal forest, etc.), multi-agent architecture (Supervisor + 4 sub-agents), smart routing, automatic output	Full empirical economics workflow
StatsPAI Skill 🔥🔥	End-to-end automated empirical analysis. Agent-native econometrics Python package: 900+ functions, one `import statspai as sp` runs the full loop: EDA → research-question DSL → LLM-assisted DAG discovery → estimation → robustness. Self-describing API (`list_functions()` / `describe_function()` / `function_schema()`), unified `CausalResult` objects. Covers OLS, IV, panel data, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD (Sharp/Fuzzy/multi-cutoff/Kink), PSM, SCM, SDID, DML, Causal Forest, Meta-Learners, TMLE, AIPW, neural causal models (TARNet/CFRNet/DragonNet), text causal (`sp.causal_text`), Heckman, structural estimation (BLP). Submitted to JOSS, MIT license	Whole-pipeline automation: one agent call goes from cleaned data to robust estimates
Full Empirical Analysis Skill 📘	Traditional Python econometrics stack, explicit 8-step closed loop (philosophical counterpart to StatsPAI: DSL one-shot vs. explicit line-by-line). No wrapper — drives `pandas` + `numpy` + `scipy` + `statsmodels` + `linearmodels` + `pyfixest` + `rdrobust` + `econml` + `causalml` + `matplotlib` + `seaborn` directly. Fine-grained 8 steps: (1) data cleaning (MCAR/MAR/MNAR, IQR/z/Mahalanobis, `validate=` safe merges, panel-structure checks) → (2) variable construction (log/IHS/Box–Cox, 1/99 winsorization, z/MinMax/Robust, interactions/lags/diffs, CPI deflation, staggered-DID timing) → (3) descriptives (stratified Table 1 with SMD+t-tests, starred correlation heatmap, 4-panel distributions, DID motivation plot, panel-coverage heatmap) → (4) diagnostics (12 classes: normality / heteroskedasticity / autocorrelation / collinearity / stationarity / cointegration / endogeneity / weak-IV / overid / Hausman / RESET / Cook’s D) → (5) modeling (OLS / panel FE-RE-FD / GLM / IV-2SLS-LIML-GMM / 5 DID variants / 4 RD variants / SC / PSM-IPW-EB / DML / CF / Heckman / QR — 12 classes) → (6) robustness (M1–M6 progressive specs, cluster sensitivity, wild bootstrap, placebo, spec curve, Oster δ, LOO, Rosenbaum) → (7) further analysis (heterogeneity × 4 / outcome-ladder mechanism / Baron–Kenny + Imai mediation / moderated mediation / dose-response / spillover) → (8) publication tables & figures (`stargazer` / `etable` / coefplot / event-study / binscatter / forest / RD plot / CATE heatmap / love plot, plus LaTeX/Word/Excel export). 610-line SKILL.md spine + 8 deep reference manuals (3000+ lines), progressively loaded*	Teaching, referee-level audit, graduate replication training, rigorous empirical projects requiring line-by-line control and full diagnostic coverage
Full Empirical Analysis Skill — Stata 📊	Traditional Stata `.do` 8-step closed loop (Stata sibling of 00.1, same structure, same cadence). One `ssc install` block installs 30+ packages. End-to-end community-standard chain: `reghdfe` / `ivreg2` / `ivreghdfe` / `csdid` / `did_imputation` / `eventstudyinteract` / `sdid` / `did_multiplegt_dyn` / `bacondecomp` / `honestdid` / `rdrobust` / `rddensity` / `synth` / `synth_runner` / `psmatch2` / `teffects` / `ebalance` / `ppmlhdfe` / `boottest` / `ritest` / `rwolf` / `psacalc` / `coefplot` / `esttab` / `outreg2` / `asdoc` / `binscatter`. 8 steps: (1) `use`+`import`+`destring`+`misstable`+`merge assert`+`xtset` → (2) `winsor2`+`xtile`+`L./F./D./S.`+CPI+staggered timing → (3) `tabstat`+`balancetable`+`asdoc`+`pwcorr sig star`+`heatplot` → (4) 12 estat-style diagnostics → (5) 12 estimator classes (`reghdfe` + 5 DID + 4 RD + `synth` + `teffects` + `ebalance` + `heckman` + `qreg` + `ppmlhdfe` + `sem/gsem`) → (6) `eststo`+`esttab` M1–M6 + `boottest` + `ritest` + `rwolf` + `bacondecomp` + `honestdid` + `psacalc delta` → (7) factor-var + `margins` + `marginsplot` + `suest` + DDD + `medsem` + `khb` + SEM + Stata-Python bridge to `econml` for CATE → (8) `esttab`+`outreg2`+`asdoc` to `.tex/.rtf/.docx/.xlsx`; `coefplot`+`marginsplot`+`binscatter`+`rdplot`+`graph combine` to `.pdf`. 801-line SKILL.md + 8 deep references (3500+ lines) + complete `.do` skeleton	Referee / co-author insists on Stata replication; graduate Stata courses; AER/QJE/JPE/ReStud-style standard `.do` pipelines
Full Empirical Analysis Skill — R 📗	Modern tidyverse + fixest + Quarto stack, explicit 8-step loop (R sibling of 00.1 / 00.2; the fourth and final piece of the family). One `install.packages(...)` block installs 50+ packages. End-to-end modern R standards: `dplyr` / `tidyr` / `haven` / `janitor` / `naniar` / `mice` / `validate` / `assertr` for data; `fixest::feols/feglm/fepois` for HD FE + multi-way clustering + IV in one line; `did::att_gt` / `fixest::sunab` / `didimputation::did_imputation` / `synthdid` / `DIDmultiplegtDYN` / `bacondecomp` / `HonestDiD` for modern DID; `rdrobust` / `rddensity` / `rdmulti` / `rdlocrand` for RD; `Synth` / `gsynth` / `tidysynth` / `synthdid` for SC; `MatchIt` / `WeightIt` / `cobalt` / `ebal` for matching; `grf::causal_forest` / `DoubleML` for ML causal; `mediation::mediate` + `medsens` / `lavaan::sem` for mediation; `marginaleffects` for post-estimation; `modelsummary` / `kableExtra` / `gt` / `flextable` for tables; `ggplot2` + `iplot` + `binsreg` + `cowplot` + `patchwork` for figures. 8-step R pipeline + Quarto template (one `.qmd` holding narrative + code + tables + figures, `quarto render` for PDF/HTML/Word in one go). 893-line SKILL.md + 8 deep references (3700+ lines), progressively loaded	Quarto reproducibility reports, academic blogs (distill / quarto blog), graduate R courses, projects needing `marginaleffects` + Imai sensitivity mediation + `grf` CATE post-estimation
AER-Skills 📕🔥	Top-5 economics submission skill stack (AER / AER:Insights / AEJ family), complementary to StatsPAI / 00.x “run the analysis” — specialised in “write the paper + submit + R&R”. Nine skills, full pipeline: `aer-topic-selection` (AER vs Insights vs AEJ routing) → `aer-identification` (identification audit: modern DiD / weak IV / boundary RDD pitfalls) → `aer-robustness` (referee-anticipating robustness matrix) → `aer-introduction` (Keith Head five-paragraph intro) → `aer-tables-figures` (AER booktabs typesetting) → `aer-replication` (AEA Data and Code Availability Policy package, openICPSR-ready) → `aer-submission` (preflight: 100-word abstract, disclosure, cover letter) → `aer-rebuttal` (R&R letters written against the revised manuscript) → `aer-workflow` (orchestrator). Identification-first — if your design is fragile, no prose will save it. Covers AER 100-word abstract / AER:Insights 7000-word limit / 45% desk-rejection / AEA mandatory replication — top-5-specific constraints that generic scientific-writing skills do not cover. `scripts/sync-aer-skills.sh` + weekly GH Actions loop syncs from upstream brycewang-stanford/AER-skills. License: MIT	Full AER / AER:Insights / AEJ submission flow: topic routing → identification audit → writing → typesetting → replication package → submission → R&R rebuttal
claesbackman/AI-research-feedback	2-agent economics paper pre-review: causal overclaiming detection, identification strategy assessment; supports AER/QJE/JPE/Econometrica/REStud; 6-agent grant review	Pre-submission self-review, grant applications
fuhaoda/stats-paper-writing-agent-skills	LaTeX statistical paper writing, front-end draft generation	Statistics & econometrics papers
dylantmoore/stata-skill	Full Stata coverage: syntax, data management, econometrics, causal inference, graphics, Mata, 20+ community packages	Stata users
SepineTam/stata-mcp	LLM operates Stata regression directly via MCP, “evolve from regression monkey to causal thinker”	Stata econometrics

🚨 Anti-AIGC Detection & De-AI Academic Writing (Highlighted)

This is one of the most critical pain points in academic writing in 2026. Papers failing AIGC detection can be rejected outright, and detectors like Turnitin, GPTZero, and China’s CNKI are getting stricter. The 4 skills below are the most authoritative and complete solutions on GitHub — all MIT open-source, and all locally archived in this repo (skills/44-47).

Suite	Key Features	Use Case	Local Path
CoPaper.AI / chinese-de-aigc 🇨🇳🔥	Original Chinese academic de-AIGC skill by CoPaper.AI team. Targets China’s CNKI AMLC / Wanfang / VIP / Turnitin Chinese detectors. 17-pattern library of Chinese-specific AI tells (4-char clichés, hollow connectives, explicit transitions, absolutist claims, sentence-length uniformity), 5-step closed loop workflow (locate→diagnose→rewrite→self-score→review), per-section strategy, 5-dim scoring rubric. Currently the only GitHub skill dedicated to Chinese academic de-AIGC	Chinese journal submissions, theses, grant proposals	`skills/48`
matsuikentaro1/humanizer_academic 🔥	Academic-specific. 23 AI writing patterns (6 content + 6 language + 3 style + 3 filler + 5 word choice), examples from EMPA-REG OUTCOME cardiovascular trials, preserves legitimate academic transitions, based on Wikipedia “Signs of AI writing”	Medical, life sciences, natural science papers	`skills/44`
stephenturner/skill-deslop	Scientific writing de-AI. Smartly distinguishes legitimate discipline conventions (passive voice in methods) from AI tells; 5-dimension scoring (directness/rhythm/trust/authenticity/density); 4 reference files (examples/phrases/structures/tropes)	Scientific papers, technical blogs	`skills/45`
hardikpandya/stop-slop	3-layer detection + 5-dim scoring. Banned phrases (throat-clearing openers, emphasis crutches, corporate jargon), structural clichés (binary contrasts, dramatic fragmentation, false agency), sentence-level rules (no em dash, no Wh- starters). Below 35/50 → revise	General prose, blogs, reports	`skills/46`
conorbronsdon/avoid-ai-writing	Structured audit + rewrite + second-pass audit. Four-section output: identified issues (with quotes) → rewrite → change summary → second audit. Compatible with Claude Code, OpenClaw, Hermes, and other agents	Workflows needing auditable, traceable revision	`skills/47`
ShiyanW/ai-revision-guard	Prevents over-refinement (different angle). Limits revision rounds (≤2 per section), 7-point homogenization checklist, cross-model verification. Protects author’s voice from AI erosion	Multi-round polishing scenarios	(community PR)

Recommended combos:

🇨🇳 Chinese academic papers (CNKI/Wanfang/VIP) → chinese-de-aigc (original) + revision-guard

🇬🇧 English academic papers → humanizer_academic + revision-guard (prevent over-refinement)

Bilingual papers → chinese-de-aigc + humanizer_academic combined

Need auditable workflow → avoid-ai-writing (structured reports)

General writing → stop-slop (5-dim scoring for quantified improvement)

Finance & Investment Research

Suite	Key Features	Use Case
anthropics/financial-services-plugins	Anthropic official: investment banking, equity research, private equity, wealth management	Financial services
OctagonAI/skills	Octagon agentic financial research Claude Skills	Institutional financial research
tradermonty/claude-trading-skills	Stock investing & trading: market analysis, technical charts, economic calendar, strategy development	Quantitative trading research
himself65/finance-skills	Agent Skills open standard, earnings analysis, consensus estimates, analyst sentiment	Financial analysis
quant-sentiment-ai/claude-equity-research	Institutional equity research: fundamental analysis, technical indicators, risk assessment	Equity research

Education & Public Health

Suite	Key Features	Use Case
GarethManning/claude-education-skills	Evidence-based education Claude Skills, designed for teachers and agent orchestration	Education research
FreedomIntelligence/OpenClaw-Medical-Skills	869 medical AI Skills: epidemiology, public health surveillance, clinical research, drug safety, biostatistics	Public health, medical research

Governance, Compliance & Law

Suite	Key Features	Use Case
Sushegaad/Claude-Skills-Governance-Risk-and-Compliance	GRC Skills: ISO 27001, SOC 2, GDPR, HIPAA compliance guidance (94% vs 72% baseline)	Compliance research, policy analysis
zubair-trabzada/ai-legal-claude	Legal assistant: contract review, risk analysis, NDA generation, compliance audit, 14 Skills + 5 agents	Law & economics, regulatory research
evolsb/claude-legal-skill	AI contract review: CUAD risk detection, market benchmarks, attorney-grade red-lining	Law & economics research

Marketing & Consumer Behavior

Suite	Key Features	Use Case
coreyhaines31/marketingskills	CRO, copywriting, SEO, analytics, and growth engineering	Marketing research
zubair-trabzada/ai-marketing-claude	15 Skills + parallel sub-agents: website audit, copy, email sequences, competitive intelligence	Consumer behavior analysis
ericosiu/ai-marketing-skills	Growth experiments, sales pipeline, content operations, SEO, financial automation	Marketing strategy research

Product Management & Organizational Behavior

Suite	Key Features	Use Case
phuryn/pm-skills	100+ agent Skills: discovery → strategy → execution → launch → growth, 65 PM Skills + 36 chained workflows	Product management, organizational research
mastepanoski/claude-skills	UX/UI evaluation (Nielsen heuristics, WCAG), AI governance (NIST AI RMF, ISO 42001)	UX research

General Agent Capabilities

Suite	Stars	Key Features
lyndonkl/claude	-	85 skills + 6 orchestration agents, incl. causal inference, Bayesian reasoning, experimental design, multi-criteria analysis
alirezarezvani/claude-skills	~5,200	220+ skills + 298 CLI scripts, incl. financial analysis and data processing
rohitg00/awesome-claude-code-toolkit	-	135 agents incl. data scientist agent (EDA, DID, RDD), 35 skills, 42 commands
jeremylongshore/claude-code-plugins-plus-skills	-	340 plugins + 1,367 agent skills, CCPI package manager
affaan-m/everything-claude-code	-	Skills, intuition, memory, security, research-first development framework
posit-dev/skills	-	Posit official: modern-r-tidyverse, predictive-modeling, quarto-authoring, shiny-bslib

Multi-Agent Collaboration Systems

A single Skill solves a point problem; multi-agent systems solve end-to-end workflows. These systems let multiple AI roles divide work, cross-review, and produce output quality far beyond what a single agent can achieve:

Paper Revision & Writing

System	Architecture	Key Features
copy-edit-master	3 sub-agents: structure-editor + line-editor + quality-reviewer	Auto document type detection, Strunk & White / McCloskey rules encoded, git checkpoints per phase, review loop (max 2 iterations)
introduction-writer	4 sub-agents: strategist → drafter → reviewer → reviser	Keith Head formula for drafting introductions, reviewer independent from drafter for quality loop
CoPaper.AI PaperAgent	Supervisor + 4 sub-agents (preparation / modeling / visualization / writing)	Skills routed by target_agent, each sub-agent sees only relevant methodology guidance, reduced context noise

Why multi-agent beats single agent? When the same agent writes and reviews, it tends to approve its own work. Role separation means the reviewer is independent from the drafter — forming a genuine quality loop. Same logic as academic peer review.

Data Analysis & Research

System	Source	Key Features
ruc-datalab/DeepAnalyze	Renmin Univ.	Autonomous data analysis agent, raw data → professional report, CSV/Excel/JSON/DB support, open-source DeepAnalyze-8B
business-science/ai-data-science-team	Business Science	Multi-agent data science team: EDA Agent + SQL Agent + MLflow Agent, LangChain integration
HungHsunHan/claude-code-data-science-team	Community	Claude Code multi-agent data science team, auto cleaning → modeling → executable Notebook
HKUDS/AI-Researcher	HKU (NeurIPS 2025 Spotlight)	Fully autonomous research pipeline: lit review → hypothesis → algorithm → paper
wanshuiyin/Auto-claude-code-research-in-sleep (ARIS)	Community	Overnight autonomous research, cross-model review loops (Claude + external LLM as critic)
SamuelSchmidgall/AgentLaboratory	Academic (ICLR)	End-to-end autonomous research: lit review → experiments → report, arXiv/HuggingFace/LaTeX integration, 84% cost reduction
SakanaAI/AI-Scientist-v2	Sakana AI	Fully automated scientific discovery: hypothesis → experiments → paper, first AI-generated paper accepted via peer review
assafelovic/gpt-researcher	Community	Autonomous deep research agent, supports any LLM provider
LitLLM/LitLLM	Academic	AI literature review assistant: keyword extraction + multi-strategy retrieval + re-ranking, RAG-based
pedrohcgs/claude-code-my-workflow	Emory Univ.	Academic LaTeX/Beamer + R template, multi-agent review + quality gates, adopted by 15+ research groups
hugosantanna/clo-author	Community	Extends Sant’Anna’s workflow from lecture production to full social science empirical research publication

Academic Data MCP Servers

System	Key Features
xingyulu23/Academix	Unified academic research interface aggregating OpenAlex + DBLP + Semantic Scholar + arXiv + CrossRef
Eclipse-Cj/paper-distill-mcp	11-source parallel search, 4-dimension weighted ranking (relevance/recency/impact/novelty)
oksure/openalex-research-mcp	OpenAlex API: search 240M+ academic works, citation analysis, trend tracking, collaboration networks
zongmin-yu/semantic-scholar-fastmcp-mcp-server	Full Semantic Scholar API access: papers, authors, citation networks
openags/paper-search-mcp	Search 20+ sources: arXiv, PubMed, bioRxiv, Google Scholar, SSRN, Unpaywall, etc.
aringadre76/mcp-for-research	Integrates PubMed + Google Scholar + ArXiv + JSTOR, published on NPM
blazickjp/arxiv-mcp-server	arXiv paper search and analysis MCP
lzinga/us-gov-open-data-mcp	40+ US government APIs (FRED/Census/CDC/FDA/FEC, etc.), 250+ tools
stefanoamorelli/fred-mcp-server	Direct access to FRED’s 800K+ economic time series
llnOrmll/world-bank-data-mcp	World Bank Data360, 1000+ socioeconomic indicators, 200+ countries
54yyyu/zotero-mcp	Connect Zotero library with AI assistants: paper review, summaries, citation analysis, PDF annotation
datagouv/datagouv-mcp	French national open data platform MCP

Skill Aggregation Platforms & Discovery Tools

Don’t know where to find Skills? These platforms are your starting point:

Platform	Scale	Features
VoltAgent/awesome-agent-skills	1,000+ skills	13,700 stars, curated by official team and community
sickn33/antigravity-awesome-skills	1,340+ skills	28,000 stars, one-click install `npx antigravity-awesome-skills`
VoltAgent/awesome-openclaw-skills	5,400+ skills	Curated from OpenClaw registry (ClawHub 13,729 Skills)
jeremylongshore/claude-code-plugins-plus-skills	1,367 skills	340 plugins + CCPI package manager
skills.sh	Online market	Searchable Skill marketplace
ClawHub (clawhub.com)	13,729 skills	Open-source AI skill marketplace, one-line install
Agent Skills Standard	Spec docs	Universal Agent Skills specification
Anthropic Official Skills	Official	PDF/DOCX/XLSX/PPTX document processing
Anthropic Official Plugin Market	Official	Anthropic-managed high-quality Claude Code plugin catalog
Anthropic Knowledge Work Plugins	Official	11 plugins incl. Data Plugin (SQL queries, data exploration)
Anthropic Financial Services Plugins	Official	Financial services plugins: IB, equity research, PE, wealth mgmt

Learning Resources

Official Documentation

Claude Code Skills Complete Guide — Anthropic’s official 32-page guide
Agent Skills Standard Specification
Claude Code Official Docs

Academic Talks & Courses

AI Agents for Economics Research — Aniket Panjwani, CEPR/VoxDev, 2026.03
Claude Code & Cowork for Academic Research — A Practical Guide — Practical guide for economists and social scientists, 2026.02
Building Claude Code Workflow for Economics Scholars — Building Claude Code workflows for economics researchers

Causal Inference Textbooks

Causal Inference for the Brave and True — Chinese translation, Python code
Statistical Tools for Causal Inference — Open-source textbook
Causal Inference and Machine Learning Book

Survey Papers & Awesome Lists

A Survey of Data Agents — Data agent survey (HKUST)
From AI for Science to Agentic Science — arXiv:2508.14111
From Automation to Autonomy — LLM scientific discovery survey (EMNLP 2025)
Awesome Agents for Science — Papers on LLMs and agents in scientific R&D
Awesome AI for Science — AI tools, papers, datasets for accelerating scientific discovery
Awesome AI for Economists — AI tools, libraries, and resources for economics (OpenEcon team)
Awesome Econ AI Stuff — AI Skills collection for economists, follows SKILL.md standard
AI for Grant Writing — Curated resources for LLM-assisted grant writing
Awesome AI Scientist Papers — AI scientist / robot scientist papers
FreedomIntelligence/OpenClaw-Medical-Skills — 869 medical AI Skills, covering epidemiology, public health, biostatistics

Community & References

Awesome Claude Skills — Community curated
Awesome Claude Skills (ComposioHQ) — Curated Claude Skills list
Awesome Claude Skills (BehiSecc) — Curated Claude Skills list
Awesome Claude Code — Skills, Hooks, slash commands, agent orchestrators
Reddit r/ClaudeCode
Anthropic Claude Code Skills Cookbook — Financial applications Skills tutorial

🛡️ Security Scan

We ran a systematic security audit across all 52 Skills / 2,940+ files in this repository — 52/52 CLEAN, zero FLAGGED. Every “sensitive” hit was verified as legitimate content. No malicious prompts, viruses, trojans, or reverse shells were found. Bottom line: every Skill in this repo is safe to use.

Skills Security Scan Overview

Six-phase defense-in-depth methodology:

Automated pattern scan — grep across 13 risk categories (pipe-to-shell, reverse shell, credential exfil, decode-and-run, mining/RAT signatures, prompt injection, etc.)
Hook & permission matrix audit — 100% manual review of all 6 hook-bearing Skills and their 40+ hook scripts. Permission allow-lists are restricted to research tooling — no Bash(*) wildcards anywhere.
Three parallel agent content audits — independent reviews of SKILL.md prose, agent definitions, and reference documentation for prompt injection, backdoors, hidden Unicode, suspicious package sources.
Supplemental integrity checks — hidden characters, encoding anomalies, ultra-long lines, HTML injection, network-related imports, high non-ASCII ratios.

Result distribution: every hit fell into one of three legitimate categories:

🛡️ Defensive security rules — deny rules, bash-safety hooks, credential detectors. 17-DAAF is the strongest “security-aware” reference in this batch (14 defensive hooks + 32 deny rules + active credential scanning).
📚 Legitimate academic API calls — arXiv / CrossRef / PubMed / Semantic Scholar / FRED / World Bank / OECD / BLS, etc.
🔁 Standard Claude Code workflow hooks — project scaffolding, state save/restore, context monitoring, session archive, pre-commit reminders — all local file operations, zero network IO.

Key insight: largest size ≠ highest risk. The Top 5 largest Skills (43-wentorai 478 files / 33-Galaxy-Dawn 327 files / 17-DAAF 319 files / 35-bahayonghang 264 files / 18-jusi-aalto 126 files) all passed full audit, with 17-DAAF actually setting the bar for security-conscious design.

The full report includes Phase 1-6 methodology, a per-Skill audit table for all 52 Skills, and 5 visual infographics: 📋 SECURITY-SCAN-REPORT.md

Contributing

Contributions welcome! Please read CONTRIBUTING.md to learn how to submit new Skill recommendations.

We especially welcome:

Skills for social science disciplines: economics, political science, sociology, psychology, education, public administration, public health
New Skill implementations for causal inference methods
Skills for business disciplines: finance, management, marketing, law & economics
General academic Skills: literature review, grant writing, research proposals
MCP servers (academic databases, government data APIs)
Chinese-friendly Skills
Multi-agent collaboration system case studies

Star History

If this list helps you, please give it a Star so more researchers can find it.

AI is an amplifier, not a replacement. It handles the heavy lifting; you keep the core judgment.

_{Stanford REAP × CoPaper.AI · An academic-industrial AI toolkit for empirical research}

Visit copaper.ai

WeChat: CoPaper.AI

20 built-in methodology Skills · 20-minute empirical paper · Powered by StatsPAI (900+ functions, MIT)

Maintained by CoPaper.AI, incubated at Stanford REAP / SCCEI | AI Assistant for Empirical Research