@tom_doerr: Automates empirical research with 23,000 AI agent skills https://github.com/brycewang-stanford/Auto-Empirical-Research-…


Summary

Auto-Empirical-Research-Skills (AERS) is an open-source toolkit that automates the entire empirical research pipeline using 23,000+ AI agent skills, from data cleaning to submission-ready drafts.


Cached at: 06/02/26, 09:38 PM



brycewang-stanford/Auto-Empirical-Research-Skills

Source: https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills

Auto-Empirical Research Skills (AERS, 23K+ Skills)

Repository renamed → now “Auto”. This project started life as Awesome Agent Skills for Empirical Research and has been renamed to Auto-Empirical-Research-Skills (AERS). The new name reflects the core idea: not just a collection of skills, but an agent that automatically runs the full empirical-research pipeline end to end — from raw data cleaning → identification & estimation → robustness checks → tables, figures, and a submission-ready draft — with minimal human hand-holding.

GitHub automatically redirects the old URL, but please update your bookmarks and local remote:

git remote set-url origin https://github.com/brycewang-stanford/Auto-Empirical-Research-Skills.git

🌐 Language / 语言: English | 中文



Stanford REAP × CoPaper.AI · An academic-industrial AI toolkit for empirical research
Crafted by Stanford’s empirical methodology team — covering the full pipeline from data cleaning to top-journal submission


Awesome · GitHub stars · License: CC BY-SA 4.0 · PRs Welcome · Maintained by CoPaper.AI from Stanford REAP · Powered by StatsPAI · Security Scanned · Files Audited · Audit Phases · Hooks Audited · Zero Threats · Validate catalog · OpenSSF Scorecard

The Definitive Collection of AI Agent Skills for Empirical Research — 119 GitHub Repos / 23,000+ Skills

A curated, opinionated list of 119 GitHub repositories and 23,000+ AI Agent Skills for empirical research in economics, political science, sociology, psychology, public health, education, management, finance, and public policy — organized by research workflow, from topic selection to journal submission.

In 2026, the way we do empirical research is being redefined.

CoPaper.AI, an empirical research AI assistant incubated by researchers at Stanford REAP / SCCEI (Stanford Center on China’s Economy and Institutions), can complete a publication-quality empirical paper in 20 minutes: from data import, descriptive statistics, causal inference models, and robustness checks to formatted result tables, all in one go. The secret isn’t a more powerful model but Skills: senior researchers’ methodological expertise encoded into structured workflows, so the AI knows “what a complete DID analysis should include” instead of waiting for you to remind it step by step.

This repository is the Agent Skills landscape we compiled while building CoPaper.AI. We organized hundreds of Skills repos and tens of thousands of Skills scattered across GitHub, communities, and academia by research workflow stages, so you can pick what you need.

🎓 Three Layers of Trust · Why It’s Us Building This

| Layer | Anchor | Lever |
| --- | --- | --- |
| 🏛️ Academic lineage | Stanford REAP / SCCEI (Stanford Center on China’s Economy and Institutions) | A research center with a sustained publication record in empirical economics methodology and a deep tradition in applied causal inference |
| 🔧 Engineering delivery | CoPaper.AI empirical research AI assistant | Ships with 20 econometric methodology Skills (DID/IV/RDD/PSM/DML, etc.), Supervisor + 4 sub-agent multi-agent architecture, one-sentence triggers, automatic result output |
| ⚙️ Open-source engine | StatsPAI, the causal-inference engine that powers CoPaper.AI | 900+ functions · one import statspai as sp · JOSS in submission · MIT-licensed. Every DID/IV/RD/SCM estimate CoPaper.AI produces is driven by StatsPAI; this Skills collection is itself part of the StatsPAI ecosystem |

🔒 Use with confidence: every one of the 52 Skills / 2,940+ files in this repo passed our systematic security audit (52/52 CLEAN, zero FLAGGED, zero exfiltration, zero reverse shells, zero prompt injection).

💡 Want it out of the box? Skip the Skills assembly — try → copaper.ai and let the Stanford methodology team run the empirical pipeline end-to-end for you.


Start Here

make catalog
make validate

Pick a Workflow in 30 Seconds

| Goal | Start with |
| --- | --- |
| Run a complete empirical pipeline | StatsPAI_skill |
| Audit a top-5 economics identification strategy | aer-identification |
| Prepare AER / AEJ submission | aer-workflow |
| Build a replication package | aer-replication |
| Lower Chinese academic AI-writing signal | chinese-de-aigc |

🆕 Changelog

2026-05-25: 📕 AER-skills vendored — Top-5 economics (AER / AER:Insights / AEJ) submission stack (9 skills, skills/50) + weekly auto-sync workflow
  • 📕 skills/50-brycewang-aer-skills: This repo’s sister project brycewang-stanford/AER-skills vendored in whole, with the same StatsPAI-style sync loop (scripts/sync-aer-skills.sh + .github/workflows/sync-aer-skills.yml — Monday 06:00 UTC weekly diff, PR on drift). Positioning: a top-5 economics submission skill stack (AER / AER:Insights / AEJ family), extending the StatsPAI / 00.x “analysis” line to the “manuscript + submission” line.
    • 🧱 Nine skills covering the full submission pipeline: aer-topic-selection (AER vs Insights vs AEJ routing) → aer-identification (identification audit: modern DiD / weak IV / boundary RDD pitfalls) → aer-robustness (referee-anticipating robustness matrix) → aer-introduction (Keith Head five-paragraph intro) → aer-tables-figures (AER booktabs typesetting) → aer-replication (AEA Data and Code Availability Policy package, openICPSR-ready) → aer-submission (preflight: 100-word abstract, disclosure, cover letter) → aer-rebuttal (R&R letters written against the revised manuscript, not the old draft) → aer-workflow (orchestrator that tells you which skill to use next).
    • 🆚 Differentiation from existing skills: StatsPAI / 00.x solve “how to run the analysis correctly”; AER-skills solves “how to write the paper to top-5 acceptance threshold” — the AER 100-word abstract / AER:Insights 7000-word limit / 45% desk-rejection rate / AEA mandatory replication are top-5-specific constraints that generic scientific-writing skills (Nature-Paper-Skills etc.) do not cover. Identification-first: if your design is fragile, no prose will save it.
    • 🔁 Vendor-sync loop: git clone --depth=1 upstream → rsync -a --delete --exclude='.git' mirror the whole tree → diff content hashes before/after, exit 0 on no drift, exit 1 on drift to trigger peter-evans/create-pull-request@v6 on chore/sync-aer-skills branch. Supports manual workflow_dispatch for on-demand sync.
    • License: MIT — consistent with StatsPAI / 00.x; commercial and academic use both allowed.
    • First upstream commit: 7e9c44d (2026-05-25, includes modern-aer-exemplars.md with 30+ subfield-organized papers).
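
The drift check in the vendor-sync loop above (hash the mirrored tree before and after the sync, then exit 0 or 1) can be sketched in Python. This is a minimal illustration under stated assumptions, not the contents of scripts/sync-aer-skills.sh: the function names `tree_digest` and `drift_exit_code` are hypothetical, and SHA-256 over relative paths plus file bytes is one possible choice of content hash.

```python
import hashlib
from pathlib import Path


def tree_digest(root: Path) -> str:
    """Return one SHA-256 digest covering every file path and its bytes under root."""
    digest = hashlib.sha256()
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        rel = path.relative_to(root)
        if ".git" in rel.parts:  # mirrors the rsync --exclude='.git' rule
            continue
        digest.update(rel.as_posix().encode("utf-8"))
        digest.update(b"\0")  # separator so path and content cannot blur together
        digest.update(path.read_bytes())
        digest.update(b"\0")
    return digest.hexdigest()


def drift_exit_code(before: str, after: str) -> int:
    """0 when the mirrored tree is unchanged, 1 when it drifted (hypothetical helper)."""
    return 0 if before == after else 1
```

A CI step could take one digest before the mirror step and one after, then pass the result of `drift_exit_code` to `sys.exit` so that only a non-zero exit triggers the pull-request job.
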
2026-04-28: 🛡️ Repository-wide security scan complete — all 52 Skills CLEAN, zero FLAGGED
  • 🛡️ SECURITY-SCAN-REPORT.md: We ran a systematic security audit across all 52 Skills / 2,940+ files in this repository. No malicious prompts, viruses, trojans, reverse shells, or other malicious content were found. Bottom line: every Skill in this repo is safe to use.
    • 🔍 Six-phase defense-in-depth methodology: (1) automated grep across 13 risk categories (pipe-to-shell, reverse shell, credential exfil, decode-and-run, mining/RAT signatures, prompt injection, etc.) → (2) 100% manual review of all 6 hook-bearing Skills and their 40+ hook scripts → (3) three parallel agents auditing SKILL.md prose, agent definitions, and reference docs separately → (4) supplemental integrity checks (hidden Unicode, encoding anomalies, ultra-long lines, HTML injection, network-related imports).
    • 📊 Result distribution: every “sensitive” hit verified as one of three legitimate categories — defensive security rules (deny rules, bash-safety hooks, credential detectors), legitimate academic API calls (arXiv / CrossRef / PubMed / FRED / World Bank / OECD / BLS), or standard Claude Code workflow hooks (scaffolding / state save / context monitor — all local file operations, zero network IO).
    • 🔑 Key insight: 17-DAAF is actually the strongest “security-aware” reference template in this batch (14 defensive hooks + 32 deny rules + active credential scanning). Largest size ≠ highest risk.
    • 📈 Visual infographics: 5 zhihu-style information graphics embedded in the report (overview / methodology / threat matrix / Top 5 size distribution / supplemental scan) — readable in 3 seconds.
    • See the full security scan report for details.
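
The first audit phase above, an automated pattern search across risk categories, can be sketched as a small regex scanner. The three patterns below are illustrative assumptions written for this example; they are not the audit's actual rule set, which covers 13 categories and is documented in SECURITY-SCAN-REPORT.md.

```python
import re

# Illustrative patterns only; the real audit's rules are not reproduced here.
RISK_PATTERNS = {
    "pipe-to-shell": re.compile(r"\b(curl|wget)\b[^\n|]*\|\s*(ba|z)?sh\b"),
    "reverse-shell": re.compile(r"/dev/tcp/|\bnc\b[^\n]*\s-e\s"),
    "decode-and-run": re.compile(r"base64\s+(-d|--decode)[^\n|]*\|\s*(ba|z)?sh\b"),
}


def scan_text(text: str) -> list[tuple[int, str]]:
    """Return (line_number, category) for every line that matches a risk pattern."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for category, pattern in RISK_PATTERNS.items():
            if pattern.search(line):
                hits.append((lineno, category))
    return hits
```

Hits from a scanner like this are only candidates: as the result distribution above notes, defensive rules and legitimate API calls also match, so each hit still needs manual review.
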
2026-04-24: 📗 Full Empirical Analysis Skill (R) shipped — tidyverse + fixest, 8-step Quarto-friendly loop (skills/00.3)
  • 📗 Full Empirical Analysis Skill (R): Same-day fourth member of the family, vendored at skills/00.3-Full-empirical-analysis-skill_R/ (slot #0.3), the R / Quarto edition.
    • 🧱 Modern tidyverse + fixest stack: dplyr / tidyr / haven for data; fixest::feols/feglm/fepois as the panel/IV/DID workhorse (one line for HD FE + multi-way cluster + IV); did::att_gt + fixest::sunab + didimputation::did_imputation + synthdid + DIDmultiplegtDYN + bacondecomp + HonestDiD for modern DID; rdrobust / rddensity / rdmulti / rdlocrand for RD; Synth / gsynth / tidysynth / synthdid for synthetic control; MatchIt / WeightIt / cobalt / ebal for matching; grf::causal_forest + DoubleML for ML causal; mediation::mediate + lavaan::sem for mediation; marginaleffects::avg_slopes / plot_slopes for post-estimation; modelsummary / kableExtra / gt / flextable for publication tables; ggplot2 + iplot + binsreg + cowplot + patchwork for figures; Quarto to render PDF/HTML/Word in one command.
    • 🔁 8-step R closed loop (mirrors 00.1 / 00.2): (1) Import & cleaning (read_dta + clean_names + naniar::vis_miss + mice + validate / assertr) → (2) Variable construction (mutate + across + DescTools::Winsorize + scale + arrange %>% group_by %>% lag/lead) → (3) Descriptives (gtsummary::tbl_summary + modelsummary::datasummary_balance + psych::corr.test + corrplot / ggcorrplot) → (4) Diagnostics (12 classes: shapiro.test / tseries::jarque.bera.test / lmtest::bptest / dwtest / bgtest / car::vif / tseries::adf.test / kpss.test / plm::pbgtest / pcdtest / phtest / lmtest::resettest) → (5) Estimation (12 classes: feols + AER::ivreg + did::att_gt + fixest::sunab + didimputation + synthdid + rdrobust + tidysynth + gsynth + MatchIt + WeightIt + ebal + grf::causal_forest + DoubleML + sampleSelection::heckit + quantreg::rq + lavaan::sem) → (6) Robustness (modelsummary for M1–M6 + clubSandwich + fwildclusterboot::boottest + ri2::conduct_ri + bacondecomp::bacon + HonestDiD::createSensitivityResults + robomit::o_test/o_beta) → (7) Further analysis (formula interactions + marginaleffects::plot_slopes + mediation::mediate + medsens + lavaan::sem multi-group + grf::causal_forest CATE + splines::ns dose-response) → (8) Publication output (modelsummary to LaTeX/Word/HTML/Markdown in one call + fixest::iplot + marginaleffects::plot_slopes/predictions + cowplot::plot_grid + patchwork + Quarto rendering).
    • 📚 Progressive disclosure + Quarto-native: SKILL.md 893-line spine (with full install.packages list, project skeleton, Quarto YAML template); 8 references/NN-*.md totalling 3700+ lines. The Quarto template makes “narrative + code + tables + figures” render to a single self-contained report from a single .qmd source.
    • 🆚 Four-skill positioning: StatsPAI = Python one-shot DSL; 00.1 = explicit Python; 00.2 = explicit Stata; 00.3 = R + tidyverse + Quarto. Four parallel implementations of the same 8 steps, none replacing the others. The Quarto-rendered reproducibility report is unique to 00.3.
    • Use cases: Quarto-rendered replication reports, academic blogs (distill / quarto blog), graduate R courses, rigorous projects needing marginaleffects + mediation + grf post-estimation, anything R-flavoured outside of pure Bayesian work.
2026-04-24: 📊 Full Empirical Analysis Skill (Stata) shipped — traditional Stata ecosystem, 8-step .do loop (skills/00.2)
  • 📊 Full Empirical Analysis Skill (Stata): Same-day Stata sibling of StatsPAI / 00.1, vendored at skills/00.2-Full-empirical-analysis-skill_Stata/ (slot #0.2), for Stata users.
    • 🧱 Traditional Stata ecosystem, de-facto standard command chain: every step calls community-standard commands reghdfe / ivreg2 / ivreghdfe / csdid / did_imputation / eventstudyinteract / sdid / did_multiplegt_dyn / bacondecomp / honestdid / rdrobust / rddensity / synth / synth_runner / psmatch2 / teffects / ebalance / ppmlhdfe / boottest / ritest / rwolf / psacalc / coefplot / esttab / outreg2 / asdoc / binscatter. Built for referee-level Stata replication packs; one ssc install block installs 30+ packages.
    • 🔁 8-step .do loop (same structure as 00.1, Stata-native rewrite): (1) Import & cleaning (use/import excel/import sas/destring/misstable/mdesc/duplicates report/merge m:1 ... assert(match using)/xtset/xtdescribe/mi impute chained) → (2) Variable construction (winsor2 by(industry year)/egen std/xtile/xtset + L./F./D./S./CPI deflation/first_treat+rel_time+gvar) → (3) Descriptives (tabstat/balancetable/asdoc sum/pwcorr, sig star(.05)/heatplot/twoway kdensity/xtdescribe) → (4) Diagnostics (12 classes: swilk/sktest/estat hettest/estat imtest, white/xtserial/xttest3/xtcsd, pesaran/estat vif/dfuller/kpss/xtunitroot ips/llc/hausman fe re/estat ovtest/linktest) → (5) Estimation (12 classes: reghdfe+areg+xtreg, fe/re/ivreg2+ivreghdfe+ivregress liml/gmm/csdid+eventstudyinteract+did_imputation+sdid+did_multiplegt_dyn/rdrobust+rdmc+rddensity/synth+synth_runner/psmatch2+teffects psmatch/ipwra/aipw+ebalance+cem/heckman+heckprob/qreg+sqreg/ppmlhdfe/sem+gsem) → (6) Robustness (eststo+esttab M1–M6, multi-cluster, boottest, ritest, rwolf, bacondecomp, honestdid, psacalc delta) → (7) Further analysis (factor-var interactions+margins+marginsplot/suest cross-eq Wald/DDD/outcome ladder coefplot/medsem+khb+sem estat teffects/dose-response via xtile or bspline/Stata-Python bridge to econml for CATE/spillover) → (8) Publication output (esttab+outreg2+asdoc to .tex/.rtf/.docx/.xlsx; coefplot+marginsplot+binscatter+rdplot+graph combine to .pdf).
    • 📚 Progressive disclosure: SKILL.md 801-line spine (full ssc install list + complete .do skeleton + library cheat-sheet); 8 references/NN-*.md totalling 3500+ lines, loaded on demand.
    • 🆚 Triple positioning (now extended to 4 with 00.3): StatsPAI = Python DSL one-shot; 00.1 = explicit Python; 00.2 = explicit Stata, the only choice when a referee or co-author insists on Stata replication.
    • Use cases: referee-level Stata replication packs, graduate Stata courses, AER/QJE/JPE/ReStud-style standard .do pipelines, rigorous research needing the full modern DID toolkit (bacondecomp + honestdid + psacalc).
2026-04-24: 📘 Full Empirical Analysis Skill shipped — traditional Python econometric stack, explicit 8-step loop (skills/00.1)
  • 📘 Full Empirical Analysis Skill: Same-day sibling to StatsPAI, vendored at skills/00.1-Full-empirical-analysis-skill_Python/ (slot #0.1), the explicit / auditable counterpart.
    • 🧱 Traditional Python econometrics stack, no DSL wrapper: every step directly calls pandas / numpy / scipy / statsmodels / linearmodels / pyfixest / rdrobust / econml / causalml / matplotlib / seaborn — every line of agent-written code is inspectable and swappable.
    • 🔁 8-step closed loop (finer granularity than StatsPAI’s 6 steps): (1) Data cleaning (MCAR/MAR/MNAR handling, IQR/z/Mahalanobis outliers, validate= on every merge, panel-structure checks) → (2) Variable construction (log/IHS/Box–Cox, 1/99 winsorization, z/MinMax/Robust scaling, interactions/lags/diffs, CPI deflation, staggered-DID timing vars) → (3) Descriptive statistics (stratified Table 1 with SMDs+t-tests, starred correlation heatmap, 4-panel distribution figure, DID motivation plot, panel-coverage heatmap) → (4) Diagnostic tests (12 classes: normality / heteroskedasticity / autocorrelation / multicollinearity / stationarity / cointegration / endogeneity / weak-IV / overid / panel Hausman / RESET / Cook’s D) → (5) Baseline modeling (12 classes of estimators: OLS / panel FE-RE-FD / GLM / IV-2SLS-LIML-GMM / DID×5-2×2/TWFE/event-study/CS/SA/BJS/SDiD / RD-Sharp/Fuzzy/Kink/multi-cutoff / SC / PSM-IPW-EB / DML / Causal Forest / Heckman / Quantile) → (6) Robustness battery (M1–M6 progressive specs, cluster-level sensitivity, wild cluster bootstrap, placebo timing+permutation, specification curve, Oster δ*, LOO, Rosenbaum) → (7) Further analysis (heterogeneity × 4, outcome-ladder mechanism, Baron–Kenny + Imai mediation, moderated mediation, dose-response, spillover) → (8) Publication tables & figures (stargazer / pyfixest.etable / coefplot / event-study / binscatter / forest / RD plot / CATE heatmap / love plot, full LaTeX/Word/Excel export).
    • 📚 Progressive-disclosure architecture: SKILL.md holds only the one canonical call per step (610 lines of spine); variants are offloaded to 8 references/NN-*.md deep manuals (3000+ lines total), loaded by agents only when needed.
    • 🆚 Relationship to StatsPAI: StatsPAI = agent-native one-shot DSL (one sp.causal(...) runs everything); this skill = explicit traditional stack (every line swappable, every diagnostic by hand). They coexist and complement — reach for StatsPAI when you trust the DSL; reach for this skill when teaching, auditing, or requiring full control.
    • Use cases: replicating applied-economics papers, referee-level line-by-line audit, graduate teaching, any project that insists on hanging every diagnostic and robustness check into the explicit pipeline.
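
Two of the variable-construction steps named above, the IHS transform and 1/99 winsorization, can be sketched with the standard library alone. This is an illustration under assumptions, not code from the skill: the function names are hypothetical, and the nearest-rank percentile rule is one of several conventions (pandas and numpy default to interpolation, so their cut-offs can differ slightly).

```python
import math


def ihs(x: float) -> float:
    """Inverse hyperbolic sine, log(x + sqrt(x^2 + 1)); defined at zero and for negatives."""
    return math.asinh(x)


def percentile_nearest_rank(sorted_values: list[float], pct: float) -> float:
    """Nearest-rank percentile of an already sorted list (assumed convention)."""
    rank = max(1, math.ceil(pct * len(sorted_values) / 100))
    return sorted_values[rank - 1]


def winsorize(values: list[float], lower_pct: float = 1.0, upper_pct: float = 99.0) -> list[float]:
    """Clamp values outside the [lower_pct, upper_pct] percentiles to those bounds."""
    ordered = sorted(values)
    lo = percentile_nearest_rank(ordered, lower_pct)
    hi = percentile_nearest_rank(ordered, upper_pct)
    return [min(max(v, lo), hi) for v in values]
```

Winsorizing clamps extreme observations instead of dropping them, so the sample size stays fixed across specifications; the IHS transform behaves like a log for large values while staying defined at zero.
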
2026-04-24: 🔥 StatsPAI Skill officially shipped — end-to-end automated empirical analysis (skills/00)
  • 🔥🔥 StatsPAI Skill: Our agent-native, one-stop empirical-analysis Skill is now officially vendored in this repo at skills/00-Full-empirical-analysis-skill_StatsPAI/ (slot #0), the repository’s flagship.
    • 🚀 End-to-end automation for the entire empirical pipeline: data cleaning (pandas pre-step) → EDA & descriptives (sp.sumstats / sp.balance_table) → pre-flight diagnostics (sp.diagnose / sp.balance_panel / overlap / missingness) → research-question DSL (sp.causal_question(...).identify()) → LLM-assisted DAG discovery (sp.llm_dag_propose / validate / constrained) → one-call estimation (sp.causal(...)) → robustness (sp.spec_curve / sp.honest_did / sp.evalue). 6-step closed loop, no tool switching — the agent runs the whole thing from a single instruction.
    • 900+ functions, one import statspai as sp: more than doubled from the 390+ version on 2026-04-12. Covers OLS, IV, panel, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD (Sharp / Fuzzy / multi-cutoff / Kink), PSM, SCM, SDID, DML, Causal Forest, Meta-Learners, TMLE, AIPW, neural causal models (TARNet / CFRNet / DragonNet), text causal (sp.causal_text), Heckman, structural estimation (BLP).
    • Agent-native self-describing API: sp.list_functions() / sp.describe_function() / sp.function_schema() — agents discover and understand functions without doc lookup. Every estimator returns a unified CausalResult with .summary() / .plot() / .to_latex() / .to_word() / .to_excel() / .cite() and a structured .diagnostics dict — purpose-built for LLM-driven workflows.
    • Estimand-first decisions: sp.causal_question makes the “DID vs RD vs IV?” choice explicit and defensible — no more guesswork.
    • Submitted to JOSS, MIT-licensed. → PyPI | → GitHub | → Local Skill
  • 🔁 Weekly upstream sync: new GitHub Action auto-pulls the latest SKILL.md / README.md from the StatsPAI main repo into skills/00-Full-empirical-analysis-skill_StatsPAI/ every week — users always get the latest version.
  • Corrected several sp.* signatures in Skill code examples; Step 0–6 code blocks are now explicitly flagged as illustrative (so agents don’t copy them verbatim).
2026-04-13: 🇨🇳 Original Chinese De-AIGC Skill Launched (skills/48)
  • 🇨🇳🔥 chinese-de-aigc: CoPaper.AI team’s original Chinese academic de-AIGC skill. Currently the only humanizer on GitHub dedicated to Chinese academic empirical papers and targeting China’s CNKI AMLC / Wanfang / VIP / Turnitin Chinese detectors.
    • 17-pattern library of Chinese AI tells (4-character clichés / hollow connectives / explicit transitions / absolutist claims / total-part-total symmetry / sentence-length uniformity)
    • 5-step closed-loop workflow: Locate → Diagnose → Differential Rewrite → 5-Dim Self-Score → Second-Pass Review
    • Per-section strategy: Abstract / Introduction / Literature Review / Methods / Results / Discussion / Conclusion each has different rewrite intensity
    • 5-dimension scoring rubric: Concreteness / Rhythm / Caution / Implicit Cohesion / Researcher Voice (weighted max 50)
    • 12 before/after case comparisons covering 7 main chapters of empirical papers
    • Architecture inspired by English humanizers (humanizer_academic / skill-deslop / stop-slop / avoid-ai-writing), but fully re-designed for Chinese language context
2026-04-12: Added StatsPAI Agent-Native Econometrics Package + Anti-AIGC Detection Skills
  • 🔥 StatsPAI: Our own agent-native causal inference & econometrics Python package. 390+ functions, one import, self-describing API (list_functions() / describe_function() / function_schema()). Covers OLS, IV, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD, PSM, SCM, DML, Causal Forest, Meta-Learners, TMLE, neural causal models (TARNet/CFRNet/DragonNet), and more. Submitted to JOSS, MIT license. → PyPI | → GitHub
  • 📝 Anti-AIGC Detection Skills (4 new, → dedicated section):
    • humanizer_academic — Academic paper specialist, 23 AI writing pattern detectors (skills/44)
    • skill-deslop — Scientific writing de-AI, respects discipline conventions (skills/45)
    • stop-slop — 3-layer detection + 5-dimension scoring (skills/46)
    • avoid-ai-writing — Structured audit + rewrite + second-pass audit (skills/47)
  • 🛡️ revision-guard: Prevents AI over-refinement, limits revision rounds + 7-point homogenization checklist (community PR contribution)
2026-04-11: Expanded from 43 collections to 119 repos, covering 23,000+ Skills
  • Added 76 GitHub repositories across 8 social science disciplines (economics, political science, sociology, psychology, education, public health, management, finance)
  • Added skill suites for finance, law, marketing, product management, education, public health
  • Added 13 academic data MCP servers (OpenAlex, Semantic Scholar, FRED, World Bank, etc.)
  • Added 11 multi-agent collaboration systems (Agent Laboratory, AI-Scientist-v2, etc.)
  • Added bilingual Chinese/English README


What Can This List Do for You?

If you do empirical research, you’ve probably experienced these scenarios:

  • You ask AI to run a DID, and it gives you the baseline regression and stops. You say “parallel trends?” — it adds one. “Placebo test?” — another one. “Event study plot?” — yet another. Every time, it’s like squeezing toothpaste.
  • You finally finish a draft, but citations are a mess, with a few hallucinated references mixed in.
  • You want to replicate an identification strategy from a top journal, but the gap between understanding it and implementing it feels like a mountain.

The problem isn’t that AI can’t do it — it doesn’t know what a complete workflow should include.

A Skill solves this: it’s a methodological playbook for AI. With a Skill, AI knows “running DID means first testing parallel trends, then baseline regression, then 4 robustness checks, then heterogeneity analysis, then mechanism analysis, with specific output formats at each step.” You just say “run a DID analysis” and it follows the complete workflow.
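
The idea of a Skill as a playbook can be sketched as an ordered checklist that reports what is still missing. This is a toy illustration, not how any SKILL.md in this list is implemented: the step names paraphrase the DID example above, and `DID_PLAYBOOK` and `AnalysisRun` are hypothetical names.

```python
from dataclasses import dataclass, field

# Step names paraphrase the DID example in the text; they are assumptions, not a real SKILL.md.
DID_PLAYBOOK = (
    "parallel_trends_test",
    "baseline_regression",
    "robustness_checks",
    "heterogeneity_analysis",
    "mechanism_analysis",
)


@dataclass
class AnalysisRun:
    """Tracks which playbook steps an agent has completed so far."""

    completed: list[str] = field(default_factory=list)

    def mark_done(self, step: str) -> None:
        if step not in DID_PLAYBOOK:
            raise ValueError(f"unknown step: {step}")
        if step not in self.completed:
            self.completed.append(step)

    def missing_steps(self) -> list[str]:
        """Steps the playbook requires that have not been run yet, in playbook order."""
        return [s for s in DID_PLAYBOOK if s not in self.completed]
```

An agent that stops after the baseline regression would still see four entries in `missing_steps()`, which is the "squeezing toothpaste" gap that a Skill closes up front.
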

This list helps you find the best Skills for every stage of the empirical research workflow.


Quick Lookup by Research Stage

Not sure which Skill to use? Start from your current research stage:

Topic Ideation → Lit Search → Deep Reading → Research Design → Data Collection
      │              │             │              │                │
      ▼              ▼             ▼              ▼                ▼
     01             02            03             01               04

Data Cleaning → Statistical Analysis → First Draft → Revision → Typesetting
      │              │                    │            │            │
      ▼              ▼                    ▼            ▼            ▼
     04             05                   06           07           08

Replication → Submission → Peer Review Response → Defense
      │           │              │                   │
      ▼           ▼              ▼                   ▼
     09          10             10                  10

One-Stop Solutions

If you don’t want to pick Skills one by one, these solutions cover the full workflow:

| Solution | Coverage | Highlights | Link |
| --- | --- | --- | --- |
| CoPaper.AI | Data Analysis → Paper Writing | 20 built-in methodology Skills, multi-agent architecture, complete publication-quality empirical paper in 20 minutes | copaper.ai |
| StatsPAI Skill 🔥🔥 | End-to-end automated empirical analysis | 900+ functions, one import statspai as sp. A single agent instruction runs the full 6-step loop: EDA → pre-flight → research-question DSL → DAG discovery → estimation → robustness. Agent-native self-describing API, covers OLS/IV/DID (incl. Callaway-Sant’Anna, Sun-Abraham, HonestDID, continuous DID)/RDD/PSM/SCM/DML/Causal Forest/neural causal/text causal, publication-ready output (Word/Excel/LaTeX) | Local Skill · GitHub |
| Full Empirical Analysis Skill 📘 | Traditional Python stack, explicit 8-step loop (StatsPAI’s philosophical counterpart) | No DSL wrapper: the agent drives pandas / statsmodels / linearmodels / pyfixest / rdrobust / econml / causalml / matplotlib directly. Covers data cleaning → variable construction → descriptive statistics → diagnostic tests → modeling → robustness → mechanism/heterogeneity/mediation/moderation → publication-ready tables & figures, one deep reference per step. Ideal for teaching, referee-level audit, and strict replication work that needs line-by-line control | Local Skill |
| Full Empirical Analysis Skill (Stata) 📊 | Traditional Stata .do 8-step loop (the Stata sibling of 00.1) | reghdfe + ivreg2 + csdid + did_imputation + eventstudyinteract + sdid + rdrobust + synth + psmatch2 + teffects + ebalance + boottest + ritest + rwolf + bacondecomp + honestdid + coefplot + esttab + outreg2 + asdoc + binscatter. Same 8 steps, from use / import all the way to .tex / .rtf tables + .pdf figures. The first choice for referee-level Stata replication packs | Local Skill |
| Full Empirical Analysis Skill (R) 📗 | tidyverse + fixest 8-step loop + Quarto rendering (R sibling of 00.1 / 00.2) | dplyr + haven + fixest + did + synthdid + bacondecomp + HonestDiD + rdrobust + tidysynth + gsynth + MatchIt + WeightIt + cobalt + ebal + grf + DoubleML + mediation + lavaan + marginaleffects + modelsummary + kableExtra + gt + flextable + ggplot2 + iplot + binsreg + cowplot. All 8 steps fit in a single .qmd; quarto render produces a unified PDF/HTML/Word reproducibility report. | Local Skill |
| Claude Scholar | Ideation → Submission | 25+ Skills covering the full research lifecycle, Zotero MCP integration | GitHub |
| K-Dense Scientific Skills | Cross-disciplinary Science | 140+ Skills, 28+ scientific databases, 55+ Python packages | GitHub |
| AI-Research-SKILLs | AI/ML Research | 22 categories, 87 skills, full research cycle | GitHub |
| OpenClaw Medical Skills | Biomedical/Public Health | 869 Skills, epidemiology, clinical research, drug safety, biostatistics | GitHub |
| Agent Laboratory | Fully Autonomous Research | Lit review → Experiments → Report, 84% reduction in research costs | GitHub |

Comprehensive Skill Suites

These repositories contain multiple Skills and typically cover several research stages:

Academic Research

| Suite | Stars | # Skills | Key Features | Social Science Fit |
| --- | --- | --- | --- | --- |
| K-Dense-AI/claude-scientific-skills | 8,799 | 140+ | 28+ scientific databases (OpenAlex, PubMed), scientific-writing + literature-review + statistical-analysis | ⭐⭐⭐⭐ |
| Orchestra-Research/AI-Research-SKILLs | 3,637 | 87 | 22 categories, ML paper writing, LaTeX templates, citation verification | ⭐⭐⭐ |
| Imbad0202/academic-research-skills | ~1,790 | Multiple | Full paper pipeline (research → write → review → revise → finalize), style calibration, hallucination detection | ⭐⭐⭐⭐ |
| Galaxy-Dawn/claude-scholar | - | 25+ | Full research lifecycle: ideation → review → experiments → writing → peer review response, Zotero MCP | ⭐⭐⭐⭐⭐ |
| luwill/research-skills | 209 | 3 | Research proposal generation, medical review writing, paper-to-slides, bilingual | ⭐⭐⭐⭐⭐ |
| lishix520/academic-paper-skills | 22 | 2 | Strategist (7-dimension reviewer simulation) + Composer (systematic writing) | ⭐⭐⭐⭐ |
| Data-Wise/claude-plugins | - | 17 | Statistical research: arXiv search, DOI lookup, BibTeX management, methodology writing, referee response | ⭐⭐⭐⭐⭐ |

Economics / Causal Inference

| Suite | Key Features | Use Case |
| --- | --- | --- |
| CoPaper.AI | 20 methodology Skills (OLS, DID, staggered DID, IV, RDD, PSM, SCM, DML, causal forest, etc.), multi-agent architecture (Supervisor + 4 sub-agents), smart routing, automatic output | Full empirical economics workflow |
| StatsPAI Skill 🔥🔥 | End-to-end automated empirical analysis. Agent-native econometrics Python package: 900+ functions, one import statspai as sp runs the full loop: EDA → research-question DSL → LLM-assisted DAG discovery → estimation → robustness. Self-describing API (list_functions() / describe_function() / function_schema()), unified CausalResult objects. Covers OLS, IV, panel data, DID (Callaway-Sant’Anna / Sun-Abraham / Bacon / HonestDID / continuous DID), RDD (Sharp/Fuzzy/multi-cutoff/Kink), PSM, SCM, SDID, DML, Causal Forest, Meta-Learners, TMLE, AIPW, neural causal models (TARNet/CFRNet/DragonNet), text causal (sp.causal_text), Heckman, structural estimation (BLP). Submitted to JOSS, MIT license | Whole-pipeline automation: one agent call goes from cleaned data to robust estimates |
| Full Empirical Analysis Skill 📘 | Traditional Python econometrics stack, explicit 8-step closed loop (philosophical counterpart to StatsPAI: DSL one-shot vs. explicit line-by-line). No wrapper: drives pandas + numpy + scipy + statsmodels + linearmodels + pyfixest + rdrobust + econml + causalml + matplotlib + seaborn directly. Fine-grained 8 steps: (1) data cleaning (MCAR/MAR/MNAR, IQR/z/Mahalanobis, validate= safe merges, panel-structure checks) → (2) variable construction (log/IHS/Box–Cox, 1/99 winsorization, z/MinMax/Robust, interactions/lags/diffs, CPI deflation, staggered-DID timing) → (3) descriptives (stratified Table 1 with SMD+t-tests, starred correlation heatmap, 4-panel distributions, DID motivation plot, panel-coverage heatmap) → (4) diagnostics (12 classes: normality / heteroskedasticity / autocorrelation / collinearity / stationarity / cointegration / endogeneity / weak-IV / overid / Hausman / RESET / Cook’s D) → (5) modeling (12 classes: OLS / panel FE-RE-FD / GLM / IV-2SLS-LIML-GMM / 5 DID variants / 4 RD variants / SC / PSM-IPW-EB / DML / CF / Heckman / QR) → (6) robustness (M1–M6 progressive specs, cluster sensitivity, wild bootstrap, placebo, spec curve, Oster δ*, LOO, Rosenbaum) → (7) further analysis (heterogeneity × 4 / outcome-ladder mechanism / Baron–Kenny + Imai mediation / moderated mediation / dose-response / spillover) → (8) publication tables & figures (stargazer / etable / coefplot / event-study / binscatter / forest / RD plot / CATE heatmap / love plot, plus LaTeX/Word/Excel export). 610-line SKILL.md spine + 8 deep reference manuals (3000+ lines), progressively loaded | Teaching, referee-level audit, graduate replication training, rigorous empirical projects requiring line-by-line control and full diagnostic coverage |
| Full Empirical Analysis Skill (Stata) 📊 | Traditional Stata .do 8-step closed loop (Stata sibling of 00.1, same structure, same cadence). One ssc install block installs 30+ packages. End-to-end community-standard chain: reghdfe / ivreg2 / ivreghdfe / csdid / did_imputation / eventstudyinteract / sdid / did_multiplegt_dyn / bacondecomp / honestdid / rdrobust / rddensity / synth / synth_runner / psmatch2 / teffects / ebalance / ppmlhdfe / boottest / ritest / rwolf / psacalc / coefplot / esttab / outreg2 / asdoc / binscatter. 8 steps: (1) use+import+destring+misstable+merge assert+xtset → (2) winsor2+xtile+L./F./D./S.+CPI+staggered timing → (3) tabstat+balancetable+asdoc+pwcorr sig star+heatplot → (4) 12 estat-style diagnostics → (5) 12 estimator classes (reghdfe + 5 DID + 4 RD + synth + teffects + ebalance + heckman + qreg + ppmlhdfe + sem/gsem) → (6) eststo+esttab M1–M6 + boottest + ritest + rwolf + bacondecomp + honestdid + psacalc delta → (7) factor-var + margins + marginsplot + suest + DDD + medsem + khb + SEM + Stata-Python bridge to econml for CATE → (8) esttab+outreg2+asdoc to .tex/.rtf/.docx/.xlsx; coefplot+marginsplot+binscatter+rdplot+graph combine to .pdf. 801-line SKILL.md + 8 deep references (3500+ lines) + complete .do skeleton | Referee / co-author insists on Stata replication; graduate Stata courses; AER/QJE/JPE/ReStud-style standard .do pipelines |
| Full Empirical Analysis Skill (R) 📗 | Modern tidyverse + fixest + Quarto stack, explicit 8-step loop (R sibling of 00.1 / 00.2; the fourth and final piece of the family). One install.packages(...) block installs 50+ packages. End-to-end modern R standards: dplyr / tidyr / haven / janitor / naniar / mice / validate / assertr for data; fixest::feols/feglm/fepois for HD FE + multi-way clustering + IV in one line; did::att_gt / fixest::sunab / didimputation::did_imputation / synthdid / DIDmultiplegtDYN / bacondecomp / HonestDiD for modern DID; rdrobust / rddensity / rdmulti / rdlocrand for RD; Synth / gsynth / tidysynth / synthdid for SC; MatchIt / WeightIt / cobalt / ebal for matching; grf::causal_forest / DoubleML for ML causal; mediation::mediate + medsens / lavaan::sem for mediation; marginaleffects for post-estimation; modelsummary / kableExtra / gt / flextable for tables; ggplot2 + iplot + binsreg + cowplot + patchwork for figures. 8-step R pipeline + Quarto template (one .qmd holding narrative + code + tables + figures, quarto render for PDF/HTML/Word in one go). 893-line SKILL.md + 8 deep references (3700+ lines), progressively loaded | Quarto reproducibility reports, academic blogs (distill / quarto blog), graduate R courses, projects needing marginaleffects + Imai sensitivity mediation + grf CATE post-estimation |
AER-Skills 📕🔥Top-5 economics submission skill stack (AER / AER:Insights / AEJ family), complementary to StatsPAI / 00.x “run the analysis” — specialised in “write the paper + submit + R&R”. Nine skills, full pipeline: aer-topic-selection (AER vs Insights vs AEJ routing) → aer-identification (identification audit: modern DiD / weak IV / boundary RDD pitfalls) → aer-robustness (referee-anticipating robustness matrix) → aer-introduction (Keith Head five-paragraph intro) → aer-tables-figures (AER booktabs typesetting) → aer-replication (AEA Data and Code Availability Policy package, openICPSR-ready) → aer-submission (preflight: 100-word abstract, disclosure, cover letter) → aer-rebuttal (R&R letters written against the revised manuscript) → aer-workflow (orchestrator). Identification-first — if your design is fragile, no prose will save it. Covers AER 100-word abstract / AER:Insights 7000-word limit / 45% desk-rejection / AEA mandatory replication — top-5-specific constraints that generic scientific-writing skills do not cover. scripts/sync-aer-skills.sh + weekly GH Actions loop syncs from upstream brycewang-stanford/AER-skills. License: MITFull AER / AER:Insights / AEJ submission flow: topic routing → identification audit → writing → typesetting → replication package → submission → R&R rebuttal
claesbackman/AI-research-feedback2-agent economics paper pre-review: causal overclaiming detection, identification strategy assessment; supports AER/QJE/JPE/Econometrica/REStud; 6-agent grant reviewPre-submission self-review, grant applications
fuhaoda/stats-paper-writing-agent-skillsLaTeX statistical paper writing, front-end draft generationStatistics & econometrics papers
dylantmoore/stata-skillFull Stata coverage: syntax, data management, econometrics, causal inference, graphics, Mata, 20+ community packagesStata users
SepineTam/stata-mcpLLM operates Stata regression directly via MCP, “evolve from regression monkey to causal thinker”Stata econometrics
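
Two of the steps named in the Python 8-step loop, the validate= safe merge from step (1) and the 1/99 winsorization from step (2), can be sketched with plain pandas. This is a minimal illustration and not code taken from the skill itself; the helper names `safe_merge` and `winsorize` and the toy firm-year data are hypothetical.

```python
import pandas as pd


def safe_merge(left: pd.DataFrame, right: pd.DataFrame, key: str) -> pd.DataFrame:
    """Left-join `right` onto `left`, failing loudly if the join is not many-to-one."""
    # validate="m:1" makes pandas raise MergeError when `key` is duplicated in
    # `right`, which would otherwise silently multiply rows in the merged panel.
    return left.merge(right, on=key, how="left", validate="m:1")


def winsorize(values: pd.Series, lower: float = 0.01, upper: float = 0.99) -> pd.Series:
    """Clip a series at its own lower/upper quantiles (1/99 by default)."""
    low_cut, high_cut = values.quantile([lower, upper])
    return values.clip(lower=low_cut, upper=high_cut)


# Hypothetical toy data: a firm-year panel plus a firm-level lookup table.
panel = pd.DataFrame(
    {"firm": ["a", "a", "b"], "year": [2020, 2021, 2020], "sales": [10.0, 12.0, 900.0]}
)
firms = pd.DataFrame({"firm": ["a", "b"], "region": ["east", "west"]})

merged = safe_merge(panel, firms, key="firm")
merged["sales_w"] = winsorize(merged["sales"])
```

If the lookup table ever contains a duplicated firm, `safe_merge` raises `pandas.errors.MergeError` instead of quietly inflating the row count, which is the point of passing `validate=`.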

🚨 Anti-AIGC Detection & De-AI Academic Writing (Highlighted)

This is one of the most critical pain points in academic writing in 2026. Papers failing AIGC detection can be rejected outright, and detectors like Turnitin, GPTZero, and China’s CNKI are getting stricter. The 4 skills below are the most authoritative and complete solutions on GitHub — all MIT open-source, and all locally archived in this repo (skills/44-47).

Suite | Key Features | Use Case | Local Path
CoPaper.AI / chinese-de-aigc 🇨🇳🔥 | Original Chinese academic de-AIGC skill by CoPaper.AI team. Targets China’s CNKI AMLC / Wanfang / VIP / Turnitin Chinese detectors. 17-pattern library of Chinese-specific AI tells (4-char clichés, hollow connectives, explicit transitions, absolutist claims, sentence-length uniformity), 5-step closed loop workflow (locate→diagnose→rewrite→self-score→review), per-section strategy, 5-dim scoring rubric. Currently the only GitHub skill dedicated to Chinese academic de-AIGC | Chinese journal submissions, theses, grant proposals | skills/48
matsuikentaro1/humanizer_academic 🔥 | Academic-specific. 23 AI writing patterns (6 content + 6 language + 3 style + 3 filler + 5 word choice), examples from EMPA-REG OUTCOME cardiovascular trials, preserves legitimate academic transitions, based on Wikipedia “Signs of AI writing” | Medical, life sciences, natural science papers | skills/44
stephenturner/skill-deslop | Scientific writing de-AI. Smartly distinguishes legitimate discipline conventions (passive voice in methods) from AI tells; 5-dimension scoring (directness/rhythm/trust/authenticity/density); 4 reference files (examples/phrases/structures/tropes) | Scientific papers, technical blogs | skills/45
hardikpandya/stop-slop | 3-layer detection + 5-dim scoring. Banned phrases (throat-clearing openers, emphasis crutches, corporate jargon), structural clichés (binary contrasts, dramatic fragmentation, false agency), sentence-level rules (no em dash, no Wh- starters). Below 35/50 → revise | General prose, blogs, reports | skills/46
conorbronsdon/avoid-ai-writing | Structured audit + rewrite + second-pass audit. Four-section output: identified issues (with quotes) → rewrite → change summary → second audit. Compatible with Claude Code, OpenClaw, Hermes, and other agents | Workflows needing auditable, traceable revision | skills/47
ShiyanW/ai-revision-guard | Prevents over-refinement (different angle). Limits revision rounds (≤2 per section), 7-point homogenization checklist, cross-model verification. Protects author’s voice from AI erosion | Multi-round polishing scenarios | (community PR)
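
The "below 35/50 → revise" rule in the stop-slop row amounts to a simple threshold on a five-dimension total. The sketch below assumes each dimension is scored from 1 to 10 (so the maximum is 50) and borrows the dimension names from the skill-deslop row; both are assumptions for illustration, and the actual rubrics may differ.

```python
# Assumption: five dimensions, each scored 1-10, for a 50-point total.
# The dimension names are borrowed from the skill-deslop row; stop-slop's own
# rubric may label them differently.
DIMENSIONS = ("directness", "rhythm", "trust", "authenticity", "density")
REVISE_BELOW = 35  # from the table: a total below 35/50 triggers a revision


def total_score(scores: dict[str, int]) -> int:
    """Sum the five dimension scores after checking they are complete and in range."""
    missing = [name for name in DIMENSIONS if name not in scores]
    if missing:
        raise ValueError(f"missing dimensions: {missing}")
    for name in DIMENSIONS:
        if not 1 <= scores[name] <= 10:
            raise ValueError(f"{name} must be between 1 and 10")
    return sum(scores[name] for name in DIMENSIONS)


def needs_revision(scores: dict[str, int]) -> bool:
    """Return True when the total falls below the revision threshold."""
    return total_score(scores) < REVISE_BELOW


# Hypothetical scores for two drafts of the same paragraph.
first_draft = {"directness": 7, "rhythm": 6, "trust": 8, "authenticity": 6, "density": 7}
second_draft = {**first_draft, "rhythm": 7}
```

Here `first_draft` totals 34 and is sent back for revision, while `second_draft` totals exactly 35 and passes, since the rule only flags totals strictly below the threshold.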

Recommended combos:

  • 🇨🇳 Chinese academic papers (CNKI/Wanfang/VIP) → chinese-de-aigc (original) + revision-guard
  • 🇬🇧 English academic papers → humanizer_academic + revision-guard (prevent over-refinement)
  • Bilingual papers → chinese-de-aigc + humanizer_academic combined
  • Need auditable workflow → avoid-ai-writing (structured reports)
  • General writing → stop-slop (5-dim scoring for quantified improvement)

Finance & Investment Research

Suite | Key Features | Use Case
anthropics/financial-services-plugins | Anthropic official: investment banking, equity research, private equity, wealth management | Financial services
OctagonAI/skills | Octagon agentic financial research Claude Skills | Institutional financial research
tradermonty/claude-trading-skills | Stock investing & trading: market analysis, technical charts, economic calendar, strategy development | Quantitative trading research
himself65/finance-skills | Agent Skills open standard, earnings analysis, consensus estimates, analyst sentiment | Financial analysis
quant-sentiment-ai/claude-equity-research | Institutional equity research: fundamental analysis, technical indicators, risk assessment | Equity research

Education & Public Health

Suite | Key Features | Use Case
GarethManning/claude-education-skills | Evidence-based education Claude Skills, designed for teachers and agent orchestration | Education research
FreedomIntelligence/OpenClaw-Medical-Skills | 869 medical AI Skills: epidemiology, public health surveillance, clinical research, drug safety, biostatistics | Public health, medical research

Governance, Compliance & Law

Suite | Key Features | Use Case
Sushegaad/Claude-Skills-Governance-Risk-and-Compliance | GRC Skills: ISO 27001, SOC 2, GDPR, HIPAA compliance guidance (94% vs 72% baseline) | Compliance research, policy analysis
zubair-trabzada/ai-legal-claude | Legal assistant: contract review, risk analysis, NDA generation, compliance audit, 14 Skills + 5 agents | Law & economics, regulatory research
evolsb/claude-legal-skill | AI contract review: CUAD risk detection, market benchmarks, attorney-grade red-lining | Law & economics research

Marketing & Consumer Behavior

Suite | Key Features | Use Case
coreyhaines31/marketingskills | CRO, copywriting, SEO, analytics, and growth engineering | Marketing research
zubair-trabzada/ai-marketing-claude | 15 Skills + parallel sub-agents: website audit, copy, email sequences, competitive intelligence | Consumer behavior analysis
ericosiu/ai-marketing-skills | Growth experiments, sales pipeline, content operations, SEO, financial automation | Marketing strategy research

Product Management & Organizational Behavior

Suite | Key Features | Use Case
phuryn/pm-skills | 100+ agent Skills: discovery → strategy → execution → launch → growth, 65 PM Skills + 36 chained workflows | Product management, organizational research
mastepanoski/claude-skills | UX/UI evaluation (Nielsen heuristics, WCAG), AI governance (NIST AI RMF, ISO 42001) | UX research

General Agent Capabilities

Suite | Stars | Key Features
lyndonkl/claude | - | 85 skills + 6 orchestration agents, incl. causal inference, Bayesian reasoning, experimental design, multi-criteria analysis
alirezarezvani/claude-skills | ~5,200 | 220+ skills + 298 CLI scripts, incl. financial analysis and data processing
rohitg00/awesome-claude-code-toolkit | - | 135 agents incl. data scientist agent (EDA, DID, RDD), 35 skills, 42 commands
jeremylongshore/claude-code-plugins-plus-skills | - | 340 plugins + 1,367 agent skills, CCPI package manager
affaan-m/everything-claude-code | - | Skills, intuition, memory, security, research-first development framework
posit-dev/skills | - | Posit official: modern-r-tidyverse, predictive-modeling, quarto-authoring, shiny-bslib

Multi-Agent Collaboration Systems

A single Skill solves a point problem; multi-agent systems solve end-to-end workflows. These systems let multiple AI roles divide work, cross-review, and produce output quality far beyond what a single agent can achieve:

Paper Revision & Writing

System | Architecture | Key Features
copy-edit-master | 3 sub-agents: structure-editor + line-editor + quality-reviewer | Auto document type detection, Strunk & White / McCloskey rules encoded, git checkpoints per phase, review loop (max 2 iterations)
introduction-writer | 4 sub-agents: strategist → drafter → reviewer → reviser | Keith Head formula for drafting introductions, reviewer independent from drafter for quality loop
CoPaper.AI PaperAgent | Supervisor + 4 sub-agents (preparation / modeling / visualization / writing) | Skills routed by target_agent, each sub-agent sees only relevant methodology guidance, reduced context noise
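
The target_agent routing described in the PaperAgent row can be pictured as grouping a skills catalogue by its destination sub-agent, so that each sub-agent's context holds only its own guidance. This is a sketch under assumptions: the skill names and the `route_skills` helper are hypothetical, and only the `target_agent` field name and the four sub-agent roles come from the table.

```python
# The four sub-agent roles listed in the table.
SUB_AGENTS = ("preparation", "modeling", "visualization", "writing")

# Hypothetical catalogue entries; only the `target_agent` key comes from the source.
SKILLS = [
    {"name": "panel-cleaning", "target_agent": "preparation"},
    {"name": "staggered-did", "target_agent": "modeling"},
    {"name": "event-study-plot", "target_agent": "visualization"},
    {"name": "intro-drafting", "target_agent": "writing"},
    {"name": "iv-diagnostics", "target_agent": "modeling"},
]


def route_skills(skills: list[dict[str, str]]) -> dict[str, list[str]]:
    """Group skill names by the sub-agent that should see them."""
    routed: dict[str, list[str]] = {agent: [] for agent in SUB_AGENTS}
    for skill in skills:
        agent = skill["target_agent"]
        if agent not in routed:
            raise ValueError(f"unknown target_agent: {agent!r}")
        routed[agent].append(skill["name"])
    return routed


# Each sub-agent's prompt would be built from its own list only.
context_by_agent = route_skills(SKILLS)
```

Rejecting an unknown `target_agent` up front is a design choice in this sketch: a mistyped role would otherwise mean a skill is silently never loaded.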

Why does multi-agent beat single-agent? When the same agent writes and reviews, it tends to approve its own work. Separating the roles makes the reviewer independent of the drafter, which creates a genuine quality loop. This is the same logic as academic peer review.
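
The drafter/reviewer separation and the capped review loop (the copy-edit-master row mentions a maximum of 2 iterations) can be sketched as a small control loop. The reviewer and reviser below are toy stand-ins for separate agents, and every name in the block is hypothetical; real systems would call distinct model sessions here.

```python
from typing import Callable

MAX_REVIEW_ROUNDS = 2  # cap taken from the copy-edit-master row


def review_loop(
    draft: str,
    review: Callable[[str], list[str]],
    revise: Callable[[str, list[str]], str],
    max_rounds: int = MAX_REVIEW_ROUNDS,
) -> tuple[str, int]:
    """Alternate independent review and revision until clean or the cap is hit."""
    rounds = 0
    while rounds < max_rounds:
        issues = review(draft)  # the reviewer only sees the text, not the drafter's reasoning
        if not issues:
            break
        draft = revise(draft, issues)
        rounds += 1
    return draft, rounds


# Toy stand-ins for the two roles (hypothetical rules, for illustration only).
FIXES = {"very ": "", "utilize": "use"}


def toy_reviewer(text: str) -> list[str]:
    return [phrase for phrase in FIXES if phrase in text]


def toy_reviser(text: str, issues: list[str]) -> str:
    for phrase in issues:
        text = text.replace(phrase, FIXES[phrase])
    return text


final_text, rounds_used = review_loop(
    "We utilize a very large panel.", toy_reviewer, toy_reviser
)
```

The cap matters as much as the separation: a reviewer that never runs out of complaints would otherwise keep the loop going indefinitely.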

Data Analysis & Research

System | Source | Key Features
ruc-datalab/DeepAnalyze | Renmin Univ. | Autonomous data analysis agent, raw data → professional report, CSV/Excel/JSON/DB support, open-source DeepAnalyze-8B
business-science/ai-data-science-team | Business Science | Multi-agent data science team: EDA Agent + SQL Agent + MLflow Agent, LangChain integration
HungHsunHan/claude-code-data-science-team | Community | Claude Code multi-agent data science team, auto cleaning → modeling → executable Notebook
HKUDS/AI-Researcher | HKU (NeurIPS 2025 Spotlight) | Fully autonomous research pipeline: lit review → hypothesis → algorithm → paper
wanshuiyin/Auto-claude-code-research-in-sleep (ARIS) | Community | Overnight autonomous research, cross-model review loops (Claude + external LLM as critic)
SamuelSchmidgall/AgentLaboratory | Academic (ICLR) | End-to-end autonomous research: lit review → experiments → report, arXiv/HuggingFace/LaTeX integration, 84% cost reduction
SakanaAI/AI-Scientist-v2 | Sakana AI | Fully automated scientific discovery: hypothesis → experiments → paper, first AI-generated paper accepted via peer review
assafelovic/gpt-researcher | Community | Autonomous deep research agent, supports any LLM provider
LitLLM/LitLLM | Academic | AI literature review assistant: keyword extraction + multi-strategy retrieval + re-ranking, RAG-based
pedrohcgs/claude-code-my-workflow | Emory Univ. | Academic LaTeX/Beamer + R template, multi-agent review + quality gates, adopted by 15+ research groups
hugosantanna/clo-author | Community | Extends Sant’Anna’s workflow from lecture production to full social science empirical research publication

Academic Data MCP Servers

System | Key Features
xingyulu23/Academix | Unified academic research interface aggregating OpenAlex + DBLP + Semantic Scholar + arXiv + CrossRef
Eclipse-Cj/paper-distill-mcp | 11-source parallel search, 4-dimension weighted ranking (relevance/recency/impact/novelty)
oksure/openalex-research-mcp | OpenAlex API: search 240M+ academic works, citation analysis, trend tracking, collaboration networks
zongmin-yu/semantic-scholar-fastmcp-mcp-server | Full Semantic Scholar API access: papers, authors, citation networks
openags/paper-search-mcp | Search 20+ sources: arXiv, PubMed, bioRxiv, Google Scholar, SSRN, Unpaywall, etc.
aringadre76/mcp-for-research | Integrates PubMed + Google Scholar + ArXiv + JSTOR, published on NPM
blazickjp/arxiv-mcp-server | arXiv paper search and analysis MCP
lzinga/us-gov-open-data-mcp | 40+ US government APIs (FRED/Census/CDC/FDA/FEC, etc.), 250+ tools
stefanoamorelli/fred-mcp-server | Direct access to FRED’s 800K+ economic time series
llnOrmll/world-bank-data-mcp | World Bank Data360, 1000+ socioeconomic indicators, 200+ countries
54yyyu/zotero-mcp | Connect Zotero library with AI assistants: paper review, summaries, citation analysis, PDF annotation
datagouv/datagouv-mcp | French national open data platform MCP
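
A 4-dimension weighted ranking like the one listed for paper-distill-mcp can be sketched as a weighted sum over per-paper scores. The weights, the 0-to-1 score scale, and the sample papers below are all assumptions made for illustration; the source names only the four dimensions, not how the server combines them.

```python
# Assumed weights (they sum to 1.0); the server's real weighting is not given in the source.
WEIGHTS = {"relevance": 0.4, "recency": 0.2, "impact": 0.2, "novelty": 0.2}


def weighted_score(paper: dict) -> float:
    """Combine the four dimension scores (each assumed to lie in [0, 1])."""
    return sum(weight * paper[dimension] for dimension, weight in WEIGHTS.items())


def rank_papers(papers: list[dict]) -> list[dict]:
    """Return papers sorted from highest to lowest weighted score."""
    return sorted(papers, key=weighted_score, reverse=True)


# Hypothetical search results with pre-computed dimension scores.
papers = [
    {"title": "A", "relevance": 0.9, "recency": 0.2, "impact": 0.8, "novelty": 0.3},
    {"title": "B", "relevance": 0.6, "recency": 0.9, "impact": 0.4, "novelty": 0.7},
    {"title": "C", "relevance": 0.3, "recency": 0.5, "impact": 0.9, "novelty": 0.2},
]

ranked_titles = [paper["title"] for paper in rank_papers(papers)]
```

With these assumed weights, paper B edges out paper A despite lower relevance, because recency and novelty together carry as much weight as relevance does.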

Skill Aggregation Platforms & Discovery Tools

Don’t know where to find Skills? These platforms are your starting point:

Platform | Scale | Features
VoltAgent/awesome-agent-skills | 1,000+ skills | 13,700 stars, curated by official team and community
sickn33/antigravity-awesome-skills | 1,340+ skills | 28,000 stars, one-click install npx antigravity-awesome-skills
VoltAgent/awesome-openclaw-skills | 5,400+ skills | Curated from OpenClaw registry (ClawHub 13,729 Skills)
jeremylongshore/claude-code-plugins-plus-skills | 1,367 skills | 340 plugins + CCPI package manager
skills.sh | Online market | Searchable Skill marketplace
ClawHub (clawhub.com) | 13,729 skills | Open-source AI skill marketplace, one-line install
Agent Skills Standard | Spec docs | Universal Agent Skills specification
Anthropic Official Skills | Official | PDF/DOCX/XLSX/PPTX document processing
Anthropic Official Plugin Market | Official | Anthropic-managed high-quality Claude Code plugin catalog
Anthropic Knowledge Work Plugins | Official | 11 plugins incl. Data Plugin (SQL queries, data exploration)
Anthropic Financial Services Plugins | Official | Financial services plugins: IB, equity research, PE, wealth mgmt

Learning Resources

Official Documentation

Academic Talks & Courses

Causal Inference Textbooks

Survey Papers & Awesome Lists

Community & References


🛡️ Security Scan

We ran a systematic security audit across all 52 Skills / 2,940+ files in this repository — 52/52 CLEAN, zero FLAGGED. Every “sensitive” hit was verified as legitimate content. No malicious prompts, viruses, trojans, or reverse shells were found. Bottom line: every Skill in this repo is safe to use.

Skills Security Scan Overview

Six-phase defense-in-depth methodology:

  1. Automated pattern scan — grep across 13 risk categories (pipe-to-shell, reverse shell, credential exfil, decode-and-run, mining/RAT signatures, prompt injection, etc.)
  2. Hook & permission matrix audit — 100% manual review of all 6 hook-bearing Skills and their 40+ hook scripts. Permission allow-lists are restricted to research tooling — no Bash(*) wildcards anywhere.
  3. Three parallel agent content audits — independent reviews of SKILL.md prose, agent definitions, and reference documentation for prompt injection, backdoors, hidden Unicode, suspicious package sources.
  4. Supplemental integrity checks — hidden characters, encoding anomalies, ultra-long lines, HTML injection, network-related imports, high non-ASCII ratios.
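
The automated pattern scan and the supplemental integrity checks listed above can be approximated with a few regular expressions run line by line. This is a rough sketch and not the repository's actual scanner: the three patterns cover only a handful of the 13 risk categories, the regexes are deliberately simple, and the 2,000-character line limit is an assumed value.

```python
import re

# A small, illustrative subset of risk categories (not the repo's real rule set).
RISK_PATTERNS = {
    "pipe-to-shell": re.compile(r"\b(?:curl|wget)\b[^\n|]*\|\s*(?:sudo\s+)?(?:ba|z)?sh\b"),
    "decode-and-run": re.compile(r"base64\s+(?:-d|--decode)[^\n|]*\|\s*(?:ba|z)?sh\b"),
    "wildcard-bash-permission": re.compile(r"Bash\(\*\)"),
}
# Zero-width and bidirectional-control characters that can hide injected text.
HIDDEN_CHARS = re.compile("[\u200b\u200c\u200d\u2060\ufeff\u202a-\u202e]")
MAX_LINE_LENGTH = 2000  # assumed cutoff for "ultra-long" lines


def scan_text(text: str) -> list[tuple[int, str]]:
    """Return (line number, category) pairs for every suspicious line."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for category, pattern in RISK_PATTERNS.items():
            if pattern.search(line):
                hits.append((lineno, category))
        if HIDDEN_CHARS.search(line):
            hits.append((lineno, "hidden-unicode"))
        if len(line) > MAX_LINE_LENGTH:
            hits.append((lineno, "ultra-long-line"))
    return hits


# Hypothetical file contents mixing benign and risky lines.
sample = "\n".join(
    [
        "pip install pandas",
        "curl -fsSL https://example.com/install.sh | sh",
        "allowed: Bash(*)",
        "plain\u200btext",
    ]
)
hits = scan_text(sample)
```

A scan like this only produces candidates; as the result distribution below notes, each hit still needs manual review to separate defensive rules and legitimate API calls from real threats.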

Result distribution: every hit fell into one of three legitimate categories:

  • 🛡️ Defensive security rules — deny rules, bash-safety hooks, credential detectors. 17-DAAF is the strongest “security-aware” reference in this batch (14 defensive hooks + 32 deny rules + active credential scanning).
  • 📚 Legitimate academic API calls — arXiv / CrossRef / PubMed / Semantic Scholar / FRED / World Bank / OECD / BLS, etc.
  • 🔁 Standard Claude Code workflow hooks — project scaffolding, state save/restore, context monitoring, session archive, pre-commit reminders — all local file operations, zero network IO.

Key insight: largest size ≠ highest risk. The Top 5 largest Skills (43-wentorai 478 files / 33-Galaxy-Dawn 327 files / 17-DAAF 319 files / 35-bahayonghang 264 files / 18-jusi-aalto 126 files) all passed full audit, with 17-DAAF actually setting the bar for security-conscious design.

The full report includes Phase 1-6 methodology, a per-Skill audit table for all 52 Skills, and 5 visual infographics: 📋 SECURITY-SCAN-REPORT.md


Contributing

Contributions welcome! Please read CONTRIBUTING.md to learn how to submit new Skill recommendations.

We especially welcome:

  • Skills for social science disciplines: economics, political science, sociology, psychology, education, public administration, public health
  • New Skill implementations for causal inference methods
  • Skills for business disciplines: finance, management, marketing, law & economics
  • General academic Skills: literature review, grant writing, research proposals
  • MCP servers (academic databases, government data APIs)
  • Chinese-friendly Skills
  • Multi-agent collaboration system case studies

Star History

If this list helps you, please give it a Star so more researchers can find it.


AI is an amplifier, not a replacement. It handles the heavy lifting; you keep the core judgment.




Visit copaper.ai
WeChat: CoPaper.AI

20 built-in methodology Skills · 20-minute empirical paper · Powered by StatsPAI (900+ functions, MIT)


Maintained by CoPaper.AI, incubated at Stanford REAP / SCCEI | AI Assistant for Empirical Research
