@garrytan: My simple secret to agentic coding https://forbes.com/sites/josipamajic/2026/04/12/the-yc-chief-who-codes-10000-lines-a…
Summary
Garry Tan of Y Combinator shares his 'thin harness, fat skills' agentic coding framework, while the accidental leak of Claude Code's source code reveals the complex scaffolding behind AI coding agents.
View Cached Full Text
Cached at: 05/22/26, 11:54 PM
My simple secret to agentic coding https://forbes.com/sites/josipamajic/2026/04/12/the-yc-chief-who-codes-10000-lines-a-day-has-a-simple-secret/…
The YC Chief Who Codes 10,000 Lines A Day Has A Simple Secret
Source: https://www.forbes.com/sites/josipamajic/2026/04/12/the-yc-chief-who-codes-10000-lines-a-day-has-a-simple-secret/
SUQIAN, CHINA - JUNE 10: In this photo illustration, the logo of AI Agent is displayed on a smartphone screen on June 10, 2025 in Suqian, Jiangsu Province of China. (Photo by VCG/VCG via Getty Images)
VCG via Getty Images
Garry Tan runs Y Combinator, the accelerator that launched Airbnb, Stripe, and Coinbase. He alsoclaims to ship 600,000 lines of production code every 60 days, part-time, while running that organization full-time. The number sounds fabricated until you understand the architecture behind it, and the architecture fits on an index card.
In awidely circulated post on X, Tan laid out a framework he calls thin harness, fat skills: a design philosophy for AI agents that theMarch 31 accidental leak of Claude Code’s 512,000-line TypeScript sourceconfirmed in unexpected detail. The insight is that you can use the same model, but increase it’s output significantly by leveraging scaffolding.
The Leak That Proved the Point
Whensecurity researcher Chaofan Shoudiscovered that Anthropic had accidentally published a 59.8 MB source map file alongside version 2.1.88 of Claude Code on npm, the immediate story was the security lapse. The story was what the 1,906-file codebase revealed about how production AI products actually work.
Claude Code,as TechTalks analyzed, is not a thin wrapper around a language model. It is a self-healing query loop, a background memory daemon called autoDream, concurrency-safe tool batching, compile-time feature gating, and a context management system that prevents the model from drowning in its own history. The model sits at the center of all of it and the harness does everything else.
Anthropic’s head of Claude Code, Boris Cherny, confirmed the incident was a plain developer error in astatement on X, and added a note that has since circulated widely:“100% of my contributions to Claude Code were written by Claude Code.”
Five Definitions Investors Should Understand
Tan’s framework, articulated in the X post and instantiated ingstack, his open-source Claude Code configuration that accumulated66,000 GitHub stars within weeks of release, rests on five concepts that reframe how AI development tools should be evaluated.
Skill filesare reusable markdown documents that encode process, not content. The same /investigate skill, pointed at a safety scientist or at FEC filings, produces radically different outputs because the skill describes judgment and the invocation supplies the world. Tan frames these as method calls, with markdown as the programming language and human judgment as the runtime.
The harnessruns the model in a loop, manages context, reads and writes files, and enforces safety. The anti-pattern Tan explicitly calls out is a fat harness: 40-plus tool definitions eating half the context window, god-tools with multi-second MCP round-trips, REST API wrappers that turn every endpoint into a separate tool. The Claude Code source confirms this discipline: opinionated, narrow tooling throughout.
Resolversare routing tables for context. When task type X appears, load document Y. The embedded resolver in Claude Code matches user intent to skill descriptions automatically. Tan’s CLAUDE.md went from 20,000 lines to roughly 200, with pointers to documents rather than the documents themselves.
Latent versus deterministicis the most consequential distinction in agent design. Judgment, synthesis, and pattern recognition belong in latent space. SQL queries, arithmetic, and combinatorial optimization belong in deterministic tooling. Forcing a deterministic problem through a model produces outputs that look plausible and are wrong. The best systems are ruthless about which work goes where.
Diarizationis the step that turns document retrieval into genuine analysis. The model reads everything about a subject and produces a structured single-page brief, a distillation no SQL query or RAG pipeline replicates. Tan uses it throughout his YC Startup School matching system: 6,000 founder profiles, running nightly, surfacing the gap between what founders say they are building and what the commit history shows.
The VC Angle
The framework matters to investors for a reason that has little to do with software architecture. The companies that understand this distinction, model versus harness, are compounding in ways that companies chasing raw model capability are not.
The productivity gains have a direct implication for how investors evaluate AI-native teams. The question is no longer which model a company uses. Model capability is table stakes andcommoditizing faster than most forecasts expected. The question is whether the team has built the surrounding architecture with the same rigor they applied to the product itself. Skill files that encode domain judgment and context management that scales beyond a single developer.
The Claude Code leak made this legible in a way that blog posts could not. Anthropic’s moat is not the model butthe self-healing loop, the memory architecture, the anti-distillation mechanisms, the years of engineering judgment baked into 1,906 TypeScript files. Competitors can read the source now but the judgment encoded in it took years to accumulate.
What This Means Going Forward
Steve Yegge,whose productivity estimates Tan cites, puts well-harnessed AI agents at 10x to 100x the productivity of developers using standard chat tools, and roughly 1,000x relative to baseline knowledge workers in 2005. The 2x people and the 100x people use the same underlying models.
For founders; every repeatable task that crosses your desk should become a skill file, not a recurring prompt. If you ask your agent for the same thing twice, you are alreadly losing. For investors performing due diligence on AI-native companies, the question to ask is whether the team’s productivity claims survive a harness audit. Whose context management sits in a 20,000-line CLAUDE.md that degrades with scale, and whose sits in 200 lines of pointers to purpose-built skills.
Tan’sgstackis MIT-licensed, installs in thirty seconds, and has already been forked more than 9,100 times. The architecture it embodies will outlast any model generation currently in production.
Similar Articles
@garrytan: https://x.com/garrytan/status/2054064931515855118
Garry Tan argues that AI coding agents like Claude Code and Codex have changed software engineering by making high test coverage affordable, creating a 'complexity ratchet' that ensures code quality improves over time without sacrificing speed.
@garrytan: https://x.com/garrytan/status/2061454423034110372
Garry Tan argues that developers are over-engineering with excessive code when using AI agents; instead, they should trust the model and build minimal, instruction-based software, exemplified by his open-source project GStack.
@garrytan: The reason why I release my X articles about AI agents (fat skill fat code thin harness) and GStack and GBrain is that …
Garry Tan explains the motivation behind releasing his X articles on AI agents and concepts like GStack and GBrain, emphasizing the idea of 'process power' as a personal moat.
@garrytan: My friend @finbarr says: It's like code as memory. You work with your agent in a non deterministic way to figure out ho…
Garry Tan discusses the concept of "code as memory" for AI agents, suggesting they generate executable scripts for new tasks and reuse them for efficiency.
@SaitoWu: https://x.com/SaitoWu/status/2052967845626290326
YC CEO Garry Tan shared how he returned to active development after 13 years away from coding, using Claude Code and OpenClaw with a 'Thin Harness + Fat Skills' methodology to achieve a 400x productivity boost. He also built an agentic news platform called Garry's List and an agent workflow framework called Gstack.