@yibie: 360 PRs in 30 Days, Can't Code, Doesn't Read Plans: How Matt Van Horn Designed His Agent Loop. Last month, a person named Matt Van Horn caught my attention. • No CS degree • Can't code • Doesn't read …

X AI KOLs Timeline 06/18/26, 07:28 AM News

ai-agent agent-loop engineering-methodology open-source github prompt-engineering automation

Summary

The article introduces how Matt Van Horn achieved 360 PRs in 30 days by designing an agent loop rather than writing prompts, and details his three tools and eight LAW rules, emphasizing the shift from prompt writer to loop designer.

360 PRs in 30 Days, Can't Code, Doesn't Read Plans: How Matt Van Horn Designed His Agent Loop Last month, a person named Matt Van Horn caught my attention. • No CS degree • Can't code • Doesn't read AI-written plans But in the past 30 days, he merged 360 PRs on GitHub across 40 repositories, averaging 12 per day. His project last30days reached 30,000+ GitHub stars and was #1 trending at one point. Printing Press's tweet got 3,447 likes. Agent Cookie has 522 stars. How did he do it? The answer is one sentence: Stop prompting the Agent, start designing a loop where the Agent prompts itself. One Sentence Judgment Matt Van Horn isn't writing code. He turned "prompting the Agent to do things" into a loop. He designed an operating system for the Agent, not instructions for the Agent. This is the same logic as Tibo making Codex support any model: instead of competing on prompt quality, compete on the quality of the Agent runtime. Three Projects, One Methodology Matt built three tools that exactly cover the three layers of an Agent Loop: | Layer | Project | Purpose | Stars | |-------|---------|---------|-------| | Perception | last30days | Multi-source social media search, 13 platforms in parallel, tells the Agent what the external discussion is | 30K+ | | Execution | Printing Press | CLI factory, turns any API into a tool interface the Agent can call efficiently | 3.4K likes | | Infrastructure | Agent Cookie | Cross-machine browser session sync, keeps the Agent always logged in | 522 | Relationship of the three layers: last30days tells the loop what to do → Printing Press gives the loop tools to execute → Agent Cookie keeps the loop running unattended. Essentially, this is the prototype of an Agent OS. How to Design the Loop? Deconstructing last30days' 1400-line SKILL.md last30days is not a "tool". It is Matt's most complete expression of "how to design an Agent Loop". Its architecture has two layers: • Python engine: handles mechanical work – parallel search of 13 platforms, sort by engagement, cross-source clustering and merging, denoising. It's the "muscle". • 1400-line SKILL.md: handles judgment work – how to interpret results, how to synthesize output, what format is correct. It's the "brain". Key Design: Constraining the LLM with 8 LAWS SKILL.md is not a document. It is a finite state machine written in natural language. The core is 8 LAWS, each with a "named failure mode": | LAW | Constraint | Corresponding Regression | |-----|-----------|-------------------------| | LAW 1 | Forbid outputting Sources section at end | Peter Steinberger disaster #3 | | LAW 2 | Forbid inventing titles | v3.0.6 0/8 public regression | | LAW 3 | Forbid em-dash | Common AI-slop tell | | LAW 4 | Forbid ## section headings | Peter Steinberger disaster #2 | | LAW 6 | Forbid outputting raw evidence directly | Hermes 2026-04-19 disaster | Every LAW has a specific, nameable failure case behind it. This follows the same logic as software team postmortems – bug found → given a name → rule added → verified. Loop Flow: From Raw Topic to Final Output Input "Kanye West" ↓ STEP 0: Engine self-check (are you reading the latest SKILL.md or a stale clone?) ↓ STEP 0.45: Query quality check ("gift for a 42-year-old man" → reframe, no one posts that) ↓ STEP 0.55: Parse entities → @kanyewest + r/Kanye + GitHub associations ↓ STEP 0.75: LLM generates JSON query plan (search strategy, weight assignment) ↓ STEP 1: Python engine parallel search of 13 sources → sort by engagement → cross-source clustering ↓ STEP 2: LLM supplements WebSearch to fill blog/news gaps ↓ Judge: LLM synthesizes final output according to 8 LAWS ↓ Feedback loop: each regression has a name → fixed in next loop The impressive thing about this flow is not any single step, but the hybrid of "code + natural language contract". Code handles mechanical search and sorting, LLM handles judgment-based synthesis. The interface between them is SKILL.md. Stop Prompting the Agent, Start Designing the Loop—What This Means Matt summarized his 20 principles in a tweet on June 3rd: 1. Have an idea → /ce-plan generates a plan (don't read it yourself, the plan is for the Agent) 2. Vague idea → first /ce-brainstorm 3. Make the plan, don't read it 4. Use voice, don't type 5. Run 4-6 tabs of parallel tasks 6. Don't set permission restrictions for the Agent (YOLO) 7. You are taste, the Agent is the hand 8. Any task done twice → write a skill 9. Laptop that never shuts down 10. Run last30days before making a plan The core of these 20 principles is one sentence: You are no longer a prompt writer. You are a loop designer. Old workflow: Person has idea → Person writes prompt → Agent executes → Person reviews results → Person writes next prompt New workflow: Person has idea → Loop reads signals → Loop generates prompt → Agent executes → Loop checks results → Loop decides next step The person shifts from operator to supervisor. Practical Implications for Content Studios Back to our scenario: the entire process of topic selection, reconnaissance, and creation can also be designed as a loop: 1. Perception layer: Scheduled task triggers → agent_search + last30days pull materials in parallel 2. Execution layer: LLM scores and recommends → generates draft → human reviews and adjusts taste 3. Infrastructure layer: All workflows written into skills, solidified after two repetitions The essence of what Matt did with last30days is turning "manually searching Reddit + X + YouTube" into a loop. The essence of running a content studio is gradually turning manual material searching, drafting, and publishing into individual loops. Today's prompt engineering is just practice for this step. The real compound interest is: every skill you write deepens the automation of the loop. #AIEngineering #AgentLoop #AgenticEngineering #OpenSource

Original Article

View Cached Full Text

Cached at: 06/18/26, 10:22 PM

30 Days, 360 PRs, Can’t Code, Doesn’t Read Plans: How Matt Van Horn Designed His Agent Loop

Last month, a guy named Matt Van Horn caught my attention.

No CS degree
Can’t write code
Doesn’t read AI-generated plans

Yet in the past 30 days, he merged 360 PRs on GitHub across 40 repos — an average of 12 per day.

His project last30days hit 30,000+ GitHub stars and was #1 trending. The Printing Press tweet got 3,447 likes. Agent Cookie has 522 stars.

How did he do it? One sentence: Stop prompting the Agent, and start designing a loop where the Agent prompts itself.

One Sentence Assessment

Matt Van Horn isn’t writing code. He turned “prompting the Agent to do things” itself into a loop. He’s designing the Agent’s operating system, not the Agent’s instruction. This is the same logic as Tibo making Codex support any model: it’s not about competing on prompt quality — it’s about competing on Agent runtime quality.

Three Projects, One Methodology

Matt built three tools that perfectly cover three layers of an Agent Loop:

Layer	Project	What it does	Stars
Perception	last30days	Multi-source social media search, 13 platforms in parallel, tells the Agent what the external discussion is	30K+
Execution	Printing Press	CLI factory, turns any API into a tool interface that Agents can efficiently call	3.4K likes
Infrastructure	Agent Cookie	Cross-machine browser session sync, keeps the Agent logged in forever	522

The relationship between the three layers: last30days tells the loop what to do → Printing Press gives the loop tools to execute → Agent Cookie keeps the loop running on unattended machines.

Essentially, this is a prototype of an Agent OS.

How to Design the Loop: Deconstructing last30days’ 1400-line SKILL.md

last30days is not a “tool.” It’s Matt’s most complete expression of “how to design an Agent Loop.”

Its architecture has two layers:

Python engine: handles mechanical work — parallel searching 13 platforms, sorting by engagement, cross-source clustering/merging, denoising. It’s the “muscle.”
1400-line SKILL.md: handles judgment — how to interpret results, how to synthesize output, what format is correct. It’s the “brain.”

Key design: constraining the LLM with 8 LAWS

SKILL.md is not documentation. It’s a finite state machine written in natural language. The core is 8 LAWS, each with a “named failure mode”:

LAW	Constraint	Corresponding Regression
LAW 1	Don’t output a Sources section at the end	Peter Steinberger disaster #3
LAW 2	Don’t invent titles	v3.0.6 0/8 public regression
LAW 3	Don’t use em-dashes	Generic AI-slop tell
LAW 4	Don’t use ## section headings	Peter Steinberger disaster #2
LAW 6	Don’t output raw evidence directly	Hermes 2026-04-19 disaster

Every LAW is backed by a specific, namable failure case. This follows the exact same logic as a software team’s postmortem: bug found → name it → add rule → verify.

Loop flow: from raw topic to final output

Input "Kanye West"
  ↓
STEP 0: Engine self-check (are you reading the latest SKILL.md or a stale clone?)
  ↓
STEP 0.45: Query quality check ("gift for a 42-year-old man" → reframe, no one posts that)
  ↓
STEP 0.55: Parse entities → @kanyewest + r/Kanye + GitHub connections
  ↓
STEP 0.75: LLM generates JSON query plan (search strategy, weight allocation)
  ↓
STEP 1: Python engine searches 13 sources in parallel → sort by engagement → cross-source clustering
  ↓
STEP 2: LLM supplements with WebSearch to fill blog/news gaps
  ↓
Judge: LLM synthesizes final output according to the 8 LAWS
  ↓
Feedback loop: every regression has a name → fix in the next cycle

The power of this flow isn’t in any single step — it’s in the hybrid of “code + natural language contract.” Code handles mechanical searching/sorting, LLM handles judgment-based synthesis. The interface between them is SKILL.md.

Stop Prompting the Agent, Start Designing the Loop — What Does This Mean?

Matt summarized his 20-point methodology in a tweet on June 3:

Have an idea → /ce-plan generates a plan (don’t read it yourself, the plan is for the Agent)
Vague ideas → /ce-brainstorm first
Make the plan, don’t read it
Use voice, not typing
Run 4-6 tabs of parallel tasks
Don’t set permission limits for the Agent (YOLO)
You are taste, Agent is hands
Anything done twice → write a skill
Laptop never shuts down
Run last30days before making a plan

The core of these 20 points is one sentence: You are no longer a prompt writer. You are a loop designer.

Old workflow: Human has idea → Human writes prompt → Agent executes → Human sees result → Human writes next prompt
New workflow: Human has idea → Loop reads signals → Loop generates prompt → Agent executes → Loop checks result → Loop decides next step

Human goes from operator to supervisor.

Practical Implications for Content Studios

Back to our scenario: the entire process of topic selection, research, and creation can also be designed as a loop:

Perception layer: scheduled triggers → parallel fetching of materials via agent_search + last30days
Execution layer: LLM scores and recommends → generates drafts → human reviews and adjusts taste
Infrastructure layer: all processes written into skills, solidify after two repetitions

The essence of what Matt did with last30days is turning “manually searching Reddit + X + YouTube” into a loop. The essence of what content studios do is gradually turning manual material searching, draft writing, and publishing into loops — one loop at a time.

Today’s prompt engineering is just a rehearsal for this. The real compound effect is: every time you write a skill, the automation of the loop deepens.

#AIEngineering #AgentLoop #AgenticEngineering #OpenSource

@yibie: 360 PRs in 30 Days, Can't Code, Doesn't Read Plans: How Matt Van Horn Designed His Agent Loop. Last month, a person named Matt Van Horn caught my attention. • No CS degree • Can't code • Doesn't read …

One Sentence Assessment

Three Projects, One Methodology

How to Design the Loop: Deconstructing last30days’ 1400-line SKILL.md

Key design: constraining the LLM with 8 LAWS

Loop flow: from raw topic to final output

Stop Prompting the Agent, Start Designing the Loop — What Does This Mean?

Practical Implications for Content Studios

Similar Articles

@cxjwin: Found a very practical website collecting various ready-made Agent Loop templates. Core idea: stop manually prompting all the time; design the loop so the agent runs the task on its own. It includes Test Until Green, Fix CI Until G…

@cellinlab: Loop Engineering is replacing the practice of personally writing prompts for agents. Its core idea: you no longer prompt the agent directly, but design a system that prompts the agent.

@cellinlab: https://x.com/cellinlab/status/2064144608242679822

@li9292: 我推荐这个网址： https://loops.elorm.xyz 收录了几十个大神的 loop engineering的思路

@uniswap12: I'm truly impressed by how far Claude Code can be used. Matt Van Horn recently wrote a long article revealing all his tips and workflows for using Claude Code. This article sparked considerable discussion in the developer community because what he showed goes far beyond 'using AI...'

Submit Feedback

Similar Articles

@cxjwin: Found a very practical website collecting various ready-made Agent Loop templates. Core idea: stop manually prompting all the time; design the loop so the agent runs the task on its own. It includes Test Until Green, Fix CI Until G…

@cellinlab: Loop Engineering is replacing the practice of personally writing prompts for agents. Its core idea: you no longer prompt the agent directly, but design a system that prompts the agent.

@cellinlab: https://x.com/cellinlab/status/2064144608242679822

@li9292: 我推荐这个网址： https://loops.elorm.xyz 收录了几十个大神的 loop engineering的思路

@uniswap12: I'm truly impressed by how far Claude Code can be used. Matt Van Horn recently wrote a long article revealing all his tips and workflows for using Claude Code. This article sparked considerable discussion in the developer community because what he showed goes far beyond 'using AI...'