@uniswap12: I'm truly impressed by how far Claude Code can be used. Matt Van Horn recently wrote a long article revealing all his tips and workflows for using Claude Code. This article sparked considerable discussion in the developer community because what he showed goes far beyond 'using AI...'
Summary
Matt Van Horn shares advanced workflows for using Claude Code: first plan in detail via plan.md, then use voice input and multi-window parallel execution, demonstrating how to deeply integrate AI into the development process, greatly improving efficiency.
View Cached Full Text
Cached at: 06/08/26, 07:21 AM
Claude Code Used to This Extent — I’m Impressed
Matt Van Horn recently published a long article spilling all his tips and workflows for using Claude Code. It caused quite a stir in the developer community because what he shows goes far beyond “using AI to assist with coding.” His core argument is just one sentence: no IDE, just plan.md files and voice. It sounds like bragging, but after reading his entire setup, you realize he’s serious.
Everything starts with a plan — code is written last
Matt says the most important thing he’s learned is: the moment an idea pops into your head, your first reaction should always be /ce: plan. Whether it’s a crazy product idea, a GitHub bug report, or an error message in the terminal, he always takes a screenshot or copies the link, throws it into Claude Code, and asks for a plan first.
The mechanism behind the /ce: plan command is pretty interesting. It launches several research agents in parallel — one analyzes your codebase, reads files, finds patterns, checks coding conventions; another digs through your past bug-fixing experience docs; if needed, more agents look up external best practices and framework documentation. Everything runs simultaneously and eventually merges into a structured plan.md that clearly states the problem, the proposed solution, which files need to change, acceptance criteria with checkboxes — and all of this is based on your own codebase and history, not generic advice.
Then /ce: work takes that plan and executes it — breaks down tasks, writes code, runs tests, ticks off acceptance criteria one by one. If context is lost mid-session, you can open a new session pointing to that plan and continue. The plan file is a save point that never disappears.
Traditional development is 80% coding, 20% planning. Matt has completely flipped that ratio. Thinking happens in the plan, execution is handed to the machine. This idea doesn’t just apply to writing code — for any complex task, spending more time figuring out what to do and how to do it, then automating execution as much as possible, yields much higher efficiency. Many people jump in and start doing things, thinking as they go, only to find halfway through that they’re heading in the wrong direction and have to start over. Everyone knows the principle of planning first, then executing — but Matt turned it into a mandatory workflow using tools. That’s the key.
Compound Engineering: the plugin that makes plan-driven development real
The workflow that makes this plan-first approach actually work is a plugin called Compound Engineering from Every Inc. The install command is just one line: /plugin marketplace add EveryInc/compound-engineering-plugin.
Matt went from being a loyal fan of this plugin to its third-largest contributor on GitHub, with 21 commits. He now has 70 plan files and has made 263 commits in the last 30 days. He’s set himself a hard rule: unless it’s literally a one-line change, always write a plan.md first.
That number speaks volumes. 70 plan files correspond to 263 commits, meaning each plan yields nearly 4 commits on average. The clearer the plan, the higher the execution efficiency, the lower the rework rate. This is the same principle emphasized in many project management methodologies — “front-loading” — except Matt has applied it to his daily development work, with tools to enforce it.
Voice Input: when the listener is smart enough, speaking becomes the most efficient input method
Matt says he used to hate voice memos — Apple’s built-in dictation feature made him want to throw his phone. But voice-to-LLM is completely different. Transcription doesn’t need to be perfect because Claude Code understands context and can guess what the microphone missed. You can mumble, go off-topic halfway, restart — none of that matters.
The tool he uses is called Monologue, also from Every Inc. It takes what you say and inputs it directly into the currently focused application. You speak, it types into Claude Code. He even bought a gooseneck microphone for his desk. What’s more, he wrote part of that article while driving his kids around on Tesla FSD, dictating with his voice.
The reason voice input has always been terrible is the core problem of transcription accuracy. Every word you say must be recognized perfectly, or errors occur. But when the receiver changes from a rigid word processor to a context-aware AI, that bottleneck disappears. The AI doesn’t need every word crystal clear — it can infer your meaning from context. This change seems small, but it actually unlocks a whole new mode of human-computer interaction: you can talk to the AI the way you talk to a colleague, without weighing every word, worrying about phrasing, or even grammar.
Four to six windows running in parallel: one person, one team
Matt’s daily work mode is to have four to six Ghostty terminal windows open simultaneously, each running an independent Claude Code session. One is writing a plan, another is building code based on a different plan, another is running /last30days for research, and yet another is fixing a bug he found while testing the previous feature.
While one window’s /ce: plan is spinning up research agents, he switches to another window running /ce: work to execute an already-written plan. While that one is building, a third window gets a new bug pasted in. By the time he switches back to the first window, the plan is already done, waiting quietly in the editor.
To make this parallel work mode feasible, he changed three key configurations. First, skip permission confirmations — by default Claude Code asks for “allow” on every action, he opened up all permissions in the config file so each session runs autonomously. Second, play a sound upon completion — so he can walk away and come back when he hears the chime. Third, set Zed editor to auto-save every 500 milliseconds — so when Claude Code changes a file, the editor updates instantly; conversely, when he types in the editor, Claude perceives it within a second. The whole experience feels like collaborating on Google Docs, except the collaborator is an AI.
The cost of this workflow is obvious: his MacBook dies in about an hour — six Claude sessions running in parallel drain the battery hard. He just ordered a new MacBook Pro. But think about it: one person pushing four to six tasks forward simultaneously was previously unimaginable. You used to need a whole team to parallelize different workflows. Now one person plus a set of AI sessions can do it.
/last30days: before making any decision, see what the community is talking about
Before doing /ce: plan, Matt often runs a research tool called /last30days. He open-sourced it himself — it already has 4,500 stars on GitHub. It searches Reddit, X, YouTube, TikTok, Instagram, Hacker News, Polymarket, and the web in parallel, pulling back all community discussions on a topic from the past 30 days in minutes.
He gave an example. When he was choosing between Vercel’s agent-browser and Playwright, instead of reading docs, he ran /last30days. Minutes later, results came back: 78 Reddit posts, 76 tweets, 22 YouTube videos, 15 Hacker News discussions. The data showed agent-browser uses 82%–93% fewer context tokens than Playwright — Playwright’s tool definition alone eats 13,700 tokens.
He then fed all that output into /ce: plan, and the resulting plan was based on the real-time understanding of the community, not six-month-old training data.
The value of this approach: AI models’ training data is always lagging, but tech community discussions are real-time. First, use a tool to grab the latest community consensus, then let the AI plan based on that — decision quality improves dramatically. This idea isn’t limited to tech choices; for product decisions, market research, or competitive analysis, seeing what real users are talking about is far more reliable than brainstorming in a vacuum.
Lunch chat turns into product proposal: the compounding effect of context
Matt told a particularly interesting story. He had lunch with a potential candidate, talking for an hour and a half — covering a new product idea, food, restaurants, kids, everything. He kept Granola recording the entire time.
After lunch, he pasted the full meeting transcript into Claude Code and asked it to turn the conversation into a product proposal. The key: Claude Code already knew where their product code lived on GitHub and had access to every strategic plan.md he’d written before. So when processing the lunch conversation, it didn’t just extract the product idea — it cross-referenced it against the actual codebase and all past strategic decisions.
The result came out great in one shot — goals, user stories, technical approach, milestones — automatically ignoring the parts about restaurants and sushi. He sent it to the candidate that evening. That person later joined them full-time and is now working on that product.
The most notable concept in this story is “the compounding effect of context.” Every strategic document you’ve written, every technical decision you’ve made, every piece of experience you’ve accumulated — if saved in a structured way, becomes reference material for the AI’s next decision. Over time, the more you accumulate, the more accurate the AI’s suggestions become. It’s like compound interest: early on, the difference is negligible, but as time goes on, the gap widens dramatically.
Mac Mini becomes a 24/7 AI workstation
Matt has a Mac Mini dedicated to running OpenClaw, but he also uses it for two clever things.
First, remote control via Telegram. Claude Code has Telegram integration — he just sends messages from his phone to the Mac Mini. Think of a bug during dinner? Type /ce: plan fix the timeout issue into Telegram, and by the time you get back to your computer, the plan is already waiting in the editor.
Second, using tmux on airplanes. Claude Code handles airplane WiFi poorly — a dropped connection kills the session. But if he tmuxes into the Mac Mini, the session runs on that machine, and his laptop is just a window. WiFi drops for 20 minutes? No problem — reconnect, session is still there, still working. He shipped features the entire flight from Europe back home.
A few-thousand-dollar Mac Mini, plus some configuration, becomes a 24/7 AI workstation that can be dispatched remotely anytime. The cost-effectiveness of this approach is extremely high — for anyone who needs to run AI tasks for long periods, it’s worth considering.
Cost strategy: Claude handles thinking, Codex handles grunt work
Such intensive usage naturally comes with cost implications. Four to six Opus sessions running all day will burn through the $200/month Claude Max plan quickly.
Matt’s solution is to add another $200/month Codex plan. He submitted a /ce: work --codex feature to Compound Engineering — when Claude’s quota runs out, it automatically switches to Codex for execution. The two plans complement each other: Claude handles planning and orchestration, Codex handles heavy code implementation.
Some friends use Codex to review code written by Claude Code, and vice versa. Others prefer Codex’s code output quality but use Claude Code for orchestration. This “let different AIs do what they’re best at” approach is like managing a team: you wouldn’t have the same person doing both strategic planning and hands-on execution. Same for AI — let the thinkers think, let the doers do.
He also mentioned a “good night mode” that lets agents keep working while he sleeps, but he’ll explain how later. Just the concept is exciting enough — it means your work hours go from a dozen or so per day to 24 hours.
Disney World: this workflow isn’t just for code
At the end of the article, Matt tells a vivid story completely unrelated to code. Watching his kids at a soccer game, a parent next to him talked about planning a Disney World trip. He pulled out his laptop on the spot and demonstrated.
First, run /last30days Disney World. Two minutes later, he had 66 Reddit posts, 34 tweets, and 8 YouTube videos with up-to-date info — price trends, which rides are under maintenance, which are reopening soon. Then with /ce: plan, he input his requirements: four parks in one day, rides they want to do, budget, kids’ ages. Claude’s research agents cross-referenced the data and wrote a structured guide, including park order, FastPass booking strategy, alarm reminders to set a week in advance, and even kids’ height requirements.
He also made a three-day itinerary for the other parent — 305 lines, with daily schedules and a note saying “make sure your five-year-old wears shoes when you measure their height this week.” Then he told Claude to deploy the guide as a Vercel webpage so it’s easy to view on a phone. Finally, he sent the plan to OpenClaw via Telegram, which set reminders on the calendar and set up cron jobs as a backup.
Voice → research → plan → website → automated reminders — all done from the soccer sidelines.
The significance: this workflow applies far beyond software development. Research, plan, execute, deploy, automate — this loop works for any scenario that requires processing complex information and making decisions. Travel planning is just one example; replace it with market research, event planning, or investment analysis — the logic is the same.
Matt concludes his article with his full setup:
A voice app, a plan file plugin, three configuration changes, four to six parallel sessions, a Mac Mini, and lunch meetings that turn into product proposals. No IDE, no code. Speak, plan, build. At the desk, on the couch, in the car, at the soccer field.
The greatest value of this article might not be the specific tools and configurations, but the new work paradigm it demonstrates: humans think and decide, AI researches and executes. When you polish the collaboration between the two to be smooth enough, what one person can achieve goes far beyond imagination.
And it all starts with one simple habit: when you have an idea, first write a plan.
#AI #AIAgent @grok
Similar Articles
@iluciddreaming: 1/6 Meshy AI founder Hu Yuanming (Ethan) wrote an article: how he got 10 Claude Code instances working for him simultaneously. Not a show-off, but a real working system.
Meshy AI founder Hu Yuanming (Ethan) shares his systematic approach to orchestrating 10 Claude Code instances to work together—a practical, deployable multi-agent workflow.
@yaohui12138: I've finished reading it. Here are some key takeaways I've compiled for everyone: In this session, he primarily broke down a core mechanism overlooked by 90% of users: the CLAUDE.md context injection system. This system is divided into three levels: Enterprise-level: Organization-wide mandatory rules that cannot be overridden by individual settings. Project-level: Team-shared code standards and workflows. Loc...
The article shares key insights from a workshop by Boris on using CLAUDE.md for context injection in Claude, highlighting three usage levels, specific commands like /loop, and plan mode to improve developer workflows.
@servasyy_ai: The head of Claude Code says he hasn’t written a single line of code by hand in months. He shipped 49 complete features in two days, all 100 % AI-generated. Just dropped a 30-minute walkthrough on exactly how he did it—more valuable than any $500 vibe-coding course. Save this.
Claude Code lead claims months without hand-coding, delivered 49 AI-built features in two days, and posted a 30-minute breakdown of the process.
@ma_zhenyuan: https://x.com/ma_zhenyuan/status/2057702858800370052
This article introduces Superpowers, a set of AI workflow Skills based on Claude Code, providing automated brainstorming, planning, sub-agent development, and test-driven development, which can significantly improve AI delivery efficiency.
@billtheinvestor: Anthropic founder @bcherny shares the internal team’s Claude Code workflow in a 12-minute video packed with core tips.
Anthropic founder @bcherny posted a 12-minute video detailing internal team workflows and core tips for using Claude Code.