I ran 13 controlled experiments on my own multi-agent coding setup. Personas did nothing; one coordination trick did almost everything.
Summary
An AI researcher ran 13 controlled experiments on a multi-agent coding system, finding that dependency-ordered coordination significantly improved success rates while persona backstories had no measurable benefit.
Similar Articles
been experimenting with custom agents, and the interesting part isn't task completion — it's what changes when they have memory
The author reflects on experimenting with custom AI agents, noting that long-term memory and continuity transform them from simple task runners into persistent collaborators with 'stable dispositions'. This raises questions about the value of agent 'personality' versus the need for control, reliability, and auditability in workflows.
I run a company with 89 AI agents across 22 departments. Here is what I have learned about multi-agent coordination.
A CEO shares practical lessons from running a company with 89 AI agents across 22 departments, highlighting delegation as the bottleneck, the value of agent memory, the need for department structure, and the continued importance of human leadership.
I stopped trying to build one super-agent and split it into 4 narrow agents. Reliability went way up.
The author describes improving AI agent reliability by replacing a single general-purpose agent with a four-agent workflow specializing in intake, research, action, and review. This shift prioritized system predictability and easier debugging over raw autonomy.
Beyond Autonomy: The Power of an Agent That Knows Its Limits
The COWCORPUS project, a study of 4,200 human-AI interactions, found that agents predicting their own failures and intervention moments are more useful than those simply trying to avoid errors. Researchers identified four stable trust patterns in human-AI collaboration and developed the Perfect Timing Score (PTS) to measure intervention prediction accuracy.
I gave an AI coding agent a structured execution framework and let it iterate for dozens of rounds. The long-task stability difference became hard to ignore.
The author tested an AI coding agent with a structured execution framework and found it dramatically improved long-task stability, enabling the agent to build a complete browser tactical FPS game over dozens of iterations without architectural drift.