Tag
The article discusses the challenge of managing tool access in multi-agent systems, where parallel execution can cause race conditions and coordination issues, leading to inconsistent results.
This paper proposes a multi-agent computer use (MACU) system that uses a manager model to decompose tasks into directed acyclic graphs for parallel execution by subagents. It demonstrates consistent improvements over single-agent baselines on multiple benchmarks and better test-time scaling.
Introducing pi-dynamic-workflows, a tool that enables LLMs to dynamically orchestrate multiple sub-agents for parallel task execution by generating JavaScript scripts, suitable for code audits, large-scale refactoring, and similar scenarios.
Anthropic launched Claude Opus 4.8 with Dynamic Workflows, enabling parallel task execution, and improved judgment that flags uncertainty, moving from a tool to a collaborator.
Aigon is an open-source tool that runs multiple AI coding agents in parallel on the same feature specified in a markdown spec and uses an LLM judge to select the best implementation, with a visual dashboard and optional scheduling.
MobileGym is a browser-based simulation platform for mobile GUI agent research, featuring deterministic state evaluation and scalable parallel execution. It includes a benchmark of 416 tasks and demonstrates gains using GRPO on Qwen3-VL-4B.
multi-cli is a tool that enables running multiple AI coding agents (Claude Code, Codex, Gemini CLI, Cursor) simultaneously with isolated profiles, allowing parallel agentic coding workflows.
Superset is an open-source IDE for orchestrating multiple CLI-based AI coding agents in parallel, with isolated git worktrees, built-in monitoring, and a diff viewer. It supports various agents like Claude Code, Codex CLI, and Gemini CLI.