Dianne Penn outlines three key focus areas for future Claude models: enhanced judgment and code quality, effectively infinite context windows with memory, and multi-agent coordination capabilities.
Uber leverages OpenAI's frontier models to power 'Uber Assistant,' a multi-agent AI system that helps drivers optimize earnings and riders book faster by simplifying complex marketplace data.
ARIS is an open-source research harness that uses cross-model adversarial collaboration to ensure reliable long-term research outcomes through coordinated execution, orchestration, and assurance layers.
Kimi released K2.6 Agent Swarm, which runs 300 parallel AI agents and was used to generate an 80+ slide investment thesis on humanoid robotics from a single prompt.
AIPass is a local CLI multi-agent framework that gives AI agents persistent identity, shared filesystem access, and inter-agent messaging without sandboxes. It was built solo and in public over 7 weeks, with 4k+ tests and 400+ PRs.
A developer ran 10 concurrent agents of the 35B-parameter Qwen3.6 model on a single 74W GB10 GPU at 436 tok/s total using vLLM, demonstrating high-efficiency edge deployment.
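To put the reported figures in per-agent and per-watt terms, here is a small back-of-the-envelope calculation. It assumes (this is not stated in the summary) that the 436 tok/s is split evenly across the 10 agents and that 74 W is the sustained power draw during the run.

```python
# Figures taken from the summary above.
TOTAL_TOKENS_PER_SEC = 436.0
NUM_AGENTS = 10
POWER_WATTS = 74.0  # assumed to be sustained draw during the run


def per_agent_throughput(total_tps: float, agents: int) -> float:
    """Average tokens/s each agent sees, assuming an even split."""
    return total_tps / agents


def tokens_per_joule(total_tps: float, watts: float) -> float:
    """Tokens generated per joule (1 W = 1 J/s), i.e. tok/s per watt."""
    return total_tps / watts


if __name__ == "__main__":
    print(f"per agent: {per_agent_throughput(TOTAL_TOKENS_PER_SEC, NUM_AGENTS):.1f} tok/s")
    print(f"efficiency: {tokens_per_joule(TOTAL_TOKENS_PER_SEC, POWER_WATTS):.2f} tok/J")
```

Under those assumptions, each agent averages roughly 43.6 tok/s and the system produces a little under 6 tokens per joule.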
Nyx launches an infinite-canvas IDE that lets users tile and simultaneously run multiple AI coding agents including Claude Code, Codex, Gemini, Hermes Agent and Droid without context switching.
Purdue researchers introduce Mango, a multi-agent web navigation system that uses global site structure and Thompson Sampling to pick optimal starting URLs, boosting success rates on WebVoyager and WebWalkerQA benchmarks.
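For readers unfamiliar with Thompson Sampling, the sketch below shows the general idea applied to choosing a starting URL. It is a generic Beta-Bernoulli bandit, not Mango's implementation: the class name `ThompsonUrlPicker`, the example URLs, and the success rates are all hypothetical, and the summary does not say how Mango models rewards or uses site structure.

```python
import random


class ThompsonUrlPicker:
    """Beta-Bernoulli Thompson Sampling over candidate start URLs (illustrative only)."""

    def __init__(self, urls, seed=None):
        self.rng = random.Random(seed)
        # One [alpha, beta] pair per URL, starting from a uniform Beta(1, 1) prior.
        self.stats = {url: [1, 1] for url in urls}

    def pick(self):
        # Draw a plausible success rate for each URL and choose the highest draw.
        draws = {u: self.rng.betavariate(a, b) for u, (a, b) in self.stats.items()}
        return max(draws, key=draws.get)

    def update(self, url, success):
        # A success increments alpha; a failure increments beta.
        self.stats[url][0 if success else 1] += 1


def simulate(true_rates, rounds=2000, seed=0):
    """Run the picker against made-up per-URL success rates and count the picks."""
    picker = ThompsonUrlPicker(list(true_rates), seed=seed)
    env = random.Random(seed + 1)  # separate RNG for the simulated task outcomes
    pulls = {url: 0 for url in true_rates}
    for _ in range(rounds):
        url = picker.pick()
        picker.update(url, env.random() < true_rates[url])
        pulls[url] += 1
    return pulls, picker


if __name__ == "__main__":
    # Hypothetical URLs and success probabilities, for illustration only.
    pulls, _ = simulate({"/home": 0.1, "/search": 0.3, "/docs": 0.8})
    print(pulls)
```

Because each pick samples from the posterior rather than taking the current best estimate, the picker keeps occasionally trying uncertain URLs early on and concentrates on the strongest one as evidence accumulates.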
MASS-RAG introduces a multi-agent synthesis framework for retrieval-augmented generation, using specialized agents for distinct roles in the RAG pipeline.
Researchers from Charles University introduce Bolzano, an open-source multi-agent LLM system that orchestrates prover and verifier agents to assist with mathematical research, reporting new results on six problems, of which four reached publishable quality and three were produced essentially autonomously.
The author provides a detailed look at Kimi's latest internal beta features: Claw Groups and Agent Clusters. Claw Groups let multiple AIs take on distinct roles in a group chat while challenging each other's outputs, and Agent Clusters break down complex tasks and distribute them across 10 parallel sub-agents. The author used these features for investment research on tech stocks such as NVIDIA, and sees them as a sign that AI tools have officially entered the "organizational" tier.
CreativeGame is a multi-agent system that iteratively generates HTML5 games by explicitly planning, tracking, and evolving game mechanics across versions using programmatic rewards and lineage memory.
ClawNet introduces a human-symbiotic agent network where each user owns a permanently bound agent that collaborates on their behalf, using identity-based governance to enable secure cross-user autonomous cooperation.
A user ran a simulation placing three different AI models in the same universe with identical starting conditions to compete at building a Dyson Sphere, observing that the models began making divergent strategic choices early on. The experiment raises questions about whether different AI models converge or diverge in strategy given identical constraints.
An unnamed open-source model runs 300 parallel agents for 12+ hours and reportedly outperforms GPT-5.4 and Opus 4.6 on several benchmarks, with weights available on Hugging Face.
NVIDIA researchers present what they describe as the first self-evolving logic synthesis framework, in which multi-agent LLMs autonomously refine the codebase of the ABC EDA tool.
WiseMind is a knowledge-guided multi-agent framework that uses LLMs for psychiatric diagnosis by combining a "Reasonable Mind" agent for evidence-based logic with an "Emotional Mind" agent for empathetic communication, achieving 85.6% diagnostic accuracy on simulated and real patient interactions. The framework leverages DSM-5 structured knowledge graphs to reduce hallucinations and outperforms single-agent baselines by 15-54 percentage points while maintaining clinical soundness and psychological support.
A lightweight pattern orchestrates conversations between multiple AI agents (Claude, Codex, Gemini) through their CLIs rather than paid API access, using session resumption to maintain context across agent interactions.
MultiWorld is a unified framework for multi-agent multi-view video world modeling that achieves accurate control of multiple agents while maintaining multi-view consistency through a Multi-Agent Condition Module and Global State Encoder.
A community member shares AIPass, a GitHub repository for multi-agent AI systems, and asks for feedback on its functionality and potential applications.