multi-agent-reasoning

#multi-agent-reasoning

Mixture of Debaters: Learn to Debate at Architectural Level in Multi-Agent Reasoning

arXiv cs.AI ↗ · yesterday Cached

Proposes Mixture of Debaters (MoD), a framework using Mixture-of-Experts to enable dynamic self-debate within a single LLM, achieving superior accuracy with drastically lower latency and token consumption.

0 favorites 0 likes

#multi-agent-reasoning

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

arXiv cs.CL ↗ · 2026-06-17 Cached

This paper proposes the LLM-as-Environment-Engineer framework, where a policy model analyzes failures to automatically redesign the training environment for reinforcement learning, and introduces MAPF-FrozenLake as a controllable testbed. The framework, using Qwen3-4B, outperforms larger models like GPT and Gemini, showing that policy learning improves the model's ability to diagnose weaknesses.

0 favorites 0 likes

multi-agent-reasoning

Mixture of Debaters: Learn to Debate at Architectural Level in Multi-Agent Reasoning

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Submit Feedback