multi-agent-reinforcement-learning

#multi-agent-reinforcement-learning

Learn to Match: Two-Sided Matching with Temporally Extended Feedback

arXiv cs.LG ↗ · yesterday Cached

This paper introduces a framework for two-sided matching with temporally extended feedback, formulating it as a partially observable Markov game with costly screening, noisy observations, and evolving latent profiles. The authors present Learn2Match, a multi-agent reinforcement learning benchmark, and show that independent PPO outperforms bandit baselines in social welfare but incurs higher information-friction loss.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Scalable Constrained Multi-Agent Reinforcement Learning via State Augmentation and Consensus for Separable Dynamics

arXiv cs.LG ↗ · 2026-06-01 Cached

This paper presents a distributed approach for constrained multi-agent reinforcement learning that uses state-augmented policy learning and neighbor-to-neighbor consensus over dual variables to satisfy global resource constraints while scaling linearly with the number of agents. Experiments on smart grid demand response demonstrate that consensus coordination is essential for feasibility, scaling to thousands of agents unlike centralized training approaches.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Differentiable Belief-based Opponent Shaping

arXiv cs.AI ↗ · 2026-05-29 Cached

This paper introduces Differentiable Belief-based Opponent Shaping (D-BOS), a first-order method that treats observer beliefs as the shaped state and differentiates through belief update dynamics, allowing optimal strategies to emerge naturally from the environment's reward structure in hidden-role multi-agent settings.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty

arXiv cs.LG ↗ · 2026-05-21 Cached

This paper proposes a multi-agent reinforcement learning framework that co-trains an autonomous vehicle and pedestrians with personality-driven jaywalking behavior, achieving a 30% reduction in collisions compared to single-agent approaches and demonstrating more realistic interaction scenarios.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Hugging Face Daily Papers ↗ · 2026-05-20 Cached

This paper introduces SLIM, a minimal architecture that decouples communication from policy representation in multi-agent reinforcement learning, achieving state-of-the-art performance under bandwidth constraints with minimal degradation.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Quantum Advantage in Multi Agent Reinforcement Learning

arXiv cs.LG ↗ · 2026-05-15 Cached

This paper presents empirical evidence that quantum entanglement provides a measurable advantage in multi-agent reinforcement learning, using the CHSH game and cooperative navigation tasks to demonstrate performance improvements over classical baselines.

0 favorites 0 likes

#multi-agent-reinforcement-learning

Randomness is sometimes necessary for coordination

arXiv cs.AI ↗ · 2026-05-11 Cached

The paper introduces Diamond Attention, a method for multi-agent reinforcement learning that uses structured randomness to break symmetry and enable role differentiation among homogeneous agents, achieving perfect coordination in symmetric tasks like the XOR game.

0 favorites 0 likes

multi-agent-reinforcement-learning

Learn to Match: Two-Sided Matching with Temporally Extended Feedback

Scalable Constrained Multi-Agent Reinforcement Learning via State Augmentation and Consensus for Separable Dynamics

Differentiable Belief-based Opponent Shaping

Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty

Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Quantum Advantage in Multi Agent Reinforcement Learning

Randomness is sometimes necessary for coordination

Submit Feedback