marl

#marl

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

arXiv cs.LG · 2026-05-20

Introduces HPML, a method that projects the joint update field of multi-agent systems onto a metric-gradient component to stabilize multi-agent reinforcement learning. It provides theoretical guarantees and reports improved stability and returns on centralized-training, decentralized-execution (CTDE) benchmarks.

#marl

Macro-Action Based Multi-Agent Instruction Following through Value Cancellation

arXiv cs.AI · 2026-05-14

Proposes MAVIC, a method for multi-agent reinforcement learning that corrects value estimates at instruction boundaries to enable compliance with external natural language instructions while preserving base task performance.
