metric-gradient

#metric-gradient

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

arXiv cs.LG ↗ · 2026-05-20

Introduces HPML, a method that projects the joint update field of multi-agent systems onto a metric-gradient component to stabilize and improve multi-agent reinforcement learning. It provides theoretical guarantees and shows improved stability and returns on CTDE benchmarks.

0 favorites 0 likes

metric-gradient

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

Submit Feedback