opponent-shaping

Tag

Cards List
#opponent-shaping

Differentiable Belief-based Opponent Shaping

arXiv cs.AI · 2026-05-29 Cached

This paper introduces Differentiable Belief-based Opponent Shaping (D-BOS), a first-order method that treats observer beliefs as the shaped state and differentiates through belief update dynamics, allowing optimal strategies to emerge naturally from the environment's reward structure in hidden-role multi-agent settings.

0 favorites 0 likes
← Back to home

Submit Feedback