minimax-game

#minimax-game

Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning

arXiv cs.LG ↗ · 2026-05-29 Cached

This paper proposes a strategic robustness objective for learning simulators in model-based reinforcement learning, formulated as a minimax game between a model player and an adversarial policy player. Theoretical guarantees and a provably convergent algorithm are provided, with experiments showing reduced prediction error and improved real-world policy transfer.

0 favorites 0 likes

#minimax-game

The Distillation Game: Adaptive Attacks & Efficient Defenses

Hugging Face Daily Papers ↗ · 2026-05-29 Cached

This paper studies distillation attacks where model outputs can enable imitation, proposing a minimax game framework and a forward-pass-only defense called Product-of-Experts, showing that adaptive students recover more capability than passive evaluation suggests.

0 favorites 0 likes

minimax-game

Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning

The Distillation Game: Adaptive Attacks & Efficient Defenses

Submit Feedback