Tag
This paper introduces Quantum Frog, a two-player cooperative game with a quantized-time mechanic, and uses reinforcement learning to analyze difficulty scaling, optimal strategies, and emergent cooperation between agents.
Introduces LPDS, a framework to systematically evaluate LLM robustness by scaling difficulty of logic-preserving variations, finding that performance drops up to 5x compared to random sampling and that training on harder variations improves robustness.