rollout-slim

#rollout-slim

Not All Transitions Matter: Evidence from PPO

arXiv cs.LG ↗ · 2026-05-26 Cached

This paper investigates the temporal correlation problem in on-policy reinforcement learning with PPO, showing that randomly dropping a fixed fraction of transitions from rollouts reduces gradient redundancy and stabilizes training without degrading performance.

0 favorites 0 likes

rollout-slim

Not All Transitions Matter: Evidence from PPO

Submit Feedback