rollout-slim

Tag

Cards List
#rollout-slim

Not All Transitions Matter: Evidence from PPO

arXiv cs.LG · 2026-05-26 Cached

This paper investigates the temporal correlation problem in on-policy reinforcement learning with PPO, showing that randomly dropping a fixed fraction of transitions from rollouts reduces gradient redundancy and stabilizes training without degrading performance.

0 favorites 0 likes
← Back to home

Submit Feedback