flow-reversal

#flow-reversal

@svlevine: A new way to do off-policy RL with diffusion: if we have off-policy data, we need to figure out what the diffusion late…

X AI KOLs Following ↗ · 3d ago Cached

Introduces a method for off-policy reinforcement learning with diffusion models: flow reversal handles off-policy data by running the diffusion process in reverse on that data.


#flow-reversal

@aditya_oberai: What if we treat flow steps as RL actions? Combined with our “flow reversal” technique, this leads to a really clean & …

X AI KOLs Timeline ↗ · 3d ago Cached

Proposes treating flow steps as RL actions and combining this with a 'flow reversal' technique for flow-based offline reinforcement learning.


