off-policy-rl


@svlevine: A new way to do off-policy RL with diffusion: if we have off-policy data, we need to figure out what the diffusion late…

X AI KOLs Following ↗ · 3d ago Cached

A new method for off-policy reinforcement learning with diffusion models: it handles off-policy data through flow reversal, running the diffusion process in reverse on that data.
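The post does not spell out the algorithm, so the following is only a generic illustration of what "reversing" a flow can mean, not the authors' method. It assumes a deterministic flow defined by a velocity field and integrates it with plain Euler steps, first forward from a starting point to a sample, then backward from that sample to approximately recover the starting point. The names `velocity` and `integrate`, the linear toy field, and the step count are all hypothetical choices made for this sketch.

```python
import math


def velocity(x, t):
    """Hypothetical toy velocity field (assumption for illustration).

    In a real diffusion or flow model this would be a learned network.
    """
    return 1.0 * x


def integrate(x, t_start, t_end, n_steps=1000):
    """Euler-integrate dx/dt = velocity(x, t) from t_start to t_end.

    Passing t_start > t_end makes the step size negative, which runs
    the same flow in reverse.
    """
    dt = (t_end - t_start) / n_steps
    t = t_start
    for _ in range(n_steps):
        x = x + dt * velocity(x, t)
        t += dt
    return x


x0 = 0.5                                # starting point of the flow
x1 = integrate(x0, 0.0, 1.0)            # forward: start -> sample
x0_recovered = integrate(x1, 1.0, 0.0)  # reverse: sample -> start

print(x1, x0_recovered)
```

With Euler steps the round trip is only approximate; the gap between `x0` and `x0_recovered` shrinks as `n_steps` grows.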


