off-policy-rl


@svlevine: A new way to do off-policy RL with diffusion: if we have off-policy data, we need to figure out what the diffusion late…

X AI KOLs Following · 3d ago

A new method for off-policy reinforcement learning with diffusion models: flow reversal handles off-policy data by running the diffusion process in reverse on that data.
