self-teaching

Tag

Cards List
#self-teaching

@SOURADIPCHAKR18: We describe early experiments on *pedagogical RL*: A bitter-lesson-pilled paradigm of *training* privileged self-teache…

X AI KOLs Following · yesterday Cached

Introduces pedagogical RL, a paradigm where privileged self-teachers are trained to generate correct and easy-to-follow rollouts, showing it is a relatively easy RL problem.

0 favorites 0 likes
← Back to home

Submit Feedback