distributional-analysis

Tag

Cards List
#distributional-analysis

SFT, RL, and On-Policy Distillation Through a Distributional Lens (19 minute read)

TLDR AI · 2026-05-11 Cached

This article analyzes post-training methods for language models through a distributional perspective, comparing how SFT, RL, and on-policy distillation reshape model distributions and impact phenomena like catastrophic forgetting.

0 favorites 0 likes
← Back to home

Submit Feedback