offline-alignment

Tag

Cards List
#offline-alignment

Offline Preference Optimization for Rectified Flow with Noise-Tracked Pairs

Hugging Face Daily Papers · 2026-05-10 Cached

This paper introduces PNAPO, an offline preference optimization framework for rectified flow models that augments preference data with noise samples and uses dynamic regularization to improve training efficiency and sample efficiency.

0 favorites 0 likes
← Back to home

Submit Feedback