FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

arXiv cs.CL 05/21/26, 04:00 AM Papers

Summary

FlowLM introduces a flow matching language model derived from pre-trained diffusion models via efficient fine-tuning, enabling high-quality few-step text generation that rivals 2,000-step diffusion sampling with far fewer training epochs.

arXiv:2605.20199v1 Announce Type: new Abstract: We present FlowLM, a flow matching language model transformed from pre-trained diffusion language models via efficient fine-tuning. By re-aligning the curved sampling trajectories of diffusion models into straight-line flows, FlowLM enables high quality few-step generation that rivals or even outperforms the quality of 2,000-step diffusion sampling with very few training epochs. Remarkably, finetuned FlowLM reaches performance saturation with only half as many training epochs as training from scratch, both approaches greatly outperforming the original diffusion model, thereby validating our method. Furthermore, we validate a more effective training objective for flow matching: predicting clean data to consistently guide the sampling process towards the true data distribution. Empirical results demonstrate that our approach is highly effective for high-quality, few-step text generation.

Original Article

View Cached Full Text

Cached at: 05/21/26, 06:31 AM

# FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation
Source: [https://arxiv.org/abs/2605.20199](https://arxiv.org/abs/2605.20199)
[View PDF](https://arxiv.org/pdf/2605.20199)

> Abstract:We present FlowLM, a flow matching language model transformed from pre\-trained diffusion language models via efficient fine\-tuning\. By re\-aligning the curved sampling trajectories of diffusion models into straight\-line flows, FlowLM enables high quality few\-step generation that rivals or even outperforms the quality of 2,000\-step diffusion sampling with very few training epochs\. Remarkably, finetuned FlowLM reaches performance saturation with only half as many training epochs as training from scratch, both approaches greatly outperforming the original diffusion model, thereby validating our method\. Furthermore, we validate a more effective training objective for flow matching: predicting clean data to consistently guide the sampling process towards the true data distribution\. Empirical results demonstrate that our approach is highly effective for high\-quality, few\-step text generation\.

## Submission history

From: Runzhe Zhang \[[view email](https://arxiv.org/show-email/b36be57c/2605.20199)\] **\[v1\]**Mon, 6 Apr 2026 10:36:22 UTC \(3,537 KB\)

FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

Similar Articles

LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling

Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space

Language Modeling with Hyperspherical Flows

TextLDM: Language Modeling with Continuous Latent Diffusion

Continuous Latent Diffusion Language Model

Submit Feedback

Similar Articles

LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling

Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space

Language Modeling with Hyperspherical Flows

TextLDM: Language Modeling with Continuous Latent Diffusion

Continuous Latent Diffusion Language Model