diffusion-language-model

Tag

Cards List
#diffusion-language-model

I built a diffusion language model from scratch. It writes flawless sentences that mean nothing, and that is the interesting part.

Reddit r/AI_Agents · 2026-06-08

The author built Joey, a 170M parameter masked diffusion language model from scratch, trained on FineWeb-Edu and fine-tuned on DailyDialog, achieving fluent but incoherent sentences due to capacity limitations. The project highlights the differences from autoregressive LLMs and the lessons learned from building and debugging the system.

0 favorites 0 likes
#diffusion-language-model

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Hugging Face Blog · 2026-05-23 Cached

NVIDIA introduces Nemotron-Labs Diffusion, a family of diffusion language models that generate text in parallel and iteratively refine it, offering faster generation and the ability to revise previous tokens.

0 favorites 0 likes
#diffusion-language-model

@probablynotaz9: Solo-author ICML paper alert Ever wanted to post-train your diffusion LLM with good old policy gradients, without havin…

X AI KOLs Following · 2026-05-09 Cached

This solo-author ICML paper introduces Amortized Group Relative Policy Optimization (AGRPO) to enable effective reinforcement learning post-training for diffusion language models.

0 favorites 0 likes
#diffusion-language-model

Chainwash: Multi-Step Rewriting Attacks on Diffusion Language Model Watermarks

arXiv cs.CL · 2026-05-08 Cached

This research paper introduces Chainwash, a multi-step rewriting attack that effectively removes statistical watermarks from diffusion language model (LLaDA-8B-Instruct) outputs, reducing detection rates from 87.9% to 4.86% after five chained rewrites.

0 favorites 0 likes
← Back to home

Submit Feedback