Diffusion for generating/editing ASTs? [D]
Summary
A user proposes using diffusion models to generate or edit Abstract Syntax Trees (ASTs) to ensure syntactic correctness in code generation, contrasting this with the token-based limitations of current LLMs.
Similar Articles
Diffusion Language Models: An Experimental Analysis
A systematic experimental analysis evaluating eight state-of-the-art Diffusion Language Models across multiple benchmarks, analyzing trade-offs between generation quality and computational efficiency.
Bulding my own Diffusion Language Model from scratch was easier than I thought [P]
Developer shares a minimalist 7.5M-parameter diffusion language model trained from scratch on Shakespeare, releasing the code as a learning resource.
Language Generation as Optimal Control: Closed-Loop Diffusion in Latent Control Space
This paper reformulates language generation as a stochastic optimal control problem, addressing limitations of autoregressive and diffusion models, and proposes a closed-loop diffusion method in latent control space using Flow Matching, achieving high-fidelity generation and efficient parallel sampling.
EPIC: Efficient and Parallel Inference under CFG Constraints for Diffusion Language Models
This paper presents EPIC, an efficient framework for context-free grammar constrained decoding in diffusion language models that reduces inference time by up to 67.5% while maintaining syntactic correctness.
Self-Generated Error Training for Token Editing in Diffusion Language Models
Proposes Self-Generated T2T, a training method that aligns token editing training with inference by using the model's own predictions as error sources, improving accuracy on LLaDA2.1.