sampling

#sampling

A Mathematical Introduction to Diffusion Models

arXiv cs.LG · 12h ago

This paper provides a proof-oriented introduction to diffusion models for graduate students, covering Langevin dynamics, score-based models, discretization, discrete diffusion, and inference-time control.

Making LLMs Better at Creative Writing using Entropy

Reddit r/LocalLLaMA · yesterday

This article presents a technique to improve LLM creative writing by modifying the sampling process using entropy, aiming to reduce the generic 'LLM feel' in generated text.

New sampler + verifier drastically improves tiny 0.5b model coding performance

Reddit r/LocalLLaMA · 2026-06-25

The paper introduces VGB, a process-guided sampling algorithm with probabilistic backtracking, which significantly improves coding performance on tiny 0.5B models by being robust to verifier errors.

A Time-Reparameterized Cumulative Intensity Extrapolation Sampler for Discrete Flow Matching

arXiv cs.LG · 2026-06-24

This paper proposes the Time-Reparameterized Cumulative Intensity Extrapolation (TR-CIE) sampler for discrete flow matching, which improves sampling quality under limited function evaluations by rescaling the time grid and reusing cached model outputs, with theoretical analysis and experiments on text and image generation.

Forget Without Compromise: Nexus Sampling for Streaming KV-Cache Eviction Under Fixed Budgets

arXiv cs.LG · 2026-06-24

Introduces Nexus Sampling, a training-free KV-cache eviction method that replaces deterministic top-k selection with weighted reservoir sampling. It improves long-context LLM inference under fixed memory budgets and matches dense-attention performance at 80% eviction.
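For readers unfamiliar with weighted reservoir sampling, the general idea can be sketched as below. This is a textbook-style variant (random key `u ** (1 / w)`, keep the `k` largest keys), not the paper's eviction method; the function name `weighted_reservoir_sample` and the use of plain Python lists are assumptions made for illustration only.

```python
import heapq
import random


def weighted_reservoir_sample(items, weights, k, rng=None):
    """Keep up to k items from a stream, favoring larger weights.

    Hypothetical helper for illustration; not taken from the paper.
    Each item gets a random key u ** (1 / w); the k largest keys survive.
    """
    rng = rng or random.Random()
    heap = []  # min-heap of (key, stream index, item)
    for i, (item, w) in enumerate(zip(items, weights)):
        if w <= 0:
            continue  # zero-weight items are never kept
        key = rng.random() ** (1.0 / w)
        if len(heap) < k:
            heapq.heappush(heap, (key, i, item))
        elif key > heap[0][0]:
            # Evict the current smallest key in favor of the new item.
            heapq.heapreplace(heap, (key, i, item))
    return [item for _, _, item in heap]


# Usage: keep 3 of 10 stream positions, weighted toward later positions.
kept = weighted_reservoir_sample(list(range(10)), [i + 1 for i in range(10)], 3)
```

Unlike deterministic top-k, low-weight items still have a nonzero chance of surviving, which is the property the summary contrasts with top-k selection.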

@mylifcc: A top-tier LLM sampling guide is here. Title: "Dummy's Guide to Modern LLM Sampling". This article thoroughly explains the sampler mechanisms in modern LLM text generation: from Temperature, Top-P/Mi…

X AI KOLs Timeline · 2026-06-21

Recommends an in-depth guide to modern LLM sampling mechanisms, covering methods such as Temperature, Top-P, and Mirostat. A useful reference for developers aiming to improve output quality.
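As a rough illustration of two of the methods named above, the following sketch combines temperature scaling with Top-P (nucleus) filtering over a list of raw logits. It is not taken from the guide; the function name `sample_top_p` and its defaults are assumptions, and Mirostat is not covered here.

```python
import math
import random


def sample_top_p(logits, temperature=1.0, top_p=0.9, rng=None):
    """Sample a token index using temperature scaling and nucleus filtering.

    Hypothetical helper for illustration; assumes temperature > 0.
    """
    rng = rng or random.Random()
    # Temperature: divide logits before the softmax (lower = sharper).
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]  # subtract max for stability
    total = sum(exps)
    probs = [e / total for e in exps]

    # Top-P: keep the smallest set of most likely tokens whose mass >= top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    nucleus, mass = [], 0.0
    for i in order:
        nucleus.append(i)
        mass += probs[i]
        if mass >= top_p:
            break

    # Draw from the nucleus in proportion to the kept probabilities.
    r = rng.random() * mass
    acc = 0.0
    for i in nucleus:
        acc += probs[i]
        if r < acc:
            return i
    return nucleus[-1]  # guard against floating-point rounding


# Usage: one dominant logit, so the nucleus contains only index 2.
token = sample_top_p([0.0, 0.0, 5.0], temperature=1.0, top_p=0.9)
```

Applying temperature before the nucleus cut means a higher temperature flattens the distribution and therefore widens the nucleus; some implementations order these steps differently.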

Emergent retokenization symmetry in large language models: phenomenology and applications

arXiv cs.CL · 2026-06-16

This paper finds that large language models exhibit a partial emergent symmetry under retokenization, that is, replacing a prompt's canonical tokenization with an alternative valid segmentation that preserves the bytes exactly. The authors use this phenomenon to probe compositional understanding and propose retokenization as a novel inference-time sampling strategy that can recover solutions not found by conventional temperature sampling.

Attention-Discounted Adaptive Sampler for Masked Diffusion Language Models

arXiv cs.CL · 2026-06-10

This paper introduces ADAS, a training-free reranking rule for parallel masked diffusion decoding that uses attention to discount tokens that strongly attend to uncertain positions, improving low-NFE performance on reasoning and code tasks with minimal runtime overhead.

@lateinteraction: very cool work !!

X AI KOLs Timeline · 2026-05-29

Guowei Xu discusses limitations of Best-of-N and tree search methods for LLMs on hard reasoning problems, noting sparse verification signals and that candidates remain within the model's distribution.

Hierarchical Variational Policies for Reward-Guided Diffusion

arXiv cs.LG · 2026-05-22

Proposes a hierarchical variational policy framework for reward-guided diffusion, enabling high-quality sampling with reduced inference cost. Achieves a strong quality-speed tradeoff on tasks such as super-resolution.

Lossless Anti-Distillation Sampling

arXiv cs.LG · 2026-05-20

This paper proposes Lossless Anti-Distillation Sampling (LADS), a novel sampling scheme that counters multi-account distillation by correlating responses across accounts while preserving exact statistical fidelity for individual benign users. Theoretical analysis and experiments show LADS degrades distilled student performance on image, math, and code generation.

Drifting Objectives for Refining Discrete Diffusion Language Models

arXiv cs.CL · 2026-05-20

This paper introduces TokenDrift, a drifting objective that refines discrete diffusion language models by lifting categorical predictions to a continuous semantic space for anti-symmetric drifting, significantly improving generation quality under a fixed number of denoising steps.

Don't Stop Me Yet: Sampling Loss Minima via Dissipative Riemannian Mechanics

arXiv cs.LG · 2026-05-18

This paper introduces DiMS, a dynamical system sampler that guarantees exact sampling from the submanifold of minimum loss solutions in neural networks, enabling better uncertainty quantification in Bayesian inference.

Synthesizing POMDP Policies: Sampling Meets Model-checking via Learning

arXiv cs.AI · 2026-05-15

This paper presents a novel framework for synthesizing finite-state controllers for Partially Observable Markov Decision Processes (POMDPs) by integrating sampling, automata learning, and model-checking. The approach provides formal guarantees for threshold-safety problems that elude existing formal synthesis tools.

Sampling More, Getting Less: Calibration is the Diversity Bottleneck in LLMs

arXiv cs.CL · 2026-05-13

This paper introduces a validity-diversity framework attributing diversity collapse in LLMs to order and shape miscalibration during decoding, validated across 14 language models.

Optimizing Tail Sampling in OpenTelemetry with Retroactive Sampling

Hacker News Top · 2026-04-18

VictoriaMetrics presented retroactive sampling at KubeCon EU 2026, a new method that significantly reduces traffic, CPU, and memory overhead compared to traditional tail sampling in OpenTelemetry pipelines.
