entropy

Tag

Cards List
#entropy

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

arXiv cs.LG · 3d ago Cached

This paper proposes Global-Local Uncertainty (GLU), an unsupervised single-pass score that fuses token-level local entropy with hidden-state geometric global entropy for uncertainty quantification in LLMs, showing that the two are near-orthogonal and together capture confident-but-wrong failures.

0 favorites 0 likes
#entropy

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Hugging Face Daily Papers · 3d ago Cached

Bebop proposes entropy-aware multi-token prediction with rejection sampling and a novel TV loss to accelerate RL training of LLMs, achieving up to 1.8x speedup. The method addresses the degradation of acceptance rates during RL by optimizing training objectives.

0 favorites 0 likes
#entropy

When Does Multi-Agent Collaboration Help? An Entropy Perspective

arXiv cs.AI · 5d ago Cached

This paper examines multi-agent systems (MAS) from an entropy perspective, analyzing intra- and inter-agent dynamics. It finds that single agents often outperform MAS and introduces the Entropy Judger algorithm to improve MAS performance.

0 favorites 0 likes
#entropy

Entropy

Lobsters Hottest · 6d ago Cached

A technical blog post exploring randomness, Linux entropy, and building a tool called morerandom that uses WASM plugins to feed the system entropy pool.

0 favorites 0 likes
#entropy

Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

Hugging Face Daily Papers · 2026-06-05 Cached

This paper introduces the Eisbach log-barrier, a parameter-free weight derived from the entropy of DiT output's spatial energy distribution, which when applied to LoRA fine-tuning of Stable Audio 3 improves musical diversity and thematic development without causing mode collapse.

0 favorites 0 likes
#entropy

Fine-Tuning Improves Information Conveyance in Language Models

arXiv cs.CL · 2026-06-01 Cached

This paper introduces Canopy Entropy (CE⋆) to measure the effective size of the generation space in language models, and finds that fine-tuning reorganizes uncertainty into more informative and semantically meaningful outputs, nearly tripling the correlation between entropy rate and semantic diversity.

0 favorites 0 likes
#entropy

Entropy-KL Divergence-based Token Masking: A Novel Approach for Selective Fine-tuning of Large Language Models

arXiv cs.AI · 2026-05-29 Cached

Proposes EKSFT, a selective fine-tuning method for large language models that masks tokens with high entropy or high KL divergence from a reference model, preserving pre-trained distribution while injecting task knowledge. Experiments on mathematical reasoning benchmarks show it outperforms standard SFT and improves subsequent RL fine-tuning.

0 favorites 0 likes
#entropy

Unified Data Selection for LLM Reasoning

arXiv cs.CL · 2026-05-22 Cached

The paper proposes High-Entropy Sum (HES), a training-free metric for selecting high-quality reasoning data for LLM training, validated across SFT, RFT, and RL paradigms.

0 favorites 0 likes
#entropy

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

arXiv cs.CL · 2026-05-22 Cached

This paper investigates the phenomenon where large language models hallucinate despite having the correct answer available in their generation-time distribution. By introducing a semantic notion of answer availability, the authors show that 16-47% of instruction-tuned model hallucinations occur when the correct concept is already represented, and that this rate increases with scale. They identify that instruction tuning sharpens answer commitment, making helpfulness and confident hallucination two sides of the same coin.

0 favorites 0 likes
#entropy

Probabilistic Attribution For Large Language Models

arXiv cs.CL · 2026-05-22 Cached

This paper proposes a model-agnostic probabilistic token attribution measure for LLMs using Bayes' rule to invert next-token log probabilities, capturing the model's internal representation of token sequences and improving interpretability through entropy analysis.

0 favorites 0 likes
#entropy

DEL: Digit Entropy Loss for Numerical Learning of Large Language Models

arXiv cs.CL · 2026-05-21 Cached

This paper introduces Digit Entropy Loss (DEL), a novel loss function for numerical learning in large language models that reformulates entropy optimization to improve digit-level prediction accuracy and handle floating-point numbers, consistently outperforming existing methods on mathematical reasoning benchmarks.

0 favorites 0 likes
#entropy

Dimensional Balance Improves Large Scale Spatiotemporal Prediction Performance

arXiv cs.LG · 2026-05-20 Cached

This paper proposes a framework that uses entropy-based diagnostics to harmonize spatial and temporal feature representations, achieving substantial accuracy gains on large-scale spatiotemporal prediction tasks across urban traffic, meteorology, and epidemic datasets.

0 favorites 0 likes
#entropy

The Futility of Lava Lamps: What Random Really Means

Lobsters Hottest · 2026-05-16

An article exploring the philosophical and practical meaning of randomness, using lava lamps as a metaphor for entropy generation in computing.

0 favorites 0 likes
#entropy

production agents don't break because they're dumb. they break because nobody manages the entropy

Reddit r/AI_Agents · 2026-05-16

A reflection on how AI agents fail in production due to accumulated state issues (stale context, expired tokens, conflicting memory) rather than reasoning flaws, emphasizing the need for better state management.

0 favorites 0 likes
#entropy

What is random generation?

Lobsters Hottest · 2026-05-11 Cached

An exploration of pseudo-random number generation in computers, focusing on linear congruential generators (LCGs) and their quality visualization. The article also touches on entropy sources like Cloudflare's lava lamps and serves as a precursor to property-based testing.

0 favorites 0 likes
#entropy

KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit

Hacker News Top · 2026-04-21 Cached

A new paper proposes sequential KV cache compression using probabilistic language tries and predictive delta coding, achieving theoretical compression ratios of ~914,000× beyond TurboQuant by exploiting the sequential structure of language model tokens rather than treating vectors independently.

0 favorites 0 likes
#entropy

Measuring AI Freedom

ML at Berkeley · 2022-10-09

The article explores the concept of 'Value Freedom' in AI agents using Reinforcement Learning, framing it as a measure of unpredictability and entropy derived from Q-values.

0 favorites 0 likes
← Back to home

Submit Feedback