entropy

Tag

Cards List
#entropy

Making LLMs Better at Creative Writing using Entropy

Reddit r/LocalLLaMA · yesterday Cached

This article presents a technique to improve LLM creative writing by modifying the sampling process using entropy, aiming to reduce the generic 'LLM feel' in generated text.

0 favorites 0 likes
#entropy

@snowboat84: I've been pondering this for years: the relationship between statistical mechanics and AI. Statistical mechanics, using a statistical approach to molecular dynamics, reproduces the elegant fundamental theorems of thermodynamics, especially the beautiful relationships between macroscopic quantities like entropy, free energy, and of course temperature and pressure. The question is, does AI have these thermodynamic mac...

X AI KOLs Timeline · 2026-06-25 Cached

This tweet explores the relationship between statistical mechanics and artificial intelligence, citing a paper that proposes a thermodynamic theory for machine learning systems, introducing concepts like temperature, entropy, and energy, and treating the training process as a phase transition.

0 favorites 0 likes
#entropy

Beyond Entropy: Learning from Token-Level Distributional Deviations for LLM Reasoning

arXiv cs.AI · 2026-06-20 Cached

Introduces Independent Combinatorial Tokens (ICT) framework that uses Jensen-Shannon divergence between token logit distributions to identify critical branching points, preventing entropy collapse and explosion in RLVR for LLM reasoning. Achieves up to 14.9% pass@4 improvement on Qwen models.

0 favorites 0 likes
#entropy

@johnschulman2: PPO had a second wave in the LLM era for reasons unanticipated by the original paper - the importance-ratio objective f…

X AI KOLs Following · 2026-06-18 Cached

This paper reveals that the clipping mechanism in PPO and GRPO biases entropy in RLVR for LLMs: clip-low increases entropy, clip-high decreases it. The authors prove that standard clipping reduces entropy even with random rewards, and show that adjusting clip-low can prevent entropy collapse and promote exploration.

0 favorites 0 likes
#entropy

@snowboat84: Several years ago, dissipative systems and nonlinear complex systems were extremely popular in academic and cultural circles. To fully review dissipative systems, one must start with non-dissipative thermodynamics. The second law of thermodynamics (entropy law) states that everything should move towards chaos and stillness. But life grows, forests succeed, and even the large models in data centers are constantly "learning" order. …

X AI KOLs Timeline · 2026-06-18 Cached

This is a popular science article of over 25,000 characters, starting from the origin of entropy, reviewing the development of dissipative system theory, and exploring a three-level analysis of whether AI belongs to dissipative systems (hardware level, training level, static model).

0 favorites 0 likes
#entropy

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

arXiv cs.LG · 2026-06-10 Cached

This paper proposes Global-Local Uncertainty (GLU), an unsupervised single-pass score that fuses token-level local entropy with hidden-state geometric global entropy for uncertainty quantification in LLMs, showing that the two are near-orthogonal and together capture confident-but-wrong failures.

0 favorites 0 likes
#entropy

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Hugging Face Daily Papers · 2026-06-10 Cached

Bebop proposes entropy-aware multi-token prediction with rejection sampling and a novel TV loss to accelerate RL training of LLMs, achieving up to 1.8x speedup. The method addresses the degradation of acceptance rates during RL by optimizing training objectives.

0 favorites 0 likes
#entropy

When Does Multi-Agent Collaboration Help? An Entropy Perspective

arXiv cs.AI · 2026-06-08 Cached

This paper examines multi-agent systems (MAS) from an entropy perspective, analyzing intra- and inter-agent dynamics. It finds that single agents often outperform MAS and introduces the Entropy Judger algorithm to improve MAS performance.

0 favorites 0 likes
#entropy

Entropy

Lobsters Hottest · 2026-06-07 Cached

A technical blog post exploring randomness, Linux entropy, and building a tool called morerandom that uses WASM plugins to feed the system entropy pool.

0 favorites 0 likes
#entropy

Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

Hugging Face Daily Papers · 2026-06-05 Cached

This paper introduces the Eisbach log-barrier, a parameter-free weight derived from the entropy of DiT output's spatial energy distribution, which when applied to LoRA fine-tuning of Stable Audio 3 improves musical diversity and thematic development without causing mode collapse.

0 favorites 0 likes
#entropy

Fine-Tuning Improves Information Conveyance in Language Models

arXiv cs.CL · 2026-06-01 Cached

This paper introduces Canopy Entropy (CE⋆) to measure the effective size of the generation space in language models, and finds that fine-tuning reorganizes uncertainty into more informative and semantically meaningful outputs, nearly tripling the correlation between entropy rate and semantic diversity.

0 favorites 0 likes
#entropy

Entropy-KL Divergence-based Token Masking: A Novel Approach for Selective Fine-tuning of Large Language Models

arXiv cs.AI · 2026-05-29 Cached

Proposes EKSFT, a selective fine-tuning method for large language models that masks tokens with high entropy or high KL divergence from a reference model, preserving pre-trained distribution while injecting task knowledge. Experiments on mathematical reasoning benchmarks show it outperforms standard SFT and improves subsequent RL fine-tuning.

0 favorites 0 likes
#entropy

Unified Data Selection for LLM Reasoning

arXiv cs.CL · 2026-05-22 Cached

The paper proposes High-Entropy Sum (HES), a training-free metric for selecting high-quality reasoning data for LLM training, validated across SFT, RFT, and RL paradigms.

0 favorites 0 likes
#entropy

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

arXiv cs.CL · 2026-05-22 Cached

This paper investigates the phenomenon where large language models hallucinate despite having the correct answer available in their generation-time distribution. By introducing a semantic notion of answer availability, the authors show that 16-47% of instruction-tuned model hallucinations occur when the correct concept is already represented, and that this rate increases with scale. They identify that instruction tuning sharpens answer commitment, making helpfulness and confident hallucination two sides of the same coin.

0 favorites 0 likes
#entropy

Probabilistic Attribution For Large Language Models

arXiv cs.CL · 2026-05-22 Cached

This paper proposes a model-agnostic probabilistic token attribution measure for LLMs using Bayes' rule to invert next-token log probabilities, capturing the model's internal representation of token sequences and improving interpretability through entropy analysis.

0 favorites 0 likes
#entropy

DEL: Digit Entropy Loss for Numerical Learning of Large Language Models

arXiv cs.CL · 2026-05-21 Cached

This paper introduces Digit Entropy Loss (DEL), a novel loss function for numerical learning in large language models that reformulates entropy optimization to improve digit-level prediction accuracy and handle floating-point numbers, consistently outperforming existing methods on mathematical reasoning benchmarks.

0 favorites 0 likes
#entropy

Dimensional Balance Improves Large Scale Spatiotemporal Prediction Performance

arXiv cs.LG · 2026-05-20 Cached

This paper proposes a framework that uses entropy-based diagnostics to harmonize spatial and temporal feature representations, achieving substantial accuracy gains on large-scale spatiotemporal prediction tasks across urban traffic, meteorology, and epidemic datasets.

0 favorites 0 likes
#entropy

The Futility of Lava Lamps: What Random Really Means

Lobsters Hottest · 2026-05-16

An article exploring the philosophical and practical meaning of randomness, using lava lamps as a metaphor for entropy generation in computing.

0 favorites 0 likes
#entropy

production agents don't break because they're dumb. they break because nobody manages the entropy

Reddit r/AI_Agents · 2026-05-16

A reflection on how AI agents fail in production due to accumulated state issues (stale context, expired tokens, conflicting memory) rather than reasoning flaws, emphasizing the need for better state management.

0 favorites 0 likes
#entropy

What is random generation?

Lobsters Hottest · 2026-05-11 Cached

An exploration of pseudo-random number generation in computers, focusing on linear congruential generators (LCGs) and their quality visualization. The article also touches on entropy sources like Cloudflare's lava lamps and serves as a precursor to property-based testing.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback