information-theory

#information-theory

Information-Theoretic Classifier-Free Guidance with Adaptive Schedule Optimization

arXiv cs.LG ↗ · yesterday Cached

Proposes an information-theoretic framework for optimizing classifier-free guidance schedules in diffusion models, achieving improved trade-offs between condition consistency and sample diversity on ImageNet and COCO benchmarks.

0 favorites 0 likes

#information-theory

@rohanpaul_ai: This paper argues that intelligence is the ability to make rare but valid futures more likely. So an intelligent system…

X AI KOLs Following ↗ · yesterday Cached

This paper proposes a thermodynamic measure of intelligence, defining intelligence as the ability to make rare but valid futures more likely. It introduces a metric called 'rare-valid lift' that quantifies how much more often a system produces unlikely but acceptable outcomes compared to a passive baseline.

0 favorites 0 likes

#information-theory

Researchers used math to crack Wordle

Hacker News Top ↗ · 4d ago Cached

Researchers at Binghamton University used Shannon entropy to develop a mathematical method that solves Wordle puzzles with a 99% success rate, prioritizing informative guesses over likely answers.

0 favorites 0 likes

#information-theory

Data Compression Explained (2012)

Hacker News Top ↗ · 2026-06-16 Cached

A comprehensive book explaining data compression techniques including information theory, coding methods, modeling, and transforms, targeting programmers with math skills.

0 favorites 0 likes

#information-theory

Biological evolution and information acquisition

Hacker News Top ↗ · 2026-06-11 Cached

This article draws parallels between biological evolution and technological evolution, explaining how modularity and sexual reproduction allow populations to increase the rate of information acquisition. Simulations demonstrate that mixing genetic material accelerates the spread of beneficial mutations, analogous to how technologies build on existing components.

0 favorites 0 likes

#information-theory

Information-Theoretic Decomposition for Multimodal Interaction Learning

arXiv cs.LG ↗ · 2026-06-11 Cached

This paper presents an information-theoretic analysis of multimodal learning, revealing the need to capture sample-specific interactions, and proposes DMIL, a paradigm that explicitly models and learns from these interactions via variational decomposition and fine-tuning, achieving superior performance.

0 favorites 0 likes

#information-theory

A Geometric Profile of Semantic Information in Text: Frame-Conditional Uniqueness and a Trade-Off Triangle for Scalar Summaries

arXiv cs.CL ↗ · 2026-06-11 Cached

This paper develops a geometric framework to measure semantic content of texts using sentence embeddings, proposing a three-coordinate semantic profile (novelty, breadth, integration) and a scalar trade-off triangle, validated across synthetic categories and novels.

0 favorites 0 likes

#information-theory

Principles and Practice of Deep Representation Learning: or a Mathematical Theory of Memory

arXiv cs.LG ↗ · 2026-06-08 Cached

This book presents a mathematical theory of deep representation learning, aiming to demystify the internal mechanisms of large deep networks using optimization and information theory, making architecture design a matter of linear algebra and calculus.

0 favorites 0 likes

#information-theory

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

arXiv cs.CL ↗ · 2026-06-05 Cached

InfoShield introduces a privacy-preserving method for speech representations in mental health screening using information-theoretic optimization, reducing sensitive attribute inference while maintaining diagnostic accuracy. A novel TimeAwareMINE estimator addresses temporal-static misalignment in sequential speech.

0 favorites 0 likes

#information-theory

The Loss Is Not Enough: Sampling Conditions and Inductive Bias in Contrastive Representation Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper develops a measure-theoretic framework analyzing when contrastive learning recovers meaningful latent geometry, introducing a 'diversity condition' on positive-pair sampling and a support-corrected InfoNCE variant, with experiments validating that sampling diversity and architectural inductive bias interact critically in contrastive representation learning.

0 favorites 0 likes

#information-theory

Bayes-Sufficient Representations in Supervised Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper formalizes the concept of Bayes-sufficient representations in supervised learning, defining when a representation retains exactly the information needed for Bayes-optimal prediction under a given loss function. It introduces the Bayes quotient as a canonical loss-dependent object and connects the framework to property elicitation, illustrating distinctions between sufficiency, minimality, and excess retained information through experiments.

0 favorites 0 likes

#information-theory

Destruction is a General Strategy to Learn Generation; Diffusion's Strength is to Take it Seriously; Exploration is the Future

arXiv cs.LG ↗ · 2026-06-01 Cached

This paper presents diffusion models as part of a family of techniques that withhold information and train models to guess it, arguing that diffusion's destroying approach is flexible and advantageous, especially in data-scarce settings; it also discusses exploration problems and introduces a novel kind of probabilistic graphical model.

0 favorites 0 likes

#information-theory

InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization

arXiv cs.LG ↗ · 2026-05-27 Cached

InfoQuant introduces a train-free method, Peak Suppression Orthogonal Transformation (PSOT), to reshape activation distributions for low-bit LLM quantization, preserving 97% floating-point accuracy under W4A4KV4 and outperforming prior PTQ methods.

0 favorites 0 likes

#information-theory

Human-Centered Learning Mechanics: A Dynamical Framework for Entropy-Regulated Representation Learning

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper proposes Human-Centered Learning Mechanics (HCLM), a dynamical and information-theoretic framework for studying open and controlled learning systems. It formalizes entropy regularization through effective information force, derives convergence and generalization results, and provides a conditional interpretation of scaling-law behavior.

0 favorites 0 likes

#information-theory

LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Hugging Face Daily Papers ↗ · 2026-05-22 Cached

The paper proposes a Shannon Scaling Law that models LLM training as information transmission over a noisy channel, explaining non-monotonic performance phenomena like catastrophic overtraining and quantization-induced degradation, and demonstrating superior predictive accuracy over traditional scaling laws.

0 favorites 0 likes

#information-theory

The Expense of Seeing: Attaining Trustworthy Multimodal Reasoning Within the Monolithic Paradigm

Hugging Face Daily Papers ↗ · 2026-05-21 Cached

This paper challenges the assumption that current Vision-Language Models faithfully synthesize multimodal data, proposing an information-theoretic Modality Translation Protocol with new metrics (Toll, Curse, Fallacy of Seeing) to evaluate trustworthiness over traditional multimodal gain.

0 favorites 0 likes

#information-theory

The Measurement of the Relational Field

Reddit r/ArtificialInteligence ↗ · 2026-05-19

A new paper applies Partial Information Decomposition and Time-Delayed Mutual Information to multi-agent LLM systems, demonstrating that relational information between agents is measurable and that genuine coordination requires both differentiation and shared purpose, echoing findings from organizational psychology.

0 favorites 0 likes

#information-theory

An Information-Theoretic Criterion for Efficient Data Synthesis

arXiv cs.LG ↗ · 2026-05-19 Cached

This paper provides an information-theoretic account of when synthetic data improves or degrades LLM training, distinguishing between information-open and information-closed generation loops and explaining collapse via the data processing inequality.

0 favorites 0 likes

#information-theory

Phase Transitions in Driven Informational Systems: A Two-Field Perspective on Learning Theory and Non-Equilibrium Chemistry

arXiv cs.LG ↗ · 2026-05-19 Cached

This paper proposes a unified theoretical framework for phase transitions in deep learning (grokking, emergent capabilities) and non-equilibrium chemistry, describing both as driven informational systems governed by two gradient fields.

0 favorites 0 likes

#information-theory

When Can Human-AI Teams Outperform Individuals? Tight Bounds with Impossibility Guarantees

arXiv cs.AI ↗ · 2026-05-12 Cached

This paper derives tight theoretical bounds for human-AI teams, proving when confidence-based aggregation leads to complementarity and establishing impossibility results under specific error correlations.

0 favorites 0 likes

information-theory

Submit Feedback