arXiv

Enforcing Constraints in Generative Sampling via Adaptive Correction Scheduling

arXiv cs.LG ↗ · 13h ago Cached

This research paper introduces adaptive correction scheduling for enforcing hard constraints in generative sampling, demonstrating that it improves the cost-accuracy frontier compared to terminal or stepwise projection methods.

Measuring Five-Nines Reliability: Sample-Efficient LLM Evaluation in Saturated Benchmarks

arXiv cs.LG ↗ · 13h ago Cached

This paper proposes a sample-efficient framework using the cross-entropy method to estimate extreme reliability ('five-nines') in LLMs, addressing the limitations of standard benchmarks in detecting rare failures.

The Scaling Law of Evaluation Failure: Why Simple Averaging Collapses Under Data Sparsity and Item Difficulty Gaps, and How Item Response Theory Recovers Ground Truth Across Domains

arXiv cs.LG ↗ · 13h ago Cached

This paper argues that simple averaging in AI benchmarks fails under data sparsity and difficulty heterogeneity, proposing Item Response Theory (IRT) as a robust alternative to recover ground truth rankings.

FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry

arXiv cs.LG ↗ · 13h ago Cached

This paper investigates the geometric structure of intermediate feature representations in deep neural networks by analyzing how various image manipulations map in feature space. It suggests that feature spaces are organized in linear structures to a first approximation, using generative image editing models to probe these representations.

Variational Linear Attention: Stable Associative Memory for Long-Context Transformers

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces Variational Linear Attention (VLA), a method that stabilizes memory states in linear attention mechanisms for long-context transformers. VLA reframes memory updates as an online regularized least-squares problem, proving bounded state norms and demonstrating significant speedups and improved retrieval accuracy over standard linear attention and DeltaNet.

Deep Learning for Protein Complex Prediction and Design

arXiv cs.LG ↗ · 13h ago Cached

This PhD thesis introduces deep learning methods for protein complex prediction and design, including GLINTER for contact prediction, ESMPair for homolog pairing, and RedNet for binder design.

CATS: Cascaded Adaptive Tree Speculation for Memory-Limited LLM Inference Acceleration

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces CATS, a cascaded adaptive tree speculation framework designed to accelerate LLM inference on memory-constrained edge devices by optimizing memory usage while maintaining high token acceptance rates.

Muon is Not That Special: Random or Inverted Spectra Work Just as Well

arXiv cs.LG ↗ · 13h ago Cached

This paper challenges the geometric justification for the Muon optimizer, arguing that precise structure is less important than step-size optimality. It introduces Freon and Kaon optimizers to demonstrate that random or inverted spectra can perform as well as Muon.

Oversmoothing as Representation Degeneracy in Neural Sheaf Diffusion

arXiv cs.LG ↗ · 13h ago Cached

This paper analyzes oversmoothing in Neural Sheaf Diffusion (NSD) as a representation degeneracy phenomenon using quiver theory and Geometric Invariant Theory. It proposes moment-map-inspired regularizers and explores non-uniform stalk dimensions to mitigate this issue in heterophilic graph benchmarks.

Optimistic Dual Averaging Unifies Modern Optimizers

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces SODA, a generalization of Optimistic Dual Averaging that unifies various modern optimizers like Muon and Lion. It proposes a practical wrapper that improves performance across different scales without requiring additional hyperparameter tuning for weight decay.

Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces Asymmetric Langevin Unlearning (ALU), a framework that leverages public data to improve the privacy-utility trade-off in machine unlearning. It demonstrates that ALU reduces unlearning costs and enables mass unlearning while maintaining high model utility.

COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces COSMOS, a model-agnostic personalized federated learning framework that uses clustered server models and pseudo-label-only communication. It provides theoretical analysis showing exponential personalization risk contraction and demonstrates superior performance over existing baselines in heterogeneous environments.

Interpretability Can Be Actionable

arXiv cs.LG ↗ · 13h ago Cached

This position paper argues that interpretability research should be evaluated based on actionability—the extent to which insights enable concrete decisions and interventions. The authors propose a framework with evaluation criteria aligned with practical outcomes to address the lack of real-world impact in current interpretability work.

CORE: Cyclic Orthotope Relation Embedding for Knowledge Graph Completion

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces CORE, a new knowledge graph completion model that uses cyclic orthotope relation embeddings on a torus manifold to address boundary constraints in region-based models. Experiments show competitive performance in link prediction tasks.

Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models

arXiv cs.LG ↗ · 13h ago Cached

This paper proposes Spectra, a method using spectral occupancy to analyze and control the realized capacity of latent graph models, arguing that rank is not equivalent to model capacity.

Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training

arXiv cs.LG ↗ · 13h ago Cached

This paper analyzes spurious correlation learning in preference optimization methods like DPO, identifying mechanisms such as mean spurious bias and causal-spurious leakage. It proposes 'tie training' using equal-utility preference pairs as a mitigation strategy to reduce reliance on spurious features without degrading causal learning.

Steerable Neural ODEs on Homogeneous Spaces

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces steerable neural ordinary differential equations on homogeneous spaces, providing a geometric framework for learning continuous-time equivariant dynamics.

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces HEPA, a self-supervised architecture for predicting rare critical events in time series using a Joint-Embedding Predictive Architecture (JEPA) pretraining strategy. It demonstrates superior performance across multiple domains with significantly less labeled data and fewer tuned parameters than leading models.

Language Modeling with Hyperspherical Flows

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces S-FLM, a novel flow-based language model that operates in a hyperspherical latent space to address the computational costs and semantic limitations of existing discrete diffusion and continuous flow models.

GRAFT-ATHENA: Self-Improving Agentic Teams for Autonomous Discovery and Evolutionary Numerical Algorithms

arXiv cs.LG ↗ · 13h ago Cached

This paper introduces GRAFT-ATHENA, a self-improving agentic framework that autonomously discovers and evolves numerical algorithms for scientific problems. It demonstrates near-machine-precision accuracy on physics-informed machine learning benchmarks and successfully tackles complex engineering challenges.
