loss-functions

#loss-functions

Bayes-Sufficient Representations in Supervised Learning

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper formalizes the concept of Bayes-sufficient representations in supervised learning, defining when a representation retains exactly the information needed for Bayes-optimal prediction under a given loss function. It introduces the Bayes quotient as a canonical loss-dependent object and connects the framework to property elicitation, illustrating distinctions between sufficiency, minimality, and excess retained information through experiments.

0 favorites 0 likes

#loss-functions

Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

arXiv cs.LG ↗ · 2026-05-21 Cached

This paper studies symmetrization of loss functions for robust training under label noise, introducing SGCE and alpha-MAE loss functions that interpolate between multi-class unhinged loss and Mean Absolute Error, with theoretical guarantees and competitive empirical performance.

0 favorites 0 likes

#loss-functions

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

arXiv cs.LG ↗ · 2026-05-08 Cached

This paper identifies a critical 'model collapse' issue in standard fine-tuning for causal reasoning and proposes a semantic loss function with graph-based logical constraints to prevent it.

0 favorites 0 likes

#loss-functions

Losses that Cook: Topological Optimal Transport for Structured Recipe Generation

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper proposes topological optimal transport-based loss functions for improving structured recipe generation in language models, addressing the limitations of standard cross-entropy training by better handling ingredient composition, quantities, and procedural accuracy. The approach shows significant improvements on recipe-specific metrics with 62% human preference over baseline methods.

0 favorites 0 likes

loss-functions

Bayes-Sufficient Representations in Supervised Learning

Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

Losses that Cook: Topological Optimal Transport for Structured Recipe Generation

Submit Feedback