Are you sure? A Comprehensive and Comprehensible Survey of Uncertainty Quantification in Symbolic Regression
Summary
A comprehensive survey on uncertainty quantification in symbolic regression, reviewing frequentist, Bayesian, and model selection approaches to address the lack of reliability support in real-world decision processes.
View Cached Full Text
Cached at: 06/08/26, 09:16 AM
# Are you sure? A Comprehensive and Comprehensible Survey of Uncertainty Quantification in Symbolic Regression Source: [https://arxiv.org/abs/2606.06567](https://arxiv.org/abs/2606.06567) [View PDF](https://arxiv.org/pdf/2606.06567) > Abstract:Symbolic regression \(SR\) is a class of methods that systematically explore the space of mathematical functions to discover models that accurately capture the underlying relationships in a dataset\. Despite recent advances in the field, a lack of support for uncertainty quantification \(UQ\) limits its adoption in real\-world decision processes\. In regression analysis, UQ provides important information about the model reliability, which can both help to avoid overfitting by accounting for uncertainty in the data, and provide insights for decision\-making\. This survey is the first to clearly address this issue, with the objective of introducing essential UQ concepts and reviewing the current literature on UQ in SR, which can be broadly organized into three research directions: frequentist, Bayesian, and model selection\. Despite its importance, UQ in SR is still underexplored, which motivates further research into reliable UQ methods for SR\. ## Submission history From: Julia Reuter \[[view email](https://arxiv.org/show-email/9bf85f8e/2606.06567)\] **\[v1\]**Thu, 4 Jun 2026 17:29:56 UTC \(187 KB\)
Similar Articles
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
This paper develops a scaling limit theory for SGLD-Gibbs to provide principled hyperparameter tuning guidance for meaningful uncertainty quantification in large-scale latent variable models.
Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering
This position paper argues that current uncertainty quantification methods for large language models are essentially unsupervised clustering, measuring internal consistency rather than external correctness, and therefore fail to detect confident hallucinations. The authors advocate for a paradigm shift to ground uncertainty in objective truth.
Uncertainty Quantification for Large Language Diffusion Models
This paper presents the first systematic study of uncertainty quantification (UQ) for Large Language Diffusion Models (LLDMs), proposing lightweight zero-shot uncertainty signals derived from the iterative denoising process and showing that LLDMs can achieve both fast inference and reliable hallucination detection with up to 100x lower computational overhead compared to sampling-based baselines.
Scalable Uncertainty Reasoning in Knowledge Graphs
This thesis proposes a modular framework for scalable uncertainty reasoning in knowledge graphs, addressing imprecise attribute values, probabilistic triple existence, and incomplete schema through tailored algebraic, logical, and geometric techniques.
Fuzzy, Neutrosophic, and Uncertain Graph Theory: Properties and Applications
This book presents a comprehensive survey of graph theory under uncertainty, covering fuzzy, neutrosophic, and uncertain graph models, their properties, extensions, and applications in decision-making, graph neural networks, and knowledge graphs.