statistical-inference

#statistical-inference

On design-unbiased algorithmic Machine Learning

arXiv cs.LG ↗ · yesterday Cached

This paper investigates conditions for algorithmic machine learning (e.g., kNN, random forest) to achieve design-unbiased prediction and classification for finite populations, using probability sampling designs rather than assumed data models. It extends design-based inference from survey sampling to ML algorithms.

0 favorites 0 likes

#statistical-inference

Sequential statistical inference for Large Language Models: Representation, validity, and monitoring

arXiv cs.LG ↗ · 2026-06-09 Cached

This paper argues for a sequential inference framework to enhance LLM trustworthiness by modeling interactions as dependent stochastic processes, ensuring validity under repeated use, and enabling online monitoring for behavioral shifts.

0 favorites 0 likes

#statistical-inference

Heuristic Pathologies and Further Variance Reduction via Uncertainty Propagation in the AIVAT Family of Techniques

arXiv cs.AI ↗ · 2026-05-15 Cached

This paper identifies vulnerabilities in the AIVAT variance reduction technique when the heuristic value function is not fixed prior to evaluation, and shows how to propagate heuristic uncertainty to further reduce variance, achieving a 43% reduction in the number of samples needed for statistical conclusions.

0 favorites 0 likes

#statistical-inference