statistical-inference

Tag

Cards List
#statistical-inference

On design-unbiased algorithmic Machine Learning

arXiv cs.LG · yesterday Cached

This paper investigates conditions for algorithmic machine learning (e.g., kNN, random forest) to achieve design-unbiased prediction and classification for finite populations, using probability sampling designs rather than assumed data models. It extends design-based inference from survey sampling to ML algorithms.

0 favorites 0 likes
#statistical-inference

Sequential statistical inference for Large Language Models: Representation, validity, and monitoring

arXiv cs.LG · 2026-06-09 Cached

This paper argues for a sequential inference framework to enhance LLM trustworthiness by modeling interactions as dependent stochastic processes, ensuring validity under repeated use, and enabling online monitoring for behavioral shifts.

0 favorites 0 likes
#statistical-inference

Heuristic Pathologies and Further Variance Reduction via Uncertainty Propagation in the AIVAT Family of Techniques

arXiv cs.AI · 2026-05-15 Cached

This paper identifies vulnerabilities in the AIVAT variance reduction technique when the heuristic value function is not fixed prior to evaluation, and shows how to propagate heuristic uncertainty to further reduce variance, achieving a 43% reduction in the number of samples needed for statistical conclusions.

0 favorites 0 likes
#statistical-inference

Statistical Inference and Quality Measures of KV Cache Quantisations Inspired by TurboQuant

arXiv cs.LG · 2026-05-12 Cached

This paper analyzes KV cache quantization schemes inspired by TurboQuant, using statistical inference and a new 6D error framework to evaluate quality measures like KL divergence and geometric error.

0 favorites 0 likes
#statistical-inference

Adaptive auditing of AI systems with anytime-valid guarantees

arXiv cs.AI · 2026-05-11 Cached

This paper introduces a statistical framework for adaptively auditing AI systems using Safe Anytime-Valid Inference (SAVI) to draw rigorous conclusions with limited data. It proposes a 'testing by betting' approach to validate model robustness while controlling type-I errors during adaptive sampling.

0 favorites 0 likes
← Back to home

Submit Feedback