90% of the T Distribution

Hacker News Top News

Summary

Explains the Student's t-distribution correction for small sample confidence intervals, providing a memorizable table for 90% intervals and a rule-of-thumb for estimating standard deviation from two samples.

No content available
Original Article
View Cached Full Text

Cached at: 05/31/26, 01:30 AM

# 90 % of the t distribution Source: [https://entropicthoughts.com/ninety-percent-of-the-t-distribution](https://entropicthoughts.com/ninety-percent-of-the-t-distribution) ![ninety-percent-of-the-t-distribution.jpg](https://entropicthoughts.com/image/banner/ninety-percent-of-the-t-distribution.jpg) William Sealy Gosset was great\. He improved beer at Guinness by using the statistics that existed at the time\. Not happy with that, he invented new statistics to brew even better beer\. The things he invented are used all over the place now, but Guinness wanted to keep him a secret weapon, so they made him publish his results under the fake name*Student*\. One thing Gosset realised is that it is wrong to compute 90 % confidence intervals for the mean by taking the standard deviation of the sample, and*assume a normal distribution*, like\-a\-so: \\\[\\hat\{\\mu\} \\pm 1\.645 \\hat\{\\sigma\}\\\] When we do this we get too narrow a range, because while we recognise \\\(\\hat\{\\mu\}\\\) is just an approximation, we are assuming we know \\\(\\sigma = \\hat\{\\sigma\}\\\) with certainty\! Gosset came up with correction tables based on the number of samples used in the estimation of the confidence interval, to account for our uncertainty in the estimation of \\\(\\hat\{\\sigma\}\\\)\. Here are some useful values, rounded to be easier to memorise: Number of samplesCorrection factor for 90 % interval24×32×41\.5×51\.3×6–81\.2×9–201\.1×To use this table, count how many samples the estimation of the standard deviation is based on, multiply the estimation of the standard deviation \\\(\\hat\{\\sigma\}\\\) with the correction factor, and then multiply again with 1\.645 to get a 90 % interval\. If the number of samples is greater than 20, the naïve estimation of the standard deviation is good enough for a 90 % interval\. Thus, if we have 7 samples and these have lead us to estimate a mean of 32 minutes with a standard deviation of 8 minutes, we should not think of the 90 % confidence interval as \\\[ 32 \\pm 8×1\.645\\\] but rather as \\\[32 \\pm 8×1\.2×1\.645\\\] Already with 7 samples, the actual 90 % confidence interval is fairly close to the naïve one, being only a factor of 1\.2 too narrow\. With fewer samples, the uncertainty in the standard deviation is larger, so we should estimate a similarly wider confidence interval\.11A stronger confidence interval, like the 95 % or even 99 % interval will be correspondingly much wider after the Student t correction\. This is the table for 90 % intervals because that’s what I need most often\. Gosset didn’t actually come up with any specific approximation table; he came up with the entire*Student’s t*distribution which lets us create any table of correction factors we need\. ## [Variation from just two values](https://entropicthoughts.com/ninety-percent-of-the-t-distribution#variation-from-just-two-values) Although the above table is what you need for getting a 90 % confidence interval, we can also use a similar technique to get a sloppy estimation of the standard deviation based on just two samples\. The sample standard deviation of two values is given by \\\[\\frac\{\\left\(\\mathrm\{high\} \- \\mathrm\{low\}\\right\)\}\{\\sqrt\{2\}\}\\\] This massively underestimates the actual standard deviation, because it is based on just two values\. But one standard deviation corresponds to a t score of 1\.846, so we can multiply the above by that, and we get a better approximation of the standard deviation\. If we round the constant factors for convenience, we’ll find that the appropriate estimation of the standard deviation \(corrected through the t distribution\) is 1\.3 times the distance between the two numbers we have\. That’s incredibly useful in practice\! ## [Example of how to use it](https://entropicthoughts.com/ninety-percent-of-the-t-distribution#example-of-how-to-use-it) I’m sure you’ve been in a situation where someone has asked something like “Is 49 litres a good result?” You don’t know, of course, so you ask “Compared to what?” Maybe they respond “Compared to 43 litres\!” That sounds impressive, but you don’t want me to chastise you, so you say, “That still tells me nothing because I don’t know the variation inherent in the process\. Give me another typical result\!” They might then say “Uhh, 47 litres\.” Now you let your guard down and think, “Oh, 49 is above both the typical results\. Very good\!” And then i chastise you\! So you turn on your brain instead\. You have received two typical numbers: 43 and 47\. These don’t tell you much about how the inherent variation, but they do tell you a little\. The distance between them is four\. If we multiply that by 1\.3, we get our estimation of the standard deviation, which is something like 5 litres\. That means 49 litres is less than one standard deviation away from the midpoint of 45 litres\. That’s a normal result, not unusually good or bad\.

Similar Articles

TabularMath: Understanding Math Reasoning over Tables with Large Language Models

arXiv cs.CL

TabularMath introduces a benchmark and AutoT2T framework for evaluating LLMs' mathematical reasoning over tabular data, revealing that table complexity, data quality, and modality significantly impact model performance. The study addresses a gap in LLM evaluation by systematically assessing robustness to incomplete or inconsistent table information in real-world scenarios.

Behavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Prediction

arXiv cs.AI

This paper proposes behavior-aware auxiliary corrections for off-policy temporal-difference prediction, introducing BA-TDC and BA-TDRC algorithms that replace the auxiliary covariance matrix with the behavior Bellman matrix to improve stability and convergence. Theoretical analysis and experiments on standard benchmarks validate the effectiveness of the proposed methods.

[R] Measuring the Symmetry--Data Exchange Rate

Reddit r/MachineLearning

This paper empirically measures the symmetry–data exchange rate predicted by equivariance theory, finding that wrong-group symmetry constraints are actively harmful, augmentation with test-time orbit averaging matches equivariant architectures, and the theoretical |G|-fold sample complexity reduction is only weakly confirmed with wide confidence intervals. The study is explicitly exploratory and not pre-registered.

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

Hugging Face Daily Papers

This paper identifies a Signal-to-Noise Ratio timestep (SNR-t) bias in diffusion probabilistic models during inference, where SNR-timestep alignment from training is disrupted at inference time. The authors propose a differential correction method that decomposes samples into frequency components and corrects each separately, improving generation quality across models like IDDPM, ADM, DDIM, EDM, and FLUX with minimal computational overhead.

Consistently Informative Soft-Label Temperature for Knowledge Distillation

arXiv cs.LG

Proposes CIST, a method that assigns separate sample-wise adaptive temperatures to teacher and student in knowledge distillation, producing consistently informative soft labels and relaxing rigid logit-scale matching. Experiments on vision and language tasks show consistent improvements over standard KD.