What distribution is used when standard deviation is unknown?

What Is a T-Distribution?

The t-distribution, also known as the Student’s t-distribution, is a type of probability distribution that is similar to the normal distribution with its bell shape but has heavier tails. It is used for estimating population parameters for small sample sizes or unknown variances. T-distributions have a greater chance for extreme values than normal distributions, and as a result have fatter tails.

The t-distribution is the basis for computing t-tests in statistics.

Key Takeaways

  • The t-distribution is a continuous probability distribution of the z-score when the estimated standard deviation is used in the denominator rather than the true standard deviation.
  • The t-distribution, like the normal distribution, is bell-shaped and symmetric, but it has heavier tails, which means it tends to produce values that fall far from its mean.
  • T-tests are used in statistics to estimate significance.

What Does a T-Distribution Tell You? 

Tail heaviness is determined by a parameter of the t-distribution called degrees of freedom, with smaller values giving heavier tails, and with higher values making the t-distribution resemble a standard normal distribution with a mean of 0, and a standard deviation of 1.

Image by Sabrina Jiang © Investopedia 2020

When a sample of n observations is taken from a normally distributed population having mean M and standard deviation D, the sample mean, m, and the sample standard deviation, d, will differ from M and D because of the randomness of the sample.

A Z-score can be calculated with the population standard deviation as Z = (x – M)/D, and this value has the normal distribution with mean 0 and standard deviation 1. But when using the estimated standard deviation, a t-score is calculated as T = (m – M)/{d/sqrt(n)}, the difference between d and D makes the distribution a t-distribution with (n - 1) degrees of freedom rather than the normal distribution with mean 0 and standard deviation 1. 

Example of How to Use a T-Distribution

Take the following example for how t-distributions are put to use in statistical analysis. First, remember that a confidence interval for the mean is a range of values, calculated from the data, meant to capture a “population” mean. This interval is m +- t*d/sqrt(n), where t is a critical value from the t-distribution.

For instance, a 95% confidence interval for the mean return of the Dow Jones Industrial Average in the 27 trading days prior to 9/11/2001, is -0.33%, (+/- 2.055) * 1.07 / sqrt(27), giving a (persistent) mean return as some number between -0.75% and +0.09%. The number 2.055, the amount of standard errors to adjust by, is found from the T distribution.

Because the t-distribution has fatter tails than a normal distribution, it can be used as a model for financial returns that exhibit excess kurtosis, which will allow for a more realistic calculation of Value at Risk (VaR) in such cases.

T-Distribution vs. Normal Distribution 

Normal distributions are used when the population distribution is assumed to be normal. The t-distribution is similar to the normal distribution, just with fatter tails. Both assume a normally distributed population. T-distributions thus have higher kurtosis than normal distributions. The probability of getting values very far from the mean is larger with a t-distribution than a normal distribution.

Normal vs. T-Distribution.

Limitations of Using a T-Distribution 

The t-distribution can skew exactness relative to the normal distribution. Its shortcoming only arises when there’s a need for perfect normality. The t-distribution should only be used when population standard deviation is not known. If the population standard deviation is known and the sample size is large enough, the normal distribution should be used for better results.

What Is the T-Distribution in Statistics?

The t-distribution is used in statistics to estimate the population parameters for small sample sizes or undetermined variances. It is also referred to as the Student’s t-distribution.

When Should the T-Distribution Be Used?

The t-distribution should be used if the population sample size is small and the standard deviation is unknown. If not, the normal distribution should be used.

What Does Normal Distribution Mean?

The Bottom Line

The t-distribution is used in statistics to estimate the significance of population parameters for small sample sizes or unknown variations. Like the normal distribution, it is bell-shaped and symmetric. Unlike normal distributions it has heavier tails, which results in a greater chance for extreme values.

What to use if the standard deviation is unknown?

Population Standard Deviation Unknown If the population standard deviation, sigma is unknown, then the mean has a student's t (t) distribution and the sample standard deviation is used instead of the population standard deviation. . The t here is the t-score obtained from the Student's t table.

When should Z distribution be used?

z-Distribution: The z-distribution, also called the standard normal distribution, is used in calculations for inference when the population standard deviation is known, or when sample sizes are large (at least 30).

What is the difference between t

The Z distribution is a special case of the normal distribution with a mean of 0 and standard deviation of 1. The t-distribution is similar to the Z-distribution, but is sensitive to sample size and is used for small or moderate samples when the population standard deviation is unknown.

What is t

The t-distribution is used as an alternative to the normal distribution when sample sizes are small in order to estimate confidence or determine critical values that an observation is a given distance from the mean.