Expert Reviewed
Prof. Emily Rodriguez, Ph.D. MathematicsUpdated June 1, 2026Our Standards →

Last updated:

Standard Deviation Calculator

Calculate population or sample standard deviation, variance, coefficient of variation, and standard error with deviation table.

Back to Math

Standard Deviation Calculator

Ad-FreeAI-Powered

Free online standard deviation calculator — compute population and sample std dev, variance, CV, and more with step-by-step deviation tables and AI insights.

Enter values above to see results.

About This Calculator

Related Articles

📉 Standard Deviation — Complete Guide

σ
Population std deviation (known all data)
s
Sample std deviation (subset of data)
68–95–99.7
% of data within 1–2–3 std deviations (normal)
CV
Coefficient of Variation = σ/μ × 100%

Measures of Spread

MeasureFormulaBest ForUnits
RangeMax − MinQuick overview; simpleSame as data
Variance (σ²)Σ(xᵢ−μ)²/NMathematical operationsSquared units
Std Deviation (σ)√VarianceInterpretability in original unitsSame as data
IQRQ3 − Q1Robust to outliersSame as data
MADΣ|xᵢ−median|/NRobust to outliersSame as data
CVσ/μ × 100%Comparing different units/scalesUnitless (%)

Frequently Asked Questions

What is standard deviation?

Standard deviation measures how spread out data is from the mean. A low SD means data points are clustered near the mean; a high SD means they are spread out. E.g., test scores of [75, 76, 74, 75] have low SD; [20, 50, 90, 100] have high SD.

What is the difference between population and sample standard deviation?

Population SD (σ) divides by N (total count) — used when you have all data. Sample SD (s) divides by N−1 (Bessel's correction) — used when data is a sample from a larger population. Sample SD is larger, correcting for underestimation bias.

What is the 68-95-99.7 rule?

In a normal (bell-curve) distribution: ~68% of data falls within 1 SD of the mean, ~95% within 2 SD, and ~99.7% within 3 SD. A score 2 SD above the mean is at approximately the 97.7th percentile.

What is a z-score?

A z-score = (x − μ) / σ. It expresses a data point in units of standard deviation from the mean. A z-score of 2.0 means the value is 2 standard deviations above average. Used to compare values across different scales.

When should I use variance vs standard deviation?

Variance (σ²) is used in mathematical operations and statistical tests (ANOVA, regression). Standard deviation is easier to interpret because it's in the same units as the original data. For communication, always report SD; for calculations, use variance.

What is the coefficient of variation (CV)?

CV = (σ / μ) × 100%. It expresses spread as a percentage of the mean, enabling comparison across datasets with different units or scales. CV 10% is less variable than CV 30%. A dataset with mean 100 and SD 10 has CV 10%.

Explore All Math Calculators

Reviewed by CalculatorApp.me Math Team

Standard Deviation Calculator — Complete Guide

Population & sample standard deviation, variance, z-scores, and the empirical rule.

σ

Population SD

s

Sample SD

68-95-99.7

Empirical Rule

z-score

Standardization

What Is Standard Deviation?

Standard deviation measures the amount of variation or dispersion in a dataset. A low standard deviation indicates data points cluster near the mean, while a high standard deviation indicates they spread over a wider range. It is the most widely used measure of statistical spread, reported in the same units as the original data.

There are two versions: population standard deviation (σ) divides by N when the data represents the entire population, and sample standard deviation (s) divides by N−1 (Bessel's correction) to provide an unbiased estimate from sample data. In practice, sample SD is far more common since we rarely have access to entire populations.

Standard deviation is the square root of variance. While variance has desirable mathematical properties (additive for independent variables), its units are squared — making it less intuitive. Standard deviation returns to original units: a dataset of test scores in points has a SD in points, not points².

Standard Deviation Formulas

Population Standard Deviation (σ)
σ = √[ Σᵢ(xᵢ − μ)² / N ]

Step-by-step:
  1. Calculate mean: μ = Σxᵢ / N
  2. Subtract mean from each value
  3. Square each difference
  4. Sum the squares
  5. Divide by N
  6. Take square root

Example: Data = {2, 4, 4, 4, 5, 5, 7, 9}
  μ = (2+4+4+4+5+5+7+9)/8 = 40/8 = 5
  Differences: -3,-1,-1,-1,0,0,2,4
  Squares: 9,1,1,1,0,0,4,16 → sum=32
  σ² = 32/8 = 4
  σ = √4 = 2

Use σ only when you have data for the ENTIRE population — all test scores, all factory outputs, etc. This is rare in practice.

Sample Standard Deviation (s)
s = √[ Σᵢ(xᵢ − x̄)² / (n − 1) ]

Bessel's Correction: divide by (n−1)
  • Compensates for using x̄ instead of μ
  • Makes s² an unbiased estimator of σ²
  • Note: s itself is slightly biased

Example: Same data, but as sample
  x̄ = 5 (same)
  Sum of squares = 32 (same)
  s² = 32/(8−1) = 32/7 = 4.571
  s = √4.571 = 2.138

As n → ∞, s → σ
  n=10: n−1=9  (10% correction)
  n=100: n−1=99 (1% correction)
  n=1000: negligible difference

Almost always use sample SD in practice. Dividing by n−1 corrects the bias from estimating the mean from the same sample.

Variance & Coefficient of Variation
Variance:
  σ² = Σ(xᵢ − μ)² / N      (population)
  s² = Σ(xᵢ − x̄)² / (n−1)  (sample)

Variance properties:
  Var(X+Y) = Var(X) + Var(Y)
    (if X,Y independent)
  Var(cX) = c² × Var(X)
  Var(X+c) = Var(X)

Coefficient of Variation (CV):
  CV = (s / x̄) × 100%

Example:
  Test A: mean=80, SD=10 → CV=12.5%
  Test B: mean=50, SD=10 → CV=20%

Same SD, but B is more variable
relative to its mean.

CV allows comparing variability between datasets with different units or scales. A stock with CV=30% is more volatile than one with CV=10% regardless of price.

Z-Score (Standard Score)
z = (x − μ) / σ   (population)
z = (x − x̄) / s   (sample)

Interpretation:
  z = 0:   at the mean
  z = 1:   1 SD above mean
  z = −2:  2 SD below mean

Example: SAT score = 1350
  Mean (μ) = 1060, SD (σ) = 195
  z = (1350 − 1060) / 195 = 1.49
  → 93.2 percentile

Standardization:
  Converts any normal distribution
  to N(0,1) — standard normal
  Enables comparison across scales

Outlier detection:
  |z| > 2: unusual (5% chance)
  |z| > 3: rare (0.3% chance)

Z-scores transform raw data into a 'how many SDs from the mean?' scale, enabling comparison between SAT, GPA, height, weight — any normally distributed variable.

Empirical Rule (68-95-99.7)

Range% of DataProbability OutsideDescriptionExample (μ=100, σ=15)
μ ± 1σ68.27%31.73%Majority of data85–115
μ ± 2σ95.45%4.55%Nearly all data70–130
μ ± 3σ99.73%0.27%Virtually all data55–145
μ ± 4σ99.9937%0.0063%Extreme outliers40–160
μ ± 5σ99.99994%0.00006%One in 1.7 million25–175
μ ± 6σ99.9999998%0.0000002%Six Sigma quality10–190

Interpreting Standard Deviation

ContextMeanSDCVInterpretation
IQ scores1001515%68% of people score 85–115
SAT scores106019518.4%Score of 1255 = +1 SD = 84th %ile
Adult male height (US)5'9"2.8"4.0%Low CV → heights cluster tightly
S&P 500 annual return~10%~16%160%High CV → volatile investment
Manufacturing (Six Sigma)targetprocess σ<0.001%3.4 defects per million
Body temperature98.6°F0.7°F0.7%Very tight distribution

History of Standard Deviation

1733

De Moivre — The Normal Curve

Abraham de Moivre derived the normal (bell) curve as an approximation to the binomial distribution. He showed that deviations from the mean follow a predictable pattern — the precursor to standard deviation.

1809

Gauss — Least Squares & Error Theory

Carl Friedrich Gauss formalized the method of least squares and the normal distribution of errors — demonstrating that measurement errors cluster around the mean with a characteristic spread. The Gaussian distribution bears his name.

1893

Pearson Coins 'Standard Deviation'

Karl Pearson first used the term 'standard deviation' in a lecture, replacing the older 'mean error' and 'probable error.' He chose 'σ' as the symbol. This standardized the terminology still used worldwide.

1908

Gosset — The t-Distribution

William Sealy Gosset (pen name 'Student') developed the t-distribution for small samples, showing that sample SD with Bessel's correction follows a different distribution than population SD. His work at Guinness Brewery birthed modern small-sample statistics.

1925

Fisher — Degrees of Freedom

R.A. Fisher rigorously proved why we divide by n−1 (Bessel's correction) using the concept of degrees of freedom — the sample mean 'uses up' one degree of freedom, leaving n−1 independent deviations.

1986

Motorola — Six Sigma

Bill Smith at Motorola developed Six Sigma — using standard deviation as a quality metric. Achieving 6σ quality means only 3.4 defects per million opportunities. This framework was adopted by GE, Toyota, and hundreds of companies worldwide.

Key Research & Data

Myths vs. Facts

Standard deviation and variance are interchangeable.

Variance (σ²) is in squared units — if data is in meters, variance is in m². Standard deviation (σ) is in the original units. They contain the same information but SD is interpretable. Variance is preferred in proofs and formulas because it's additive for independent variables.

A large standard deviation means the data is 'bad' or unreliable.

SD describes natural variation, not data quality. Human height has low SD (people are similar heights); income has high SD (enormous range). Neither is 'bad.' High SD simply means more spread. Context determines whether variation is problematic.

The empirical rule (68-95-99.7) applies to all data.

The 68-95-99.7 rule applies ONLY to normally distributed data. For skewed, bimodal, or heavy-tailed distributions, these percentages can be very different. Chebyshev's inequality provides weaker but universal bounds: at least 75% within ±2 SD for ANY distribution.

You always divide by n−1 for standard deviation.

Divide by N for population SD (when you have the entire population). Divide by n−1 for sample SD (when estimating from a sample, which is the usual case). Using the wrong denominator introduces systematic bias.

Frequently Asked Questions

When should I use population vs. sample standard deviation?
Use population SD (σ, divide by N) when your data IS the entire population — e.g., all employees' salaries. Use sample SD (s, divide by n−1) when your data is a sample from a larger population — e.g., surveying 100 of 10,000 customers.
What is Bessel's correction and why does it matter?
Dividing by n−1 instead of n compensates for the fact that the sample mean (x̄) is calculated from the same data. Since x̄ minimizes the sum of squared deviations, dividing by n underestimates the true variance. The n−1 correction makes s² an unbiased estimator of σ².
How do I interpret the coefficient of variation?
CV = (SD/mean) × 100%. It's a relative measure of spread — useful for comparing variability between datasets with different units. A CV of 10% means the SD is 10% of the mean. Lower CV = more consistent data.
Can standard deviation be negative?
Never. SD is the square root of variance (which is a sum of squares — always non-negative). SD = 0 means all data points are identical. SD > 0 always.
What is the standard error of the mean?
SEM = s/√n. It measures how precisely you've estimated the mean, not the spread of data. As sample size grows, SEM decreases (more precise estimate), while SD stays roughly constant.
How does SD relate to confidence intervals?
For a 95% confidence interval: x̄ ± 1.96×(s/√n). Note it uses SEM (s/√n), not SD alone. Wider CI = less certainty about the true mean. CI width depends on both SD and sample size.
What is the MAD (Mean Absolute Deviation)?
MAD = Σ|xᵢ−x̄|/n. It's more robust than SD because it doesn't square outliers. However, SD has better mathematical properties (additive variance, connection to normal distribution) and is the standard in most fields.
How many data points do I need for reliable SD?
Rule of thumb: 30+ for the Central Limit Theorem to apply. For the empirical rule to be meaningful, you need normally distributed data AND at least 50+ observations. With <10 points, SD is highly unstable.
What does Six Sigma mean?
Process quality where the nearest specification limit is 6 standard deviations from the mean. In a normal distribution, this allows 3.4 defects per 1,000,000 opportunities — near-perfection. The methodology was developed at Motorola in 1986.
How do outliers affect standard deviation?
Heavily. SD squares deviations, so outliers have disproportionate impact. A single outlier can dramatically inflate SD. For datasets with outliers, consider the IQR (Interquartile Range) or MAD as more robust alternatives.
Is SD meaningful for non-normal distributions?
SD always measures spread, but the 68-95-99.7 rule only applies to normal distributions. For skewed data, the median and IQR may be more informative. Chebyshev's theorem guarantees at least 89% of data within ±3 SD for ANY distribution.
How do I compare standard deviations of different samples?
Use the F-test (ratio of variances) or Levene's test to formally compare SDs. For quick comparison, use CV if means differ. If datasets are on different scales, standardize (z-scores) first.

References

Related Calculators

Explore All Math Calculators

Precision math tools for students, teachers, and professionals — CalculatorApp.me.

Browse Math Calculators →

See Also