Skip to main content

Descriptive Statistics

Measures of Dispersion โ€“ Range, Variance, Standard Deviation, Interquartile Range ๐Ÿ“

๐ŸŽฏ Learning Objectives

By the end of this lesson, you will be able to:
โœ” Define and calculate Range, Variance, Standard Deviation (SD), and Interquartile Range (IQR)
โœ” Understand their interpretation in medical data
โœ” Know why dispersion is important alongside central tendency


1๏ธโƒฃ Introduction

Knowing the average (mean, median) is not enough. We also need to know how spread out the data is.
This is called dispersion. Two datasets can have the same mean but very different spreads, which can change interpretation in clinical research.

Common measures of dispersion:
โœ… Range
โœ… Variance
โœ… Standard Deviation (SD)
โœ… Interquartile Range (IQR)


2๏ธโƒฃ Range

Definition:

  • The difference between the largest and smallest value in the dataset.

โœ” Formula:

โœ” Example:
BP readings: 110, 120, 130, 140, 150
Range = 150 โ€“ 110 = 40 mmHg

โœ” Key Features:

  • Simple but affected by outliers
  • Gives only the extreme spread, not the distribution

3๏ธโƒฃ Variance

Definition:

  • The average of the squared differences from the mean.

โœ” Formula:

โœ” Example:
If mean = 100, values = 90, 100, 110
Differences = -10, 0, +10 โ†’ Squares = 100, 0, 100
Variance = (100 + 0 + 100)/3 = 66.67

โœ” Key Features:

  • Always positive
  • Units are squared (not very interpretable clinically)

4๏ธโƒฃ Standard Deviation (SD)

Definition:

  • The square root of the variance.
  • Indicates how much values typically deviate from the mean.

โœ” Formula:

โœ” Example:
From above, Variance = 66.67 โ†’ SD = โˆš66.67 โ‰ˆ 8.16

โœ” Key Features:

  • Most widely used measure of dispersion
  • Used in confidence intervals, normal distribution

โœ” Clinical Interpretation:

  • In a normal distribution:
    • 68% values lie within ยฑ1 SD
    • 95% values within ยฑ2 SD
    • 99.7% values within ยฑ3 SD

5๏ธโƒฃ Interquartile Range (IQR)

Definition:

  • The range between the 25th percentile (Q1) and 75th percentile (Q3).
  • It captures the middle 50% of the data, removing extreme values.

โœ” Formula:

โœ” Example:
Data: 5, 7, 8, 10, 12, 13, 15
Q1 = 7, Q3 = 13 โ†’ IQR = 13 โ€“ 7 = 6

โœ” Key Features:

  • Not affected by outliers
  • Best for skewed data

6๏ธโƒฃ Comparison Table

MeasureFormulaAffected by Outliers?Use Case
RangeMax โ€“ Minโœ… YesQuick estimate of spread
VarianceAverage of squared differencesโœ… YesBasis for SD
Standard DeviationโˆšVarianceโœ… YesMost common in research
IQRQ3 โ€“ Q1โŒ NoBest for skewed data

7๏ธโƒฃ Clinical Relevance

  • Range: Quick spread in BP or lab values
  • SD: Used in reporting clinical trial results
  • IQR: Used in non-parametric statistics, skewed hospital stay data

8๏ธโƒฃ Quick Check: Test Your Understanding โœ…

Q1: Which measure is most affected by extreme values?
a) Range
b) SD
c) IQR
๐Ÿ‘‰ Answer: a) Range

Q2: If Q1 = 20 and Q3 = 30, what is IQR?
๐Ÿ‘‰ Answer: 10

Q3: Which measure is best for skewed data?
๐Ÿ‘‰ Answer: IQR


โœจ Key Takeaways

  • Range: Simple but outlier-sensitive
  • Variance: Basis for SD, not easily interpretable
  • Standard Deviation: Most widely used, key for normal distribution
  • IQR: Best for skewed data, ignores extremes