Measures of Dispersion Overview
Measures of Dispersion
Measures of dispersion indicate the variability in a sample.
Range (R): Difference between the largest and smallest values in the sample.
Formula: X = XL - XS
Mean and Deviations
When the center is the arithmetic mean, deviations from it can be summarized.
The sum of deviations from the mean is always zero
Mean Deviation, Variance, and Standard Deviation
Mean Deviation: Can summarize the spread but is less common.
Sample Variance (): Average of the squared deviations from the mean.
Sample Standard Deviation (SD or ): Measures the spread around the mean.
A larger signifies more variability.
can equal 0 if there is no spread.
Units of are the same as the data units.
Degrees of Freedom
Variance is divided by (degrees of freedom) rather than .
Explanation:
The sum of deviations from the mean is always zero.
If data points are known, the last one is determined automatically.
Empirical Rule
Applicable to unimodal, moderately skewed distributions.
States:
Approximately 68% of data lies between and .
Approximately 95% lies between and .
Over 99% lies between and .
1.262cc^2c$$.
Standard deviation is preferred as it shares units with the data.
Coefficient of Variation (CV)
The CV compares relative variation between datasets with different units.
CV is defined as: s/x̅ * 100%
Remains consistent regardless of unit changes; both mean and standard deviation change similarly, keeping the ratio the same.
Example: Comparing cholesterol (mg/100 ml) with body weight (lbs) using CV.
Displaying Data
Numeric and graphic methods help summarize and present data effectively:
Numeric Summary includes measures of location (mean, median) and measures of spread (standard deviation, range).
Graphic Methods include bar graphs, stem-and-leaf plots, and box plots.