Topic 9 Dispersion
NCC Education - Dispersion Topic 9 Notes
Introduction to Dispersion
Dispersion refers to the spread of values around a central point in a data set.
This topic covers the following key concepts:
Range
Inter-Quartile Range (IQR)
Standard Deviation
Variance
Key Calculations
Students will learn how to:
Calculate the range, quartiles, and quantiles.
Calculate variance and standard deviation.
Measures of Central Tendency vs. Dispersion
Averages (mean, median, and mode) provide a central point of data distribution but do not convey information about the spread of data.
Example:
Mode = 5
Median = 6
Mean = 4
The data's spread could include extreme values that average out to a misleading central point.
Calculation of the Range
The range is the simplest measure of dispersion, calculated as:
Example:
Data set: 58, 14, 11, 13, 8
Highest = 58, Lowest = 8
Quartiles and Percentiles
Quartiles divide a data set into four equal parts:
First Quartile (Q1): 25th percentile (lower quartile)
Second Quartile (Q2): 50th percentile (median)
Third Quartile (Q3): 75th percentile (upper quartile)
Percentiles are used to indicate the relative standing of a value in a dataset.
Quartile Calculation Steps
To find quartiles:
Sort data in ascending order.
Identify position for each quartile using:
For Q1:
For Q2 (Median):
For Q3:
Use these positions to find quartile values in the sorted data set.
Interquartile Range (IQR)
IQR measures the middle 50% of the data:
The IQR indicates variability within the middle portion of the dataset.
Calculating Mean Deviation
Mean Deviation provides insight into how far values are from the mean:
Calculate the mean.
Use the formula:
Where ( x_i ) is each value, ( \bar{x} ) is the mean, and ( n ) is the number of items.
Variance
Variance assesses how far a set of numbers is spread out from their mean:
\text{Variance (s^2)} = \frac{\sum{i=1}^{n} (xi - \bar{x})^2}{n}This provides a measure in squared units, which sometimes requires returning to standard units for interpretation.
Standard Deviation
Standard Deviation (σ) is the square root of variance and expresses the variability in the same units as the data:
It effectively measures how distributed the data points are around the mean, with a small standard deviation indicating data points are close to the mean and a large standard deviation indicating a spread out dataset.
Coefficient of Variation
The Coefficient of Variation provides a normalized measure of dispersion without units:
This is useful for comparing relative variability between different data sets.
Common Errors
Failing to order data before determining median or quartiles can lead to incorrect calculations.
Confusing range with interquartile range can result in the wrong dispersion measures being used.
Applications of Dispersion in Real-World Contexts
Understanding variance and standard deviation is critical in fields such as finance to gauge market volatility. High variability indicates higher risk, whereas low variability suggests stability.