Median, Mean, and Mode
Introduction to Statistics
A statistic is a number that describes a set of data.
The mean, median, and mode are three key statistics that describe different aspects of a data set.
Each statistic provides unique information about the data.
Key Statistical Measures
Mean
The mean, often referred to as the average, is calculated by:
Formula:
Steps to find the mean:
Add all numbers in the data set to find their sum.
Divide the sum by the total number of items in the data set.
Example:
Given a set of numbers (e.g., test scores), add them together.
If there are 9 scores, divide the total sum by 9.
Resulting mean: approximately 85.6.
Median
The median is the middle number in a data set when the numbers are ordered from least to greatest.
Steps to find the median:
Order the numbers from least to greatest.
Identify the middle number. If the number of values is odd, the median is the central value.
If the number of values is even, calculate the mean of the two middle numbers.
Example:
For a data set ordered from least to greatest, if the values are (x1, x2, …, x_n):
Odd count leads to median = x_{{(n+1)/2}}.
Even count gives median = .
In a set with an odd number of scores, the median found was 88.
Interpretation: 50% of the data values lie above and below the median.
Mode
The mode is the number(s) that occurs most frequently in a data set.
Determining the mode:
It can be helpful to have the data ordered from least to greatest.
Identify which number appears most frequently.
Example:
In a test score data set, if the numbers were two 74s and two 83s, both 74 and 83 would be modes (bimodal).
However, if another number (e.g., 83) appears three times, 83 is the sole mode.
Further Examples and Comparisons
First Example
Data set: nine test scores.
Mean: Sum of scores divided by 9 = 85.6.
Median: Ordered data shows 88 as the middle value.
Mode: 74 and 83, since they occur most often (bimodal).
Second Example
Data set: twelve test scores.
Mean: Approximately 80.7 after adding all scores and dividing by 12.
Median: When ordered, values of 83 and 88 are the middle numbers, yielding a median of:
.
Mode: The number 83 occurs most frequently, making it the sole mode.
Understanding Variations Between Statistics
Notably, in the first example, the mean and median were close (85.6 vs. 88).
In contrast, the second example had a larger difference between the mean (80.7) and median (85.5).
Explanation:
A low test score can heavily influence the mean and create a notable discrepancy between the mean and median, while the median remains resilient to such extremes.
Understanding Outliers
Outliers are extreme values that can significantly skew the mean but have a minimal effect on the median.
Illustration provides a graphical view showing stability of the median versus varying influences of extreme values on the mean.
An example visual showed the mean increasing dramatically as an outlier was adjusted, while the median remained constant.
Conclusion
Utilizing multiple statistical measures (mean, median, mode) is crucial for accurately describing and understanding a data set.
These measures provide a comprehensive view and help to mitigate the impact of anomalies and outlier values.
This lesson reiterated that understanding data requires more than one statistical perspective.