QL 3.3 Comparing Mean and Median
Notes on Section 3.3: Difference Between the Median and the Mean
Definitions
Median
Described as the middle value of a dataset when arranged in order from least to greatest.
To find the median:
Arrange all numbers in ascending order.
Eliminate the first and last elements iteratively until reaching the middle.
Mean (Average)
Known as the average of a set of numbers.
Calculated using the formula:
Data Representation
Dot Plots
A visual representation of data points.
Different from listing numbers in a plain format, as it provides an immediate visual comparison.
Case Study: Olivia's Fruit Consumption Survey
Olivia surveyed freshmen and seniors about their daily fruit consumption.
Freshmen Data (Dot Plot Analysis)
Data points from the dot plot represent the following counts:
1 at 0 pieces of fruit: 1 zero
2 at 1 piece of fruit: 2 ones
4 at 2 pieces of fruit: 4 twos
3 at 3 pieces of fruit: 3 threes
1 at 4 pieces of fruit: 1 four
1 at 5 pieces of fruit: 1 five
1 at 6 pieces of fruit: 1 six
1 at 19 pieces of fruit: 1 nineteen
Calculating the Mean for Freshmen
Total:
Number of data points (dots): 15
Mean calculation:
Calculating the Median for Freshmen
Steps to find the median: Remove the first and last data points iteratively until reaching the center:
Resulting median value: 3.
Comparison of Mean and Median for Freshmen
Mean (4) is greater than Median (3).
Reason for difference: Presence of an outlier (19).
Effect on Median
The median remains unaffected by the outlier.
If 19 is removed, the numbers would still allow for a median of 3, affirming that the median is robust against outliers.
Observations from Senior Data
Next step involves analyzing the seniors' data.
Senior Data (Dot Plot Analysis)
Breakdown of counts:
1 at 0: 1 zero
2 at 1: 2 ones
5 at 2: 5 twos
3 at 3: 3 threes
2 at 5: 2 fives
1 at 16: 1 sixteen
1 at 17: 1 seventeen
Calculating the Mean for Seniors
Total: 55
Number of data points (dots): 16
Mean calculation:
Calculating the Median for Seniors
Middle value identified by removing the first and last until reaching the center:
Resulting median: 3.
Comparative Analysis and Observations
The means for both groups are relatively close to their respective medians:
Freshmen: Mean = 4, Median = 3
Seniors: Mean = 3.4375, Median = 3
Implication: The mean is closer to the median when the data distribution is symmetric.
Observation of symmetry in the distributions shown via the dot plots.
Additional Insights
Outliers significantly affect the mean more than the median.
A skewed distribution may lead to a higher mean than median, emphasizing the necessity of understanding data shape.