QL 3.3 Comparing Mean and Median

Notes on Section 3.3: Difference Between the Median and the Mean

Definitions

  • Median

    • Described as the middle value of a dataset when arranged in order from least to greatest.

    • To find the median:

    • Arrange all numbers in ascending order.

    • Eliminate the first and last elements iteratively until reaching the middle.

  • Mean (Average)

    • Known as the average of a set of numbers.

    • Calculated using the formula:
      Mean=Sum of all data valuesNumber of values\text{Mean} = \frac{\text{Sum of all data values}}{\text{Number of values}}

Data Representation

  • Dot Plots

    • A visual representation of data points.

    • Different from listing numbers in a plain format, as it provides an immediate visual comparison.

Case Study: Olivia's Fruit Consumption Survey

  • Olivia surveyed freshmen and seniors about their daily fruit consumption.

  • Freshmen Data (Dot Plot Analysis)

    • Data points from the dot plot represent the following counts:

    • 1 at 0 pieces of fruit: 1 zero

    • 2 at 1 piece of fruit: 2 ones

    • 4 at 2 pieces of fruit: 4 twos

    • 3 at 3 pieces of fruit: 3 threes

    • 1 at 4 pieces of fruit: 1 four

    • 1 at 5 pieces of fruit: 1 five

    • 1 at 6 pieces of fruit: 1 six

    • 1 at 19 pieces of fruit: 1 nineteen

  • Calculating the Mean for Freshmen

    • Total: 0×1+1×2+2×4+3×3+4×1+5×1+6×1+19×1=600\times1 + 1\times2 + 2\times4 + 3\times3 + 4\times1 + 5\times1 + 6\times1 + 19\times1 = 60

    • Number of data points (dots): 15

    • Mean calculation:
      Mean=6015=4\text{Mean} = \frac{60}{15} = 4

  • Calculating the Median for Freshmen

    • Steps to find the median: Remove the first and last data points iteratively until reaching the center:

    • Resulting median value: 3.

Comparison of Mean and Median for Freshmen

  • Mean (4) is greater than Median (3).

  • Reason for difference: Presence of an outlier (19).

  • Effect on Median

    • The median remains unaffected by the outlier.

    • If 19 is removed, the numbers would still allow for a median of 3, affirming that the median is robust against outliers.

Observations from Senior Data

  • Next step involves analyzing the seniors' data.

  • Senior Data (Dot Plot Analysis)

    • Breakdown of counts:

    • 1 at 0: 1 zero

    • 2 at 1: 2 ones

    • 5 at 2: 5 twos

    • 3 at 3: 3 threes

    • 2 at 5: 2 fives

    • 1 at 16: 1 sixteen

    • 1 at 17: 1 seventeen

  • Calculating the Mean for Seniors

    • Total: 55

    • Number of data points (dots): 16

    • Mean calculation:
      Mean=5516=3.4375\text{Mean} = \frac{55}{16} = 3.4375

  • Calculating the Median for Seniors

    • Middle value identified by removing the first and last until reaching the center:

    • Resulting median: 3.

Comparative Analysis and Observations

  • The means for both groups are relatively close to their respective medians:

    • Freshmen: Mean = 4, Median = 3

    • Seniors: Mean = 3.4375, Median = 3

  • Implication: The mean is closer to the median when the data distribution is symmetric.

  • Observation of symmetry in the distributions shown via the dot plots.

Additional Insights
  • Outliers significantly affect the mean more than the median.

  • A skewed distribution may lead to a higher mean than median, emphasizing the necessity of understanding data shape.