Median, Mean, and Mode

Introduction to Statistics

  • A statistic is a number that describes a set of data.

  • The mean, median, and mode are three key statistics that describe different aspects of a data set.

  • Each statistic provides unique information about the data.

Key Statistical Measures

Mean

  • The mean, often referred to as the average, is calculated by:

    • Formula: extMean=Sum of all numbersNumber of valuesext{Mean} = \frac{\text{Sum of all numbers}}{\text{Number of values}}

  • Steps to find the mean:

    1. Add all numbers in the data set to find their sum.

    2. Divide the sum by the total number of items in the data set.

    • Example:

    • Given a set of numbers (e.g., test scores), add them together.

    • If there are 9 scores, divide the total sum by 9.

    • Resulting mean: approximately 85.6.

Median

  • The median is the middle number in a data set when the numbers are ordered from least to greatest.

  • Steps to find the median:

    1. Order the numbers from least to greatest.

    2. Identify the middle number. If the number of values is odd, the median is the central value.

    • If the number of values is even, calculate the mean of the two middle numbers.

      • Example:

      • For a data set ordered from least to greatest, if the values are (x1, x2, …, x_n):

      • Odd count leads to median = x_{{(n+1)/2}}.

      • Even count gives median = x<em>n/2+x</em>(n/2)+12\frac{x<em>{n/2} + x</em>{(n/2)+1}}{2}.

      • In a set with an odd number of scores, the median found was 88.

      • Interpretation: 50% of the data values lie above and below the median.

Mode

  • The mode is the number(s) that occurs most frequently in a data set.

  • Determining the mode:

    1. It can be helpful to have the data ordered from least to greatest.

    2. Identify which number appears most frequently.

  • Example:

    • In a test score data set, if the numbers were two 74s and two 83s, both 74 and 83 would be modes (bimodal).

    • However, if another number (e.g., 83) appears three times, 83 is the sole mode.

Further Examples and Comparisons

First Example

  • Data set: nine test scores.

  • Mean: Sum of scores divided by 9 = 85.6.

  • Median: Ordered data shows 88 as the middle value.

  • Mode: 74 and 83, since they occur most often (bimodal).

Second Example

  • Data set: twelve test scores.

  • Mean: Approximately 80.7 after adding all scores and dividing by 12.

  • Median: When ordered, values of 83 and 88 are the middle numbers, yielding a median of:

    • Median=83+882=85.5\text{Median} = \frac{83 + 88}{2} = 85.5.

  • Mode: The number 83 occurs most frequently, making it the sole mode.

Understanding Variations Between Statistics

  • Notably, in the first example, the mean and median were close (85.6 vs. 88).

  • In contrast, the second example had a larger difference between the mean (80.7) and median (85.5).

  • Explanation:

    • A low test score can heavily influence the mean and create a notable discrepancy between the mean and median, while the median remains resilient to such extremes.

Understanding Outliers

  • Outliers are extreme values that can significantly skew the mean but have a minimal effect on the median.

  • Illustration provides a graphical view showing stability of the median versus varying influences of extreme values on the mean.

  • An example visual showed the mean increasing dramatically as an outlier was adjusted, while the median remained constant.

Conclusion

  • Utilizing multiple statistical measures (mean, median, mode) is crucial for accurately describing and understanding a data set.

  • These measures provide a comprehensive view and help to mitigate the impact of anomalies and outlier values.

  • This lesson reiterated that understanding data requires more than one statistical perspective.