LV

Statistical Measures Summary

Chapter 1 – Statistical Measures

1.1 What are Statistics?

  • Statistics encompass the study of:
    • Collection of data
    • Processing of data
    • Summarization of data
    • Analysis of data
    • Interpretation of data

What is Data?

  • Quantitative Data: Numbers referring to a characteristic (e.g., ages).
    • Example: Ages of 500 individuals: 16, 45, 62, 81, …, 54.
  • Qualitative Data: Symbols/letters referring to a characteristic (e.g., gender).
    • Example: Genders of 500 individuals: M, F, F, M, …, M, F.
  • Focus on quantitative data throughout the course.

1.2 Populations and Samples

  • Population: Entire set of measurements under consideration.
  • Sample: Subset of the population.
    • A random sample ensures each population entry has an equal chance of selection.

1.3 Statistical Measures

  • Measures are computed numbers that summarize data characteristics.
  • Types of Measures:
    • Measures of Central Tendency (1.4)
    • Measures of Variation (1.5)

1.4 Measures of Central Tendency

  • Describes the center of data; key types:

The Mean

  • Most common measure of central tendency.
  • Calculation:
    • Each data entry is represented by x, with n as the number of entries.
    • Mean = (Sum of data entries) / n
  • Example Calculation:
    • Sample Data (scores): 35, 82, 61, 39, 92, 53, 72
    • Sample Mean Calculation:
    • n = 7
    • Mean = (35 + 82 + 61 + 39 + 92 + 53 + 72) / 7 = 62.

The Weighted Mean

  • Accounts for varying importance of data entries.
  • Example: John's Course Score Calculation:
    • Participation: 87 (weight 0.1)
    • Midterm: 72 (weight 0.4)
    • Final: 47 (weight 0.5)
    • Weighted Mean = (87 * 0.1 + 72 * 0.4 + 47 * 0.5) / (0.1 + 0.4 + 0.5) = 61.

The Median

  • Middle value when data is ordered.
  • Calculation Steps:
    1. Order data from smallest to largest.
    2. Identify median.
  • For an even number of entries, average the two middle values.
  • Example:
    • Data: 25, 24, 25, 22, 23, 20, 20, 24, 21, 22, 25, 24 (Total 12 entries);
    • Ordered: 20, 20, 21, 22, 22, 23, 24, 24, 24, 25, 25, 25;
    • Median = (22 + 23) / 2 = 22.5.

1.5 Measures of Variation

  • Measures extent and nature of data spread.

The Range

  • Formula: Range = Largest data entry - Smallest data entry.
  • Example: Range of Example 8 = 25 - 20 = 5.

Variance

  • Represents average squared distance of each data entry from the mean.
  • Formula:
    • Variance (s²) = (Sum of (x - mean)²) / (n - 1).
  • Standard Deviation (s): Square root of variance.
    • Indicates average deviation from the mean.
  • Example Calculation:
    • Given temperatures: 20, 23, 18, 25, 21, 25;
    • Compute mean, followed by variance and standard deviation.

Interpretation of Standard Deviation

  • Low standard deviation indicates data entries are close to the mean.
  • High standard deviation indicates a wider spread of data entries.
  • It's useful for comparing variations across samples only if they share similar means.
  • Example 10 tasks include finding mean, median, variance, and standard deviation of six numbers.