Probability and Samples: The Distribution of Sample Means

Chapter Learning Objectives

  • Define the Distribution of Sample Means: Understand what the distribution of sample means is, and for a specific sampling situation, describe its characteristics.

  • Describe the Distribution's Characteristics: Identify its shape, the expected value of M (\muM), and the standard error of M (\sigmaM).

  • Utilize the Unit Normal Table: Be able to use the unit normal table interchangeably to determine probabilities or corresponding z-scores.

  • Understand Z-score Location: Recognize that each sample mean, M, has a specific location within the distribution of sample means, which can be quantified by a z-score.

  • Determine Probabilities for Sample Means: Use the distribution of sample means, z-scores, and the unit normal table to calculate probabilities associated with specific sample means.

Samples, Populations, and the Distribution of Sample Means

  • Individual Score Z-scores and Probabilities: Whenever an individual score is selected from a population, you should be able to calculate a z-score to pinpoint its exact location in the distribution. If the population is normally distributed, you can also determine the probability of obtaining any specific individual score.

  • Challenges with Samples: Generally, working with samples presents a challenge because a sample inherently provides an incomplete representation of the entire population.

  • Sampling Error: This refers to the natural discrepancy or amount of error that exists between a sample statistic (e.g., sample mean, M) and its corresponding population parameter (e.g., population mean, \mu).

The Distribution of Sample Means

  • Definition: The distribution of sample means is the comprehensive collection of all possible sample means that could be obtained when taking random samples of a specific size (n) from a particular population.

  • Sampling Distribution: This is a broader term defined as any distribution of statistics (such as means, variances, etc.) obtained by selecting every possible sample of a specific size from a given population.

Characteristics of the Distribution of Sample Means

  • Central Tendency: The sample means derived from different samples tend to cluster around the population mean (\mu).

  • Shape: The collection of sample means typically forms a normal-shaped distribution.

  • Effect of Sample Size: As the sample size (n) increases, the individual sample means generally get closer to the population mean (\mu). This illustrates the law of large numbers in action, suggesting that larger samples are more accurate representations.

The Central Limit Theorem (CLT)

  • Core Principle: The Central Limit Theorem is a fundamental principle in statistics. It states that for any population, regardless of its original shape, with a known mean (\mu) and a known standard deviation (\sigma), the distribution of sample means for sample size n will possess the following properties:

    • Mean of the Distribution of Sample Means: The mean of this distribution will be equal to the population mean ($\mu_M = \mu$). This is also referred to as the expected value of M.

    • Standard Deviation of the Distribution of Sample Means: The standard deviation of this distribution, known as the standard error of M, will be \frac{\sigma}{\sqrt{n}}.

    • Shape Approximation: The distribution of sample means will increasingly approach a normal distribution as the sample size (n) approaches infinity.

  • Rapid Normal Approximation: The