Confidence Intervals Summary

Confidence Intervals

Overview

  • Use sample mean MM to estimate population mean μ\mu.
  • Assess the accuracy of MM as an estimator of μ\mu.

Sampling Distribution of Mean

  • The mean of the sampling distribution is μ\mu.
  • The standard deviation is σM=σn\sigma_M = \frac{\sigma}{\sqrt{n}}, also known as the standard error of the mean.
  • Shape: normal distribution.

Using Sampling Distribution

  • For a population with μ=100\mu=100 and σ=10\sigma=10, with a sample size of n=25n=25, the sampling distribution of the mean is approximately normal.
  • Mean = 100
  • Standard deviation = σM=1025=2\sigma_M = \frac{10}{\sqrt{25}} = 2

Area Under the Curve

  • The sampling distribution is an idealized frequency distribution.
  • Area under the curve corresponds to the proportion of cases in a particular range of scores and the probability of sampling a case within that range.

Normal Distribution

  • Symmetrical.
  • The mean is the center.
  • 50% of the distribution is above and below the mean.
  • The area between any two points can be calculated.

Z-Scores

  • Convert raw scores to Z scores to use the normal curve table.
  • Z=MμσMZ = \frac{M - \mu}{\sigma_M}

Examples of Probability Calculations:

  • Example 1: Probability of a sample mean higher than 103
    • Calculate σM=2\sigma_M = 2
    • Calculate Z score: Z=1031002=1.5Z = \frac{103 - 100}{2} = 1.5
    • From tables, the area beyond z=1.5z = 1.5 is 0.50.433=0.0670.5 - 0.433 = 0.067.
    • Therefore, the probability is 0.067.
  • Example 2: Probability that a sample mean will be within ±4\pm 4 points of μ\mu
    • Calculate σM=2\sigma_M = 2
    • Convert 96 and 104 to Z scores: Z=961002=2Z = \frac{96 - 100}{2} = -2, Z=1041002=2Z = \frac{104 - 100}{2} = 2
    • From tables, the area between Z=±2Z = \pm 2 is 0.477×2=0.9540.477 \times 2 = 0.954.
    • Therefore, the probability is 0.954.
  • Example 3: Value exceeded by only 2.5% of sample means
    • Calculate σM=2\sigma_M = 2
    • From tables, the Z score that corresponds to an area of 0.475 is 1.96.
    • Convert Z to M: M=μ+ZσM=100+1.96×2=103.92M = \mu + Z \sigma_M = 100 + 1.96 \times 2 = 103.92
  • Example 4: Limits within which the central 95% of sample means fall
    • Calculate σM=2\sigma_M = 2
    • From tables, z=±1.96z = \pm 1.96 bounds the central area of 95%.
    • Upper limit: M<em>upper=μ+zσ</em>M=100+1.96×2=103.92M<em>{upper} = \mu + z \sigma</em>M = 100 + 1.96 \times 2 = 103.92
    • Lower limit: M<em>lower=μzσ</em>M=1001.96×2=96.08M<em>{lower} = \mu - z \sigma</em>M = 100 - 1.96 \times 2 = 96.08

Estimation Concepts

  • Point Estimate:
    • The sample mean MM is the best point estimate of μ\mu, as it is an unbiased estimator.
  • Interval Estimate (Confidence Interval):
    • Construct a range of values that, with a certain level of confidence, covers the true value of μ\mu.
    • If 95% of sample means are within 1.96×σ<em>M1.96 \times \sigma<em>M from μ\mu, then for 95% of samples, μ\mu is within 1.96×σ</em>M1.96 \times \sigma</em>M from MM.

Confidence Interval for μ\mu (σ\sigma known)

  • Let α\alpha = error rate as a probability
    • For 95% confidence, α=0.05\alpha = 0.05
    • For 90% confidence, α=0.10\alpha = 0.10
    • For 99% confidence, α=0.01\alpha = 0.01
  • A 100(1α)%100(1 - \alpha)\% confidence interval for μ\mu has limits:
    • μ<em>upper=M+(z</em>c×σM)\mu<em>{upper} = M + (z</em>c \times \sigma_M)
    • μ<em>lower=M(z</em>c×σM)\mu<em>{lower} = M - (z</em>c \times \sigma_M)
    • Where zcz_c = “critical value”: the value of z that cuts off α/2\alpha/2 in each tail of the distribution, e.g., 1.96 for α=0.05\alpha = 0.05.

Finding zcz_c

  • For a 90% confidence interval, find the Z score that corresponds to an area of 0.45 (since 5% is in each tail).

Example

  • Scores on a statistics exam are normally distributed with σ2=144\sigma^2 = 144. Let n=25n = 25 and M=60M = 60. Obtain 90% confidence limits for μ\mu.
    • Step 1: Find z<em>cz<em>c and σ</em>M\sigma</em>M
      • zc=1.645z_c = 1.645
      • σ=144=12\sigma = \sqrt{144} = 12
      • σM=1225=2.4\sigma_M = \frac{12}{\sqrt{25}} = 2.4
    • Step 2: Find upper and lower limits for μ\mu
      • μ<em>upper=M+(z</em>c×σM)=60+(1.645×2.4)=63.95\mu<em>{upper} = M + (z</em>c \times \sigma_M) = 60 + (1.645 \times 2.4) = 63.95
      • μ<em>lower=M(z</em>c×σM)=60(1.645×2.4)=56.05\mu<em>{lower} = M - (z</em>c \times \sigma_M) = 60 - (1.645 \times 2.4) = 56.05
    • Step 3: Conclusion
      • We can be 90% confident that the population mean exam mark lies between 56.05 and 63.95.

Effect of Confidence Level on Width of Interval

  • Calculate 99% confidence limits for μ\mu for statistics exam data. n=25n = 25, M=60M = 60, σM=2.4\sigma_M = 2.4.
    • From tables, ZC=2.58Z_C = 2.58 (area = 0.495).
    • Upper and lower limits for μ\mu
      • μ<em>upper=M+(z</em>c×σM)=60+(2.58×2.4)=66.19\mu<em>{upper} = M + (z</em>c \times \sigma_M) = 60 + (2.58 \times 2.4) = 66.19
      • μ<em>lower=M(z</em>c×σM)=60(2.58×2.4)=53.81\mu<em>{lower} = M - (z</em>c \times \sigma_M) = 60 - (2.58 \times 2.4) = 53.81
    • Compare these with 90% confidence limits [56.05, 63.95]. The greater the level of confidence desired, the wider the interval.