STAT1170 4: Sample Means + Confidence Intervals

Sampling Distributions for Proportions

  • Proportions are averages of dichotomous data (0 or 1).

  • In repeated sampling, sample proportions approach a Normal distribution if sample size n is sufficiently large.

  • Population proportion: p; Sample proportion: \hat{p}.

Conditions for Normal Distribution

  • Central Limit Theorem (CLT) requires:

    • n p \geq 5

    • n (1 - p) \geq 5

  • If both conditions are met, sample proportions approximate Normal, centered at p.

  • Standard error of sample proportions: \sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}.

Example Calculations

  • Population proportion of white cars: p = 0.4

  • In a sample of 25 cars, calculate \hat{p}:

    • \hat{p} = \frac{12}{25} = 0.48

  • To find probabilities:

    • Calculate z-score for \hat{p}:
      z = \frac{\hat{p} - p}{\sqrt{\frac{p(1 - p)}{n}}}.

    • Probability of sample proportion at least 0.48: P(\hat{p} \geq 0.48) = 0.2071.

Confidence Intervals for Population Proportions

  • 95% Confidence Interval (CI):

    • When CLT applies, CI for population proportion:
      \hat{p} \pm 1.96 \times \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}.

  • Example using Sydney teenagers:

    • \hat{p} = \frac{216}{995} = 0.2171,

    • CI: (0.191, 0.243).

Confidence Intervals for Population Mean

  • When \sigma is known:

    • y \pm 1.96 \times \frac{\sigma}{\sqrt{n}}.

  • When \sigma is unknown:

    • Use sample standard deviation s and t-distribution:

    • y \pm t_{\alpha/2} \times \frac{s}{\sqrt{n}}.

Notable Points

  • Student’s t-distribution is used when \sigma is estimated.

  • It has heavier tails for small samples, adjusts with degrees of freedom (n - 1).

  • Ensure independence of observations for valid CI estimates.