PSTAT 5LS – Theory-Based Inference for a Population Proportion

Course Logistics and Housekeeping

Course: PSTAT 5LS – Theory-Based Inference for $p$ (Slide Set 5)
Today’s topic: Introduction to theory-based inference for a population proportion $p$
Next time: Continuation of the same topic
Upcoming homework deadlines
• HW 2 due Tue Jul 8 @ 11:59 PM
• HW 3 due Mon Jul 14 @ 11:59 PM
• (HW 4 appears on slides but due date not shown: “F <118”)
Office-hours reminder
• Instructor OH: Tue & Thu 2–3 PM via Zoom
• Encouragement: “Visit us in office hours!”

Bridging Simulation & Theory

Previous slide set used simulation (randomization & bootstrap) to approximate sampling distributions.
Empirical observation: these simulated sampling distributions looked nearly normal.
• Example 1 (dolphin communication): histogram of simulated proportion of correct guesses.
• Example 2 (community recycling): histogram of simulated proportion of recyclers.
Take-away: the normal pattern hints that theory (specifically the Central Limit Theorem) can describe $\hat p$ ’s behaviour without repeated simulation.

Sampling Distributions

Definition: A sampling distribution is the distribution of a statistic (e.g., sample proportion $\hat p$ or sample mean $\bar x$ ) over all possible random samples of fixed size $n$ from the population.
• Describes shape, centre, and variability attributable purely to random sampling.
• Knowing this distribution lets us judge how “unusual” any observed statistic is when $H_0$ is true.

Distribution of the Sample Proportion $\hat p$

Mean (centre): $E(\hat p)=p$ (the true population proportion).
Standard deviation (variability), termed standard error (SE):
$SE = \sqrt{ \dfrac{p(1-p)}{n} }$
• Acts as a new “ruler” to quantify how far an observed $\hat p$ is from the hypothesised mean.

Central Limit Theorem (CLT) for Proportions

When conditions are met, the sampling distribution of $\hat p$ is approximately normal:
$\hat p \sim N\Bigl(\, p,\,\dfrac{p(1-p)}{n} \Bigr).$
Importance: Allows analytic (theory-based) inference instead of computational simulation.

Conditions Required for Normal Approximation

Independence (a.k.a. Randomness) Condition
• Individual observations must not influence each other.
• Usually satisfied by a simple random sample (SRS) or well-designed randomised study.
• If sampling without replacement, ensure population is at least 10× larger than sample.
Success–Failure Condition
• Expected counts—not necessarily observed counts—must include at least 10 successes and 10 failures.
• For inference on $p$ we check:
$np \ge 10 \quad\text{and}\quad n(1-p) \ge 10.$
• For hypothesis tests we substitute $p0$ (value posited by $H0$ ) because under $H0$ we assume $p=p0$ .
• The threshold “10” is chosen empirically: ensures normal curve approximates the true distribution well enough.

Using the Normal Model in Hypothesis Testing

Unknown true $p$ → plug in hypothesised $p0$ when computing SE during a test. $SE{H0} = \sqrt{ \dfrac{p0(1-p_0)}{n} }$
This substitution keeps calculations self-consistent with the null model.

Formal Steps for a Proportion Hypothesis Test

State hypotheses
• Null: $H0 : p = p0$
• Alternative: $HA : p < p0,\; p\ne p0, \text{ or } p > p0$ (direction dictated by research question before seeing data).
Check conditions (independence + success–failure using $p_0$ ).
Compute test statistic (z-score):
$z = \dfrac{\hat p - p0}{\sqrt{\dfrac{p0(1-p_0)}{n}}}.$
Find p-value
• Use normal distribution areas; tail(s) chosen according to $H_A$ .
Decision
• Compare p-value to significance level $\alpha$ (commonly 0.05).
• p-value $\le \alpha$ → reject $H0$ (result is “statistically significant”). • p-value $> \alpha$ → fail to reject $H0$ (not significant).
Contextual conclusion
• Translate statistical outcome to plain language relating to the study subject.

Interpreting the z Test Statistic

$z$ measures distance of observed $\hat p$ from $p_0$ in standard-error units.
• Example: $z=2$ means observation lies 2 SEs above hypothesised mean.
Link to Empirical (68-95-99.7) Rule:
• |z|≈1 → ordinary; |z|≈2 → somewhat unusual; |z|≈3 → very rare under $H_0$ .
• For non-integer z, exact areas require technology.

Computing Areas with R’s pnorm()

Syntax: pnorm(q, mean = 0, sd = 1, lower.tail = TRUE)
• q = z-score (quantile).
• lower.tail = TRUE returns $P(Z \le q)$ , FALSE returns P(Z > q).
Tail selection depends on $H_A$ :
• Left-tailed (

Worked Examples

1. Dolphin Communication (Doris & Buzz)

Hypotheses:
H0: p = 0.50 \quad vs. \quad HA: p > 0.50.
Data: $n=16,\; \hat p = 15/16 = 0.9375.$
Test statistic:
$z = \dfrac{0.9375 - 0.50}{\sqrt{\dfrac{0.50\times 0.50}{16}}} = 3.50.$
p-value (right tail):
pnorm(3.50, mean = 0, sd = 1, lower.tail = FALSE) → $0.0002326291.$
Interpretation: Extremely small p-value → strong evidence dolphins / Buzz perform better than chance.

2. Community Recycling Rate

Hypotheses:
$H0: p = 0.70 \quad vs. \quad HA: p \ne 0.70.$
Data: $n=800,\; \hat p = 530/800 = 0.6625.$
Test statistic:
$z = \dfrac{0.6625 - 0.70}{\sqrt{\dfrac{0.70\times 0.30}{800}}} = -2.315.$
p-value (two-tailed):
2*pnorm(-2.315, mean = 0, sd = 1, lower.tail = TRUE) → $0.020613.$
Interpretation: At $\alpha=0.05$ , result is statistically significant (p ≈ 0.021 < 0.05); observed recycling proportion differs from 70% (specifically, it is lower).

Visual Memory Aids

Normal curve with mean $p_0$ labelled at centre, tick marks at $\pm SE, \pm2SE, \pm3SE$ .
• Approx. 68% within ±1 SE, 95% within ±2 SE, 99.7% within ±3 SE.
• Extreme regions (tails) correspond to p-value areas.

Tail Direction Summary

Alternative $HA$ dictates tail(s) used: • $p < p0$ → left tail.
• p > p0 → right tail. • $p \ne p0$ → both tails (double smaller area).

Decision Table (α-Level Rule)

If p-value ≤ $\alpha$ ⇒ Reject $H_0$ ⇒ “statistically significant.”
If p-value > $\alpha$ ⇒ Fail to reject $H_0$ ⇒ “not statistically significant.”

Ethical & Practical Considerations (implicit reminders)

Independence assumption critical; violating it invalidates SE & p-values.
Choosing $H_A$ direction must come before examining data to avoid bias.
Success–failure rule ensures ethical communication of uncertainty: prevents overstating evidence when normal approx. is unreliable.
Real-world relevance: findings (e.g., dolphin cognition, community recycling) can inform environmental or cognitive research initiatives.

Connections to Earlier Material

Builds directly on earlier simulation-based inference: both aim to quantify sampling variability.
Standardised statistic $z$ is analogue to earlier randomisation-distribution counts of “how far from center” but now parameterised by theoretical SE.

PSTAT 5LS – Theory-Based Inference for a Population Proportion

Course Logistics and Housekeeping

Bridging Simulation & Theory

Sampling Distributions

Distribution of the Sample Proportion p^\hat pp^​

Central Limit Theorem (CLT) for Proportions

Conditions Required for Normal Approximation

Using the Normal Model in Hypothesis Testing

Formal Steps for a Proportion Hypothesis Test

Interpreting the z Test Statistic

Computing Areas with R’s pnorm()

Worked Examples

1. Dolphin Communication (Doris & Buzz)

2. Community Recycling Rate

Visual Memory Aids

Tail Direction Summary

Decision Table (α-Level Rule)

Ethical & Practical Considerations (implicit reminders)

Connections to Earlier Material

Distribution of the Sample Proportion $\hat p$