Polling Margin of Error & Sampling: Quick Notes

Key Concepts

  • Sampling error vs non-sampling error: sampling error arises from observing a sample; non-sampling errors include nonresponse bias, response bias, recall bias, selection bias, etc.
  • Poll data typically uses categorical data (e.g., Labor vs National) to infer population behavior.
  • The “margin of error” (MOE) quantifies the sampling error for a reported percentage.

Margin of Error and Sampling Error

  • MOE (in percentage points) ≈
    \text{MOE} \approx \frac{100}{\sqrt{n}}
    for percentages in the 30\% to 70\% range (i.e., when p is not too close to 0 or 100).
  • Examples:
    • n = 400 ⇒ MOE ≈ 100/\sqrt{400} = 5\%.
    • n = 1000 ⇒ MOE ≈ 100/\sqrt{1000} ≈ 3.16\% (about 3.2\%).
  • For minor parties near 0% or 100%, MOE is smaller or the simple rule is less accurate; the exact formula would yield different values.
  • The MOE reflects only sampling error, not non-sampling errors.

Confidence Intervals

  • A 95\% confidence interval for a single percentage estimate is:
    \text{Estimate} \pm \text{MOE}
  • Interpretation: If we repeated the poll many times, about 95\% of such intervals would contain the true population value.
  • The interval accounts for sampling variability; true error also includes non-sampling errors (not captured here).

Sample Size and Variability

  • Larger samples reduce sampling variability (tighten the distribution of sample estimates).
  • Visual intuition: as n grows (e.g., 30 → 100 → 1000), the spread of sample proportions around the population value shrinks.
  • Variability is highest near 50\% and lower near 0\% or 100\%; mid-range proportions show the greatest uncertainty.

Two-Sample Comparisons (Margin of Error for Differences)

  • If comparing two proportions from the same sample (within-group):
    \text{MOE}_{\text{diff}} \approx 2 \times \frac{1}{\sqrt{n}}
  • If comparing two proportions from independent groups (two separate samples):
    \text{MOE}{\text{diff}} \approx 1.5 \times \frac{1}{2}\left( \frac{1}{\sqrt{n1}} + \frac{1}{\sqrt{n_2}} \right)
  • Example (same sample): n = 750, split into 375 and 375 ⇒ MOE for each ~ \frac{1}{\sqrt{375}} \approx 5.2\%, so \text{MOE}_{\text{diff}} \approx 2 \times 5.2\% = 10.4\%. The reported difference (e.g., 3.6\%) would be far within that margin.

Interpreting Poll Claims

  • Steps to evaluate a claim:
    1) Identify claim type: no comparison, within-group, or between two independent groups.
    2) Determine sample size and MOE; compute a 95\% confidence interval for the claim.
    3) Decide whether the claim is supported by the interval (i.e., whether the interval excludes zero for a difference).
  • Example framing: a reported lead of 6.8 percentage points from a single poll is not necessarily true if the 95\% CI for the difference includes 0.

Practical Takeaways for Exam/Quiz

  • For a single poll percentage (p) with n, assume robustness when p ∈ [30\%, 70\%] and use \text{MOE} \approx \dfrac{100}{\sqrt{n}}\%. For n = 1000, this is about 3.2\%.
  • For robust interpretation of minor parties near 0% or 100%, be cautious; margins may be much smaller than the 30-70% rule suggests.
  • When reporting, always present the range: \text{Estimate} \pm \text{MOE} to convey uncertainty.
  • If splitting the sample into subgroups, recalculate MOE using the appropriate rule of thumb (within same sample vs independent groups).
  • Remember: media often omits margin of error; your interpretation should hinge on the CI rather than the point estimate alone.

Quick Reference: Rule of Thumb and range

  • Base MOE rule of thumb:
    \text{MOE} \approx \dfrac{1}{\sqrt{n}} \quad \text{(as a proportion)},
    or equivalently \text{MOE (percentage points)} \approx \dfrac{100}{\sqrt{n}}.
  • Valid range: use for p ∈ [30\%, 70\%]; outside this range, the approximation is less reliable.
  • For comparing two proportions:
    • same sample: MOE_diff ≈ 2 × (1/√n)
    • independent groups: MOE_diff ≈ 1.5 × average(1/√n1, 1/√n2)
  • 95\% confidence means: long-run frequency of intervals capturing the true value is 95\%.