Statistical Inference for Proportions Summary

Statistical Inferences for Proportions

  • Key Concepts:

    • Estimation and inference for one proportion (population parameter $p$)
    • Comparison and inference between two population proportions
  • Statistical Methods:

    • Hypothesis testing and confidence intervals for one and two populations
    • Differences in proportions, relative rates, odds ratios

One Population Proportion

  • Estimation:
    • Point estimator: $p$
    • Standard error: SE(p)=extsqrt(extp<em>0(1extp</em>0))nSE(p) = \frac{ ext{sqrt}( ext{p}<em>0(1 - ext{p}</em>0))}{n}
    • Z-test for hypothesis testing against a constant $p_0$.
    • 95% Confidence Interval: CI=pext±1.96imesSE(p)CI = p ext{ ± } 1.96 imes SE(p)

Confidence Intervals for One Proportion

  • Data example: 837 out of 1810 respondents favored legalized abortion.
  • Point estimate: 46.24%.
  • R function: binom.confint(x, n, conf.level) from the binom package, providing multiple CI methods (11 types).

Two Population Proportions

  • Comparison Methods:

    • Difference: $p1 - p2$
    • Relative risk: RR=p<em>1p</em>2RR = \frac{p<em>1}{p</em>2}
    • Odds ratio: OR=p<em>1(1p</em>2)p<em>2(1p</em>1)OR = \frac{p<em>1(1 - p</em>2)}{p<em>2(1 - p</em>1)}
  • Data Structure:

    • Independent groups in rows, binary results in columns.

Chi-Squared Tests

  • Use:
    • Confirms independence of variables (e.g., gender and political party).
    • Standard output: Chi-squared statistic and p-value.

R Functions for Proportions

  • One population: prop.test() for CI and hypothesis testing.
  • Two populations: prop.test() for statistical inference.
  • Relative Risk CI: Function from Proxya package.
  • CMH test for ordinal data: Using CMH.test() function, requires ordinal data scoring.