Statistical Reasoning Lecture 10

Overview: The lecture discusses hypothesis testing in the context of comparing proportions and incidence rates between two populations, focusing on methods such as the two-sample z-test, chi-square test, and Fisher's exact test.

Upon completion, students will be able to:
- Estimate and interpret p-values when comparing proportions with the two-sample z-test.
- Explain the relationship between risk difference (RD), relative risk (RR), and odds ratio (OR), emphasizing why only one p-value is necessary despite the multiple measures of association.

The two-sample z-test is similar to the t-test for comparing means but is specifically for proportions. Its methodology includes:
- Setting up null () and alternative hypotheses.
- Assuming the null hypothesis is true to compute how far the observed sample estimate diverges from expectations under the null hypothesis.
- Translating this distance into a p-value for decision-making.

A study conducted on 1,000 HIV-positive patients:
- CD4 < 250: 127 out of 503 (25% response)
- CD4 ≥ 250: 79 out of 497 (16% response)
- Overall response: 206 out of 1,000.

Competing hypotheses:
- Null: H0: pC<250 = pC≥250
- Alternative: HA: pC<250 ≠ p_C≥250

Compute the standardized distance:
- z = rac{(p̂C<250 - p̂C≥250) - 0}{SE(p̂C<250 - p̂_C≥250)}
- Here, SE(p̂C<250 - p̂C≥250) = ext{sqrt}igg( rac{p̂C<250(1-p̂C<250)}{nC<250} + rac{p̂C≥250(1-p̂C≥250)}{n_C≥250}igg)
The resulting z-value was approximately 3.6, suggesting a significant difference in response rates (above the null of 0).
p-value results:
- Calculate: $p = 2<em>(1 - pnorm(3.6)) ext{ or } 2</em>pnorm(-3.6)$
- Result: p-value < 0.001, indicating strong evidence against the null hypothesis.

The chi-square test yields similar results to the two-sample z-test for comparing proportions.
General procedure:
- Setup null and alternative hypotheses, assume null is true, calculate the distance from observed to expected frequencies, convert this distance into a p-value.

Under the null hypothesis:
- Calculate expected counts for each outcome based on observed proportions.
Apply the chi-square statistic formula:
- $χ^2 = rac{(O<em>i - E</em>i)^2}{E_i}$

Resulting p-value from the chi-square test was approximately identical to that from the z-test, underscoring the tests' equivalence.

Fisher's exact test is utilized for small sample sizes and provides an exact p-value when comparing two groups.
Comparisons suggest the test's application in scenarios where either group is less than 5 in frequency.
Illustrative example:
- Using treatment response from the same HIV study:
- p-values derived from Fisher’s Exact Test were consistent with results from z-test and chi-square approach, reaffirming the findings across methods.

Similar logic applies across various tests. The p-value signifies the probability of observing data as extreme as the sample under the null hypothesis.
In practice, researchers generally use rejection levels at 0.05 and test norms with 95% confidence intervals. Different tests may yield slightly divergent p-values, yet they interpret identically in the context of studies comparing binary outcomes.