Hypothesis Testing for Two Proportions Study Notes

Overview of Hypothesis Testing for Two Proportions

Hypothesis Testing involves making inferences regarding population parameters based on sample statistics.
Null Hypothesis (H0): Assumes no difference in the proportions being compared. For example, if group one is girls and group two is boys, then (H0: p1 = p_2).
Alternative Hypothesis (H1): States that there is a difference between the two proportions, represented as (H1: p1 \neq p_2), or can be given as greater than or less than comparisons.
Confidence Interval: A range of values that is expected to cover the true population parameter a certain percentage of the time (e.g., 95%).

Step 1: Determine Grouping: Decide which group is group one and which is group two. There is no correct or wrong way to assign groups, but consistency is essential.
Step 2: Calculate the sample proportions for both groups, denoted as:
- (p1 = \frac{x1}{n1}) where (x1) is the number of successes in group one and (n_1) is the size of group one.
- (p2 = \frac{x2}{n2}) where (x2) is the number of successes in group two and (n_2) is the size of group two.
Step 3: Construct the Confidence Interval:
- The general formula for the confidence interval for the difference in proportions is
  \left( (p1 - p2) \pm Z \cdot \sqrt{ \frac{p1(1-p1)}{n1} + \frac{p2(1-p2)}{n2}} \right)
- Where (Z) is the critical value corresponding to the chosen confidence level.
Step 4: Interpretation of Results:
- If the confidence interval contains zero, it indicates no significant difference between the two groups.
- If it does not contain zero, there may be a statistically significant difference in proportions; a positive value indicates group one has a higher proportion than group two.

Consider a scenario with the following data from two groups regarding the ownership of credit cards:
- Group 1 (Girls): (n1 = 250), (x1 = 180) hence (p_1 = \frac{180}{250} = 0.72 ).
- Group 2 (Boys): (n2 = 242), (x2 = 136) hence (p_2 = \frac{136}{242} = 0.56).
Step 1: p-values need to be calculated from provided data, with a focus on the difference between the two proportions.
Step 2: Using the confidence interval formula to calculate:
1. Calculate (p1 - p2 = 0.72 - 0.56 = 0.16).
2. Calculate standard error:
  SE = \sqrt{ \frac{0.72 \cdot (1 - 0.72)}{250} + \frac{0.56 \cdot (1 - 0.56)}{242} }
3. Use a Z-score for the desired confidence level (e.g., 1.96 for 95% confidence).
Step 3: Calculate the margin of error using the Z-score and the standard error.

Set Up Hypotheses
- Null: (H0: p1 = p_2)
- Alternative: (H1: p1 \neq p_2) (two-tailed test)
Calculate p-Value
- Perform Z-test using its formula, relying on software/calculators for efficiency.
Decision Rule: Compare the p-value against a significance level (commonly (\alpha = 0.05)):
- If the p-value (< \alpha), reject the null hypothesis.
- If the p-value (\geq \alpha), do not reject the null hypothesis.

If rejecting the null hypothesis, interpret that there is evidence of a difference in credit card ownership between girls and boys without specifying which group is higher unless that was tested separately.
It is crucial to confirm that terms are correctly defined throughout the testing process, including clearly stating whether proportions being compared are intended to be equal or not.
Final Notes: Always double-check calculations for rounding and ensure data inputs are accurately represented in formulae utilized.