WK11: Statistical inference: Two categorical variables: Chi-square test for goodness of fit

Distribution of a Categorical Variable: Lists categories and their proportions.
If there are K categories: $P1, P2, …, P_K$ represent the proportions for each category.
$\sum{i=1}^{K} Pi = 1$ (sum of proportions equals 1).
Null Hypothesis ( $H_0$ ): Specifies proportions for all categories.
- $P1 = P{10}, P2 = P{20}, …, PK = P{K0}$
Alternative Hypothesis ( $H_a$ ): At least one proportion is different from what the null hypothesis states.
- Not all $Pi$ are equal to $P{i0}$ .

Claim: Births are not evenly distributed across days of the week.
Null Hypothesis ( $H_0$ ): Births are equally likely on all days of the week.
- $P1 = P2 = … = P_7 = \frac{1}{7}$
Alternative Hypothesis ( $H_a$ ): Births are not equally likely on all days of the week.
- Not all $P_i$ are equal to $\frac{1}{7}$ .
- This doesn't specify which days have different proportions.

Random Sample.
Expected counts under the null hypothesis must be:
- At least 1 for each cell.
- At least 80% of cells should have expected counts of at least 5.
Expected Counts Calculation: Multiply the proportion specified in the null hypothesis by the total sample size.

Chi-Square Statistic: Measures the difference between observed and expected counts.
Formula: $\chi^2 = \sum{i=1}^{K} \frac{(Oi - Ei)^2}{Ei}$
- $O_i$ = Observed count for category i
- $E_i$ = Expected count for category i
Degrees of Freedom: Number of categories minus one (K - 1).

P-value: Probability of observing a test statistic as extreme or more extreme than the one calculated, assuming the null hypothesis is true.
If p-value < level of significance ($\alpha$): Reject the null hypothesis.
- The sample gives statistically significant evidence supporting the alternative.
If p-value > level of significance ($\alpha$): Fail to reject the null hypothesis.
- The sample does not give statistically significant evidence to support the alternative.

P-value is between 0.0025 and 0.005.
Level of significance ($\alpha$) = 0.05
Since p-value < 0.05, reject the null hypothesis.
Conclusion: At the 5% level of significance, the data gives statistically significant evidence that local births are not equally likely on all days of the week.

When a categorical variable has only two categories, the chi-square goodness-of-fit test is equivalent to the test for the proportion of successes.
Both tests will provide the same decision and conclusion.

Success = Landing heads.
$H_0: P = 0.5$
$H_a: P \neq 0.5$
Sample proportion: $\hat{p} = \frac{168}{400} = 0.42$
Test statistic: $Z = \frac{\hat{p} - P}{\sqrt{\frac{P(1-P)}{n}}} = \frac{0.42 - 0.5}{\sqrt{\frac{0.5(1-0.5)}{400}}} = -3.2$
P-value: 2 * 0.0007 = 0.0014
Conclusion: P-value < 0.05, so reject the null hypothesis. The sample gives statistically significant evidence contradicting the 50/50 distribution.

Categories: Heads (H) and Tails (T)
$H0: PH = 0.5, P_T = 0.5$
$H_a$ : At least one is different from 0.5.
Expected counts: 200 for each category
Test statistic: $\chi^2 = \sum \frac{(Oi - Ei)^2}{E_i} = \frac{(168-200)^2}{200} + \frac{(232-200)^2}{200}= 10.24$
Degrees of freedom: 2 - 1 = 1
P-value: Between 0.001 and 0.002
Conclusion: P-value < 0.05, so reject the null hypothesis. The sample gives statistically significant evidence that $P_H \neq 0.5$ , which contradicts the 50/50 distribution.