Chi-Square Tests Notes

Chapter 16: Inference for Counts (Chi-Square Tests)

16.1 Goodness-of-Fit Tests

Deals with count data at the nominal or ordinal level.
Examines counts of items in different categories.
Asks if the actual distribution differs from a predicted model due to random error or a poor model fit.
Example: Analyzing "up" days in the stock market and comparing it to a population distribution.

16.1 Logic of Goodness-of-Fit Tests

Individuals are classified and counted into categories.
Observed frequencies are tabulated for each category.
Each individual is counted in only one category.
Expected frequencies are constructed based on the null hypothesis (H0).
Expected frequency: The frequency predicted by H0 and the sample size, representing an idealized distribution.
Compares observed frequencies from sample data with expected frequencies predicted by H0.
Uses the Chi-Square distribution to represent values for all possible random samples when H0 is true.

16.2 Interpreting Chi-Square Values

Chi-Square Distribution

Positively skewed distribution.
A family of distributions, each determined by degrees of freedom (df).
Each df value results in a slightly different shape.

16.2 Hypothesis Tests: Chi-Square Goodness-of-Fit

Step 1: Hypotheses

Case I: No preference (equal proportions) among categories
- H0: p1 = p2 = \cdots = pk = a, where 'a' is the probability value to test, and 'k' is the number of categories.
- HA: At least one pi is not equal to its specified value.
Case II: Specified proportions or preference from another known population
- H0: p1 = a1, p2 = a2, \cdots = pk = ak, where a1, a2, \cdots , ak are the probability values or relative preferences of each category.
- HA: At least one pi is not equal to its specified value.

Step 2: Decision Zone and Type of Test

\chi^2 is the test statistic (Chi-square).
Right-tailed test (0 < \chi^2 < \infty).
Reject H0 if \chi^2 > \chi^2{cv}, where \chi^2{cv} is the critical value from the chi-square distribution with (k - 1) degrees of freedom.

Step 3 and 4: Observed and Expected Frequencies

Observed Frequencies: Obs(oi). Property: \sum{i=1}^{k} o_i = n
Expected Frequencies: Exp(ei) = npi. Property: \sum{i=1}^{k} oi = \sum{i=1}^{k} ei = n, where p_i are the probabilities of categories specified in H0.

Step 5: Test Statistic

\chi^2 = \sum{i=1}^{k} \frac{(oi - ei)^2}{ei}

Step 6: Make a Decision (Critical Value Approach)

Reject H0 if the test statistic (\chi^2) is in the critical region.
Fail to reject H0 if the test statistic (\chi^2) is not in the critical region.
Rejection Region: Reject H0 if \chi^2 > \chi^2_{cv} (Right-tailed Test).

Step 6: Make a Decision (p-value Approach)

p-value: Probability of seeing the observed data (or more extreme) given H0 is true.
Calculation of p-value: P(\chi^2 > \chi^2_0) (Right-tailed Test).
Decision Rule:
- If p-value < \alpha, reject H0.
- If p-value \geq \alpha, do not reject H0.

Assumptions and Conditions

Counted Data Condition: Data must be counts for categories of a categorical variable.
Independence Assumption: Counts should be independent of each other.
Randomization Condition: Counted individuals should be a random sample of the population.
Sample Size Assumption: Enough data for the methods to work.
Expected Cell Frequency Condition: Expected cell frequency (ei) in each cell/category should be at least 5. If not, collapse adjacent cells meaningfully until ei \geq 5 and adjust df accordingly.

Example 1: Case I (No Preference or Equal Proportion)

Problem: Are technical support calls equal across all days of the week (uniform distribution)?
Sample data: Technical support calls for 10 days per day of week.
Step 1: Hypotheses
- H0: p1 = p2 = \cdots = pk = \frac{1}{7}
- HA: At least one p_i is not equal to \frac{1}{7}.
Step 2: Decision Zone and Critical Value
- \chi^2{\alpha,k-1} = \chi^2{.05,6} = 12.5916
Steps 3 and 4: Observed and Expected Frequencies
Assumptions and Conditions checked
Steps 5 and 6: Test Statistic and Make a Decision (CV and p-value Approach)
- Decision: Reject H0. The distribution of technical support calls is not uniform across the days of the week.

Example 2: Case II (Specified Preference or Proportion)

Problem: Market share analysis of fabric softener companies A and B after advertising campaigns.
Before campaigns: Company A (45%), Company B (40%), Others (15%).
Sample: 200 customers; A (102), B (82), Others (16).
Question: Have customer preferences changed at a 5% significance level?
Step 1: Hypotheses
- H0: p1 = 0.45, p2 = 0.40, p3 = 0.15
- HA: At least one p_i is not equal to its specified value.
Step 2: Decision Zone and Critical Value
- \chi^2{\alpha,k-1} = \chi^2{.05,2} = 5.99147
Steps 3 - 5: Observed, Expected Frequencies, and Test Statistic
Assumptions and Conditions checked
Step 6: Make a Decision (CV and p-value Approach)
- Decision: Reject H0. Market preferences have changed since the advertising campaigns.

Goodness-of-Fit Test for Normal Distribution: Case II (Specified Preference or Proportion)

Used to check the validity of the assumption of a normal distribution used in many statistical methods.
Can use frequency distributions, stem-and-leaf displays, histograms, and normal plots.
Alternatively can conduct a chi-square goodness-of-fit test.

Example 3: Case II (Specified Preference or Proportion)

Test of Normality: Histogram of 50 gasoline mileages is symmetrical and bell-shaped.
Sample selected from a normally distributed population.
Uses chi-square goodness-of-fit test to check normality.
Step 1: Hypotheses
- H0: The population of all mileages is normally distributed
- HA: The population of all mileages is NOT normally distributed.
Steps 3 - 4: Observed and Expected Frequencies
- where \bar{x} = 31.56, s = 0.7977. So, for example, P(X < 30.0) = P(\frac{X-\bar{x}}{S} < \frac{30-\bar{x}}{s} ) = P(Z < \frac{30.0-31.56}{0.7977} ) = P(Z < -1.96) \approx .0256
Assumptions and Conditions checked
Steps 3 - 4: Observed and Expected Frequencies (Collapsed Categories)
Step 6: Make a Decision
- Adjusted degrees of freedom for Chi-square is df = k - 1 - m = 5 - 1 - 2 = 2
- Decision: Do NOT reject H0 since \chi^2 = 0.30102 < \chi^2 = .05,df =2 = 5.99147.
- The population of mileages is normally distributed.

16.3 Examining the Residuals

Used when the null hypothesis is rejected, to discover which values are extraordinary or contribute most to the aggregate chi-square value.
Examine standardized residuals to compare cells with different counts: \frac{(oi - ei)}{\sqrt{e_i}}
Standardized residuals from goodness-of-fit tests are z-scores.

Example: Standardized Residuals for Technical Support Data

Largest value, Sunday, at −3.44, is impressive when viewed as a z-score.

16.4 The Chi-Square Test of Homogeneity or Independence (Contingency Table Analysis)

Categorical data summarized in a contingency table (cross-tab or pivot table).
Sample observations cross-classified according to two or more identifiable characteristics.
Used to determine independence of characteristics of interest and to test for homogeneity.
Testing for homogeneity is almost the same as testing for independence.

Contingency Table

Classifies sample observations according to two or more identifiable characteristics.
Also called a cross-tabulation or pivot table.
Referred to by row (r) and column (c) dimension, r \times c.

16.4 Hypothesis Tests: Chi-Square Test of Homogeneity or Independence

Step 1: Hypotheses
Using the Word “Independence”:
- H0: Variable 1 is independent of Variable 2.
- HA: Variable 1 is NOT independent of Variable 2.
Using the Word “Association”:
- H0: Variable 1 is NOT associated with Variable 2.
- HA: Variable 1 is associated with Variable 2.
Step 2: Decision Zone and Type of Test:
- \chi^2 is the lower-case Greek letter Chi.
- Right-tailed Test (0 < \chi^2 < \infty).
- Reject H0 if \chi^2 > \chi^2{cv} where \chi^2{cv} is the critical value from chi-square distribution with (r − 1)(c − 1) degrees of freedom.
Step 3 and 4: Observed and Expected Frequencies
- Observed Frequencies: Obs(o{ij}) (Property: \sum{i=1}^{r} \sum{j=1}^{c} o{ij} = n)
- Expected Frequencies: Exp(e_{ij}) = \frac{i^{th} \text{row total } \times j^{th} \text{column total}}{\text{grand total or sample size}}
- Property: \sum{i=1}^{r} \sum{j=1}^{c} o{ij} = \sum{i=1}^{r} \sum{j=1}^{c} e{ij} = n
- e{ij} = n p{ij} = n \times pi \times pj
Step 5: Test Statistic
- \chi^2 = \sum{i=1}^{r} \sum{j=1}^{c} \frac{(o{ij} - e{ij})^2}{e_{ij}}
Step 6: Make a Decision (Critical Value Approach)
- If test statistic (\chi^2) is located in the critical region, the null hypothesis is rejected.
- If the test statistic (\chi^2) is not located in the critical region, the researcher fails to reject the null hypothesis.
- Rejection Region: Reject H0 if \chi^2 > \chi^2_{cv} (Right-tailed Test).
Step 6: Make a Decision (p-value Approach)
- p-value: The p-value is the probability of seeing the observed data (or more extreme) given the null hypothesis is true.
- Calculation of p-value: P(\chi^2 > \chi^2_0) (Right-tailed Test).
- Decision Rule:
  - If p-value < \alpha, reject H0.
  - If p-value \geq \alpha, do not reject H0.
Assumptions and Conditions:
- Counted Data Condition: The data must be counts for the categories of a categorical variable.
- Independence Assumption: The counts should be independent of each other. Think about whether this is reasonable.
- Randomization Condition: The counted individuals should be a random sample of the population. Guard against auto-correlated samples.
- Sample Size Assumption: We must have enough data for the methods to work.
- Expected Cell Frequency Condition: Expected cell frequency (e{ij}) in each cell or category should be at least 5. If the expected frequencies (e{ij}) are less than 5, adjacent cells need to be meaningfully collapsed until benchmark (e_{ij} \geq 5) is achieved and df should be adjusted accordingly.

Example 1:

Problem: Is sex of yearbook editor independent of college’s funding source?
Step 1: Hypotheses
- Using the Word “Independence”:
  - H0: Sex of yearbook editor is independent of the college’s funding source.
  - HA: Sex of yearbook editor is NOT independent of college’s funding source.
- Using the Word “Association”:
  - H0: Sex of yearbook editor is NOT associated with college’s funding source.
  - HA: Sex of yearbook editor is associated with college’s funding source.
Steps 3 and 4: Observed and Expected Frequencies
Assumptions and Conditions
Steps 5 and 6: Test Statistic and Make a Decision (CV Approach)
Step 6: Make a Decision (p-value Approach)
- Decision: Since p-value < 0.005 < α = .05, we may reject the null hypothesis based on sample evidence and conclude that the sex and source of funding are not independent.