11. Chi-Square Statistics

At the end of this topic, students should be able to:
- Learn the basic logic behind hypothesis testing using chi-square statistic with nominal data.
- Show competency in calculating, analyzing, and interpreting results from a chi-square statistic.
- Recognize when to use and when not to use chi-square statistic.

Parametric Test
- A statistical test that makes assumptions about the population parameters and the distributions of the data (e.g., normality and homogeneity of variance).
Nonparametric Test
- Also known as distribution-free tests or rank-order tests.
- Statistical tests that do not assume anything about population parameters.

Utilizes sample data to test hypotheses about population proportions.
Determines how well the obtained sample proportions fit population proportions specified by the null hypothesis.
Formula:
[ \chi^2 = \sum \frac{(f_o - f_e)^2}{f_e} ]

Color Association and Hunger
- Participants choose colors associated with hunger: Red, Yellow, Green, Blue.
- Observed frequencies:
  - Red: 19
  - Yellow: 16
  - Green: 10
  - Blue: 5

Observed Frequency (f_o)
- Actual number of individuals found in a study category.
Expected Frequency (f_e)
- Number expected in a category if the null hypothesis is true.

Mathematically defined curve used as the comparison distribution in chi-square tests.
Reflects the distribution of the chi-square statistic.

Table of cutoff scores on the chi-square distribution for various degrees of freedom and significance levels.

Example report:
- Significant differences in color preference proportions.
- Conclusion: Certain colors more likely associated with hunger (e.g., Red: n = 19, Yellow: n = 16).

Examines if the distribution of frequencies over categories of one nominal variable is unrelated to the distribution of frequencies over another nominal variable.

Study on the relationship between personality (introvert/extrovert) and color preference (red/yellow/green/blue) among 200 students.

Two-dimensional chart showing frequencies in each combination of categories of two nominal variables.

Formula:
[ f_e = \frac{R \times n}{C} ]
- R = row total, C = column total, n = total respondents.

Formula:
[ df = (N_C - 1)(N_R - 1) ]
- Where N_C = number of columns, N_R = number of rows.

Phi Coefficient (φ)
- Effect-size measure for a 2x2 contingency table.
- Formula:
  [ \phi = \sqrt{\frac{\chi^2}{n}} ]
Cramer's Phi (φ)
- For contingency tables larger than 2x2.
- Formula:
  [ V = \frac{\chi^2}{n \cdot df} ]

Example report:
- Significant relationship between personality and color preference, with details of specific frequencies.

Procedures outlined for conducting goodness of fit tests using various statistical software including steps for setting up, running tests, and interpreting results.

Aron, A., Coups, E., & Aron, E. (2013). Statistics for psychology (6th ed.). Pearson Education Inc.
Goss-Sampson, M. A. (2024). Statistical Analysis in JASP 0.16.1: A Guide for Students.
Gravetter, F.J., Wallnau, L. B., & Forzano, L.B. (2020). Essentials of statistics for the behavioral science (9th ed.). Cengage Learning.
Navarro, D. J., & Foxcroft, D. R. (2019). Learning statistics with jamovi: A tutorial for psychology students and other beginners (Version 0.70).
Statistics How To (2018). Parametric statistics, tests and data.
Sharpe, Donald (2015). Your Chi-Square Test is Statistically Significant: Now What? Practical Assessment, Research & Evaluation.