Chapter 5: Association between Categorical Variables (Chi-Squared Test)

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/18

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering key terms and definitions from the Chi-Squared Test of Independence and contingency tables.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

19 Terms

1
New cards

Contingency table

A table displaying counts of observations for all combinations of two categorical variables; row totals and column totals are called marginal distributions.

2
New cards

Marginal distribution

The row totals or the column totals in a contingency table.

3
New cards

Conditional distribution

The distribution of one variable given a fixed category of the other variable; used to assess independence.

4
New cards

Statistical independence

The population conditional distributions on one variable are identical across all categories of the other variable; probabilities are the same across levels.

5
New cards

Statistical dependence

The conditional distributions differ across categories, indicating an association between the variables.

6
New cards

Null hypothesis (H0) of independence

The two categorical variables are statistically independent.

7
New cards

Alternative hypothesis (Ha) of dependence

The two categorical variables are statistically dependent.

8
New cards

Observed frequency f0

The count actually observed in a cell of the contingency table.

9
New cards

Expected frequency fe

The count expected in a cell if the variables were independent; fe = (row total × column total) / overall sample size.

10
New cards

Pearson chi-squared statistic

χ2 = sum over all cells of (f0 − fe)² / fe; measures discrepancy between observed and expected frequencies under H0.

11
New cards

Degrees of freedom

df = (r − 1)(c − 1) for an r × c contingency table.

12
New cards

P-value

The right-tail probability of observing a χ2 as large or larger than the observed value under H0.

13
New cards

Significance level (α)

The threshold for rejecting H0; if P ≤ α, reject H0.

14
New cards

Large-sample condition

For the chi-squared test, each cell should have fe > 5 to rely on the χ2 approximation.

15
New cards

Fisher’s exact test

An exact test used for small samples when the chi-squared approximation may be unreliable.

16
New cards

Randomization

Assumption that the sample is random, ensuring valid inference for the chi-squared test.

17
New cards

2×3 contingency table

A table with 2 rows and 3 columns used to illustrate cross-classification of two variables (e.g., gender and party ID).

18
New cards

Chi-squared test of independence

A test comparing observed frequencies to expected frequencies under independence to determine if two categorical variables are related.

19
New cards

Interpretation caution: association vs strength

A large χ2 indicates evidence of association but does not quantify the strength; examine conditional distributions for strength.