Chapter 8: comparing more than two proportions

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/22

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

What kind of data leads to comparing multiple proportions?

Categorical data with more than two categories (e.g., color, class, opinion level). This is called a multinomial experiment.

2
New cards

How is a multinomial experiment different from a binomial one?

Binomial: Only two outcomes (e.g., success/failure)

Multinomial: More than two categories (e.g., ice cream flavors)

3
New cards

How is the p-value calculated in a multinomial simulation test?

It is the sum of probabilities of all possible outcomes as extreme (or more) as the observed one under the null model.

4
New cards

What is the null hypothesis for comparing proportions across categories?

• H0: The population proportions are equal to the historical values or assumed model.

• Ha: At least one population proportion differs.

5
New cards

What is the interpretation when p > alpha?

 Fail to reject H0: There is not enough evidence to conclude that the proportions differ.

6
New cards

What is Fisher’s Exact Test and when is it used?

It is a simulation-based test used for 2×2 tables, based on a multivariate hypergeometric distribution.

7
New cards

Why is Fisher’s Exact Test helpful in small samples?

It computes exact p-values when expected counts are too small for a chi-square approximation.

8
New cards

What does the chi-square goodness-of-fit test assess?

Whether observed categorical frequencies match expected frequencies from a theoretical distribution.

9
New cards

Chi-Square Distribution

knowt flashcard image
10
New cards

What is the test statistic for the chi-square goodness-of-fit test?

knowt flashcard image
11
New cards

What are the assumptions (validity conditions) for the chi-square test?

knowt flashcard image
12
New cards

How many degrees of freedom are used in the goodness-of-fit test?

The number of degrees of freedom is (𝒌 − 𝟏), where 𝒌 = {# 𝒐𝒇 𝒄𝒐𝒍𝒖𝒎𝒏𝒔/𝒄𝒂𝒕𝒆𝒈𝒐𝒓𝒊𝒆𝒔}.

13
New cards

What does a large chi-square value mean?

Large deviations between observed and expected counts → likely to reject H0.

14
New cards

When do we reject H_0 in the goodness-of-fit test?

If the p-value < significance level a , or if X² > critical value from table.

15
New cards

What is a contingency table?

 A table that displays the frequency distribution of variables grouped by two categorical variables.

16
New cards

What does the chi-square test for independence assess?

 Whether two categorical variables are associated (dependent) or not (independent).

17
New cards

What are the hypotheses for the test of independence?

• H_0: The variables are independent (no association)

• H_a: The variables are dependent (associated)

18
New cards

What is the test statistic formula for independence?

knowt flashcard image
19
New cards

How are degrees of freedom calculated for a contingency table?

knowt flashcard image
20
New cards

What do we do if validity conditions for the chi-square test aren’t met?

1. Combine rows/columns

2. Remove categories

3. Increase sample size

4. Use a simulation-based method

21
New cards

 What is Simpson’s Paradox?

A statistical phenomenon where a trend appears in several groups but reverses or disappears when the groups are combined.

22
New cards

Does association imply causation?

No. Even with significant association from a chi-square test, we cannot conclude causality without proper experimental design.

23
New cards

 Why must we be cautious when collapsing categories in a contingency table?

Collapsing improperly can hide or distort real effects, especially in sensitive or political data (e.g., discrimination cases).