Module 7: Chi-Square

0.0(0)
studied byStudied by 1 person
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/59

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

60 Terms

1
New cards

What are the 2 types of chi-square tests, and their alternate names?

  • One-way chi square (Goodness of Fit)

  • Two-way chi square (Test of independence)

2
New cards

What is the goodness of fit test?

One-way chi square

3
New cards

What is the test of independence?

Two-way chi square

4
New cards

What type of data does do chi square analyse?

Categorical, frequency data.

Think: 10 Girls and 5 Boys

5
New cards

Chi square allows us to test research questions with…

Categorical data

6
New cards

How is categorical data scored?

Using frequencies

7
New cards

Frequency data

The number of sampling units in each classification.

Think: Eye colour breakdown in a class

8
New cards

What test would be used to test this RQ?

“Are people/things distributed evenly across the categories of the variable?”

One way chi square

9
New cards

What do chi-square analyses tell us?

  • The expected likelihood of something happening and comparing it with observed frequencies

    • Think: Did we actually get what we were expecting?

  • Tells you how well the expected and observed frequencies match

Think: In a cohort of psych students, we expect there to be 90 girls and 10 boys. In the 2024 cohort there are 78 girls and 15 boys. To what extent do these match? Do these results fit the expected results?

10
New cards

Chi-square formula

knowt flashcard image
11
New cards

What is the basic idea of the chi-square formula?

X2 will be small if differences between f0 and fe are small

<p>X2 will be small if differences between f0 and fe are small</p>
12
New cards

If 284 people were asked what their favourite chupa chup flavour is, what question might we ask that would make a one-way chi square test appropriate?

Are people distributed evenly across the levels of chupa chup flavour preference, or are some flavours more popular than others?

13
New cards

If 284 people were asked what their favourite chupa chup flavour is, what outcome would mean the null hypothesis of a one way chi-square test was accepted?

The preferences of flavours were evenly distributed across the levels, and there was no significant difference to expected values. There was no significant difference between flavour preference. All were equally preferred.

14
New cards

What is meant by equiprobable distribution in a chi-square test?

Uniformly distributed - no significant difference in frequencies across all levels of the construct

Think: There is no super popular/unpopular chupa chup flavour.

15
New cards

fo

Observed frequency

16
New cards

fe

Expected frequency

17
New cards
<p>Calculate X2</p>

Calculate X2

knowt flashcard image
18
New cards

When do you reject the null hypothesis in a chi square test?

When p < 0.05

The observed scores are significantly different from the expected scores (expected that there would be a equiprobable distribution)

19
New cards

X2 is a family of distributions dependent on..

degrees of freedom

20
New cards

How do you calculate df in a chi-square

df = k - 1

  • df is based on number of cells/categories, not number of participants (n)

21
New cards

What does a negative X2 statistic suggest?

Impossible - They are always positive

22
New cards

How can you increase the obtained chi-square?

Increase the sample size

23
New cards

What happens to the critical X2 value when you increase the sample size? Why?

It stays the same

  • Because df in chi-square are based on number of categories not the number of participants, so when you increase the sample size it has no impact on the df formula of k-1

  • Remember that df are used to find a critical value in a lookup table, and when you match them, the critical value is a cutoff to determine whether an observed value is significant or not.

<p>It stays the same</p><ul><li><p>Because df in chi-square are based on number of categories not the number of participants, so when you increase the sample size it has no impact on the df formula of k-1</p></li><li><p>Remember that df are used to find a critical value in a lookup table, and when you match them, the critical value is a cutoff to determine whether an observed value is significant or not. </p></li></ul>
24
New cards

Critical value

Value that marks .05 region under the null

<p>Value that marks .05 region under the null</p>
25
New cards

In studies using chi square, what do we only need the critical value of X2 for?

Degrees of freedom

  • The df we calculate is used to match up with a critical value in a table that is used as a cutoff.

26
New cards

If the obtained X2 value is larger than the critical X2 value in a one-way chi squared, what do we conclude?

There is a significant difference. The sample is not consistent with a population in which units are distributed evenly across categories.

Think: There is a clear most popular chupa chup.

27
New cards

If the obtained X2 value is smaller than the critical X2 value in a one-way chi squared, what do we conclude?

The null hypothesis is accepted and it is concluded that there is an equiprobable distribution across categories.

Think: People like the flavours of chupa chups equally (WRONG BECAUSE STRAWBERRY IS SUPERIOR)

28
New cards

If you were using a chi square test to demonstrate that you had a similar number of men and women in your study, would you want to accept or reject the null?

Accept - you want equal distribution

29
New cards

One-way chi square means..

Only 1 variable (can have multiple levels though)

30
New cards

Two-way chi square means

Has 2 variables (each can have multiple levels)

31
New cards

2 Alternate names for two-way chi square

  • Test of independence

  • Contingency chi square

32
New cards

What test would you use when the research question is asking if there is an association between 2 categorical variables?

Two-way chi square

33
New cards

In a two-way chi square, what does the null hypothesis state?

There is no association between the two variables.

34
New cards

How many sets of frequencies do you need for a two-way chi square test and what are they?

  1. The observed frequencies in your dataset

  2. The expected frequencies under the null

35
New cards

Contingency table

Table of observed frequencies across the 2 variables in a two-way chi square›

<p>Table of observed frequencies across the 2 variables in a two-way chi square›</p>
36
New cards

How do you calculate expected frequencies in a two-way chi square?

  1. Find the marginal totals by adding the rows and columns, and the grand total by finding the sum of all row and column totals.

  2. Multiply every column by every row and divide by the grand total.

  3. Put the answer in respective cells.

<ol><li><p>Find the marginal totals by adding the rows and columns, and the grand total by finding the sum of all row and column totals.</p></li><li><p>Multiply every column by every row and divide by the grand total.</p></li><li><p>Put the answer in respective cells.</p></li></ol>
37
New cards

How do you calculate chi-square in a two way chi square?

Input the observed and expected values into the formula for each cell and find the total.

<p>Input the observed and expected values into the formula for each cell and find the total. </p>
38
New cards

How do you calculate df in a two-way chi square?

df = (number of rows -1) x (number of columns -1)

39
New cards

What do we conclude when our observed X2 is greater than the critical X2 in a two way chi square?

The null hypothesis can be rejected as there is a significant association between the variables.

40
New cards

Which column in SPSS do we look under to find the significance of the two-way chi square?

Asymp. Sig. (2-sided)

41
New cards

What are the assumptions of chi square?

  1. Independence of observations

  2. Expected frequencies should be greater than 5

  3. Inclusion of non-occurrences

42
New cards

Are chi square analyses used for independent measures or repeated measures designs?

Independent because an assumption is independence of observations.

43
New cards

Expected frequencies in chi square should be greater than…

5

44
New cards

What does inclusion of non-occurences mean?

All the data must be included in the analysis

45
New cards

Limitations of chi square

  • Limited to one or two variables

  • Extremely sensitive to sample size

46
New cards

What happens if you double the sample size in a chi square?

The X2 value also doubles, but the critical value stays the same.

47
New cards

What is the null hypothesis for a two-way chi square?

The two variables are distributed independently (not associated)

  • They do not depend on each other

48
New cards

Two variables are considered to be independent when the frequency distribution for one variable has the __________ for all levels of the second variable.

same shape

Think: This shows that salary and gender are independent. If the null wasn’t supported or actually reflected real life, the male figure would be negatively skewed, and the female would be positively skewed.

Think: Independent→indifferent

<p>same shape</p><p>Think: This shows that salary and gender are independent. If the null wasn’t supported or actually reflected real life, the male figure would be negatively skewed, and the female would be positively skewed.</p><p>Think: Independent→indifferent</p>
49
New cards
<p>Are these variables independent or dependent?</p>

Are these variables independent or dependent?

Dependent - shapes are different

Think: Dependent→different

50
New cards

What is the model in a chi square?

The expected frequencies

51
New cards

If we only focus on the marginals in a two-way chi square , what perspective are we viewing the data from?

The null hypothesis because they are the frequencies not influenced by the other variables.

Think: In this case, 6 people failed the exam REGARDLESS of their attendance.

<p>The null hypothesis because they are the frequencies not influenced by the other variables.</p><p>Think: In this case, 6 people failed the exam REGARDLESS of their attendance.</p>
52
New cards

20 people took an exam and we measured their attendance to tutorials (perfect/not perfect) and their mark (pass/fail). Looking at the marginal scores, 70% passed and 30% failed regardless of their attendance.

If the null hypothesis was true (they are independent/not associated), what would the relative proportions of pass/fail scores be in each attendance condition?

  • 70% of those with perfect attendance would pass and 30% would fail

and

  • 30% of those with less than perfect attendance would pass and 30% would fail.

Think: The shape of the frequency distributions would be the same if they were independent.

53
New cards

Why should expected frequencies be greater than 5?

Because small expected frequencies produce few possible values of chi-square obtained, but we compare to a continuous distribution

54
New cards

Describe what inclusion of non-occurances means

If you have a variable with multiple levels, you need to include data for all levels.

Think: If you were comparing girls and boys marks on a test, where the results were pass or fail, you can’t just see if there is a significant gender difference for those who passed. You also need to include the ones who failed because otherwise it doesn’t tell the whole story and can influence the significance of the chi square.

<p>If you have a variable with multiple levels, you need to include data for all levels.</p><p>Think: If you were comparing girls and boys marks on a test, where the results were pass or fail, you can’t just see if there is a significant gender difference for those who passed. You also need to include the ones who failed because otherwise it doesn’t tell the whole story and can influence the significance of the chi square.</p>
55
New cards

True or False: A significant X2 does not indicate the strength of the relationship

True

56
New cards

What is Phi?

Denotes the strength of the correlation or association between two dichotomous variables (2×2) in a chi square

  • Interpreted same as Pearson’s r

  • If chi square is significant, so is chi

57
New cards

Is Phi a measure of effect size?

No - but it is interpreted with the same conventions as Cohen’s D

I.e. phi = 0.50, there is a moderate level of association between the two variables.

58
New cards

How can you get % variance explained?

Phi squared

59
New cards

What has more statistical power when assumptions are met: Pearson’s r or phi?

Pearson’s r because it is parametric

60
New cards

What is Cramer’s Phi?

Generalisation of phi to (row)x(column) tables

  • Phi can only be used for 2×2 chi squares but Cramer’s Phi can be used for more

  • Interpret the same as phi

Think: You’re cramming more in