Ch. 2.2 Exploring the Relationship Between Two Categorical Variables

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/23

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

24 Terms

1
New cards

What do you use to summarize the relationship between two categorical variables?

a contingency table

2
New cards

What do we use to describe the joint behaviour of both categorical variables?

contingency table

3
New cards

What is a contingency table?

It’s a two-way table listing frequencies for each combination of the two categorical variables.

4
New cards
<p>What kind of table would this be considered as?</p>

What kind of table would this be considered as?

a contingency table

5
New cards

What can a contingency table also report?

the relative frequencies as %

6
New cards

What would you use when you want to describe the joint distribution of the categorical variables?

a contingency table that reports relative frequencies as %

7
New cards
<p>In this image, adding all the grey cells would give you what % value?</p>

In this image, adding all the grey cells would give you what % value?

100%

8
New cards
<p>Solve the question in the image.</p>

Solve the question in the image.

6.3%

9
New cards

What gives distributions of each categorical variable separately in contingency table?

margins

10
New cards
<p>What kind of distribution is shown in the grey cells found in the contingency table?</p>

What kind of distribution is shown in the grey cells found in the contingency table?

joint distribution - as adding all the cells makes 100%

11
New cards
<p>What kind of distibution do the pink cells display in this contingency table?</p>

What kind of distibution do the pink cells display in this contingency table?

marginal distribution of survival & marginal distribution of ticket class

12
New cards
<p>Answer the following questions.</p>

Answer the following questions.

  1. 23.33% (280/1200 × 100%)

  2. 58.33% (700/1200 × 100%)

13
New cards

What is a conditional distribution?

It’s a distribution seen in a contingency table where you have a fixed value of one variable & look at the distribution of the other for that value only.

14
New cards

Fill in the blanks. 

Often need the distribution of ____________ for a particular value of the other.

one variable

15
New cards
<p>Looking at this table, Fill in the blanks.</p><p>The table shows a conditional distribution of ______________ conditional on _____=_________.</p>

Looking at this table, Fill in the blanks.

The table shows a conditional distribution of ______________ conditional on _____=_________.

decision, gender = male (row%)

16
New cards

True or False. 

Conditional distribution can condition only on one variable of contingency table.

False. 

17
New cards
<p>Looking at this table, Fill in the blanks.</p><p>The table shows a conditional distribution of ______________ for &nbsp;_____=_________.</p>

Looking at this table, Fill in the blanks.

The table shows a conditional distribution of ______________ for  _____=_________.

gender, decision = accepted/rejected (column %)

18
New cards

Fill in the blanks.

If conditional distributions of one variable are the same for every value of the other, we say the two variables are _____________.

independent

19
New cards

What do we use to compare conditional distributions visually?

side-by-side bar plot

20
New cards
<p>What do you call this type of plot in this image?</p>

What do you call this type of plot in this image?

side-by-side bar plot

21
New cards
<p>Answer the following questions provided in the image: Using 1. …. answer … format</p>

Answer the following questions provided in the image: Using 1. …. answer … format

  1. 309/2455 × 100% = 12.6%

  2. Never : 996/4888 × 100% = 20.4% ,                            Rarely : 2455/4888 × 100% = 50.2%                                       Sometimes : 1437/4888 × 100% = 29.4%

22
New cards
<p>Answer the following questions provided in the image: Using 1. …. answer … format</p>

Answer the following questions provided in the image: Using 1. …. answer … format

knowt flashcard image
23
New cards

What is Simpson’s Paradox?

It's when a conclusion you get from looking at the total data is the exact opposite of the conclusion you get when you look at the data broken down into separate groups.

The reversal happens because you're ignoring a hidden factor that is unevenly spread across those groups. When you combine everything, that hidden factor makes the overall result misleading.

24
New cards
<p>What type of distribution is shown in the image?</p>

What type of distribution is shown in the image?

conditional distribution