Lesson 2: Working with Categorical Data

0.0(0)
studied byStudied by 0 people
0.0(0)
call with kaiCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/26

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 5:18 PM on 2/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

27 Terms

1
New cards

side-by-side bar chart

bar chart that groups the bars of two or more conditional distributions side by side

2
New cards

contingency table

table showing counts and sometimes percentages of individuals falling into named categories on two or more variables

3
New cards

contingency tables are used to…

reveal possible patterns in one variable that may be contingent on the category of the other

4
New cards

categorical variable

variable that names categories

5
New cards

categories can be named using…

words or numbers

6
New cards

mosaic plot

special kind of segmented bar chart whose bars’ widths display the marginal distribution of the variable represented by the bars

7
New cards

Simpson’s Paradox

averages that are taken across different groups can appear to contradict the overall averages

8
New cards

the distribution of a variable gives…

the possible values of the variable and the relative frequency of each value

9
New cards

conditional distribution

distribution of a variable restricting the who in order to consider only a smaller group of individuals

10
New cards

categorical data condition

condition that data used to create frequency tables, relative frequency tables, bar charts, pie charts, contingency tables, segmented bar charts, side-by-side bar charts, and mosaic plots must be non-overlapping categorical data

11
New cards

frequency table

table that lists the categories of a categorical variable and gives the number of observations in each category

12
New cards

simulation

random re-enactment of data collection under one or more assumptions

13
New cards

if real data look very different from simulated data, then…

the assumptions are called into question

14
New cards

variables are said to be independent if…

the conditional distribution of one variable is the same for all categories of another variable

15
New cards

if the variables are independent…

there is no association between them

16
New cards

if the variables are not independent…

there is an association between the variables

17
New cards

bar chart

graph consisting of bars whose area represents the count of observations for each category of a categorical variable

18
New cards

a bar chart should always have…

gaps separating the bars

19
New cards

relative frequency table

table that lists the categories of a categorical variable and gives the relative frequency of each category

20
New cards

area principle

principle which states that the area occupied by a part of a graph should correspond to the amount of data it represents

21
New cards

pie chart

circle sliced into pieces whose sizes are proportional to the fraction of the whole in each category

22
New cards

relative frequency

ratio of the number of observations in a category to the total number of observations

23
New cards

a relative frequency is represented as…

a fraction or a percentage

24
New cards

segmented bar chart

bar chart whose bars are stacked on top of one another in a vertical graph or are lined up side by side in a horizontal graph

25
New cards

a segmented bar chart usually shows…

relative frequencies so that the distribution of the categorical variable can be more easily compared between different groups

26
New cards

marginal distribution

distribution of values for individual variables in a contingency table

27
New cards

for a two-way table, the counts or percentages are the…

totals found in the margins (last row of the column) of the table