1.2 Working with Categorical Data

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/17

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

18 Terms

1
New cards

simulation

a random reenactment of data collection under one or more assumptions - if real data looks significantly different from simulated data, then said assumptions are questioned

2
New cards

distribution

each of a variable gives the possible values of the variable and the relative frequency of each value

3
New cards

relative frequency table

a table that lists the categories of a categorical variable and gives the relative frequency of each category

4
New cards

side-by-side bar chart

a bar chart that groups the bars of two or more conditional distributions side by side

5
New cards

relative frequency

the ratio of the number of observations in a category to the total number of observations - represented as a fraction or percentage

6
New cards

pie chart

a circle sliced into pieces whose sizes are proportional to the fraction of the whole in each category

7
New cards

categorical data condition

the conditions that data uses to create frequency tables, relative frequency tables, bar charts, pie charts, contingency tables, segmented bar charts, side-by-side bar charts, and mosaic plots must be non-overlapping categorical data

8
New cards

area principle

the area occupied by a part of a graph should correspond to the amount of data it represents

9
New cards

categorical variable

a variable that names categories using words or numbers

10
New cards

bar chart

a graph consisting of bars whose area represents the count of observations for each category of a categorical variable, always with gaps separating the bars

11
New cards

mosaic plot

a special kind of segmented bar chart whose bars’ widths display the marginal distribution of the variable represented by the bars

12
New cards

simpson’s paradox

averages that are taken across different groups, which can appear to contradict the overall averages

13
New cards

marginal distribution

the distribution of values for individual variables in a contingency table

14
New cards

contingency table

a table showing counts and sometimes percentages of individuals falling into named categories on two or more variables, used to possibly reveal variables dependent on one another

15
New cards

conditional distribution

the distribution of a variable restricting the Who in order to consider only a smaller group of individuals

16
New cards

frequency table

a table that lists the categories of a categorical variable and gives the number of observations in each category

17
New cards

segmented bar chart

a bar chart whose bars are stacked on top of one another in a vertical graph or are lined up side by side in a horizontal graph

18
New cards

independence

variables are said to be if the conditional distribution of one variable is the same for all categories of another variable - if variables are to each other, there is no association between them