Unit 1: Exploring Data

31 Terms

1

data

recorded information, with context

2

categorical variable

a variable with named categories (usually named with words)

3

quantitative variable

a variable that uses numerical values (with units)

4

contingency table or 2-way table

a table that displays counts (or percents) for individuals organized by two categorical variables

5

marginal distribution

in a contingency table, the distribution of just one of the variables, which can be seen in the margins (row or column totals) of the table

6

conditional distribution

in a contingency table, the distribution of one variable restricted to the individuals in a single category of the other variable
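
A minimal sketch of how marginal and conditional distributions can be read off a two-way table. The counts, the category names, and the variable names are all made up for illustration.

```python
# Hypothetical two-way table of counts: rows = class year, columns = preferred pet.
counts = {
    "junior": {"dog": 30, "cat": 20},
    "senior": {"dog": 10, "cat": 40},
}

# Grand total of all individuals in the table.
total = sum(sum(row.values()) for row in counts.values())

# Marginal distribution of class year: each row total divided by the grand total.
marginal_year = {year: sum(row.values()) / total for year, row in counts.items()}

# Conditional distribution of pet, restricted to juniors only:
# each junior count divided by the junior row total.
junior_total = sum(counts["junior"].values())
pet_given_junior = {pet: n / junior_total for pet, n in counts["junior"].items()}

print(marginal_year)
print(pet_given_junior)
```

The marginal distribution uses the whole table as its denominator, while the conditional distribution uses only the row (or column) it is restricted to.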

7

independence

two variables are independent when knowing something about one variable does not add any new information to what you know about another variable

8

association

two variables have an association when knowing something about one of the variables increases what you know about the second variable
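
One way to check for independence or association in a two-way table is to compare the conditional distributions across groups: if they all match, knowing the group tells you nothing new. The sketch below assumes made-up counts and hypothetical helper names. Real data rarely match exactly, so the exact-match test here is only an illustration.

```python
def conditional_distributions(table):
    """Convert each row's counts to proportions of that row's total."""
    result = {}
    for group, row in table.items():
        row_total = sum(row.values())
        result[group] = {cat: n / row_total for cat, n in row.items()}
    return result

def looks_independent(table, tol=1e-9):
    """True if every row has (nearly) the same conditional distribution."""
    dists = list(conditional_distributions(table).values())
    first = dists[0]
    return all(abs(d[cat] - first[cat]) <= tol for d in dists for cat in first)

# Hypothetical counts: both groups answer "yes" 40% of the time.
independent_table = {"a": {"yes": 20, "no": 30}, "b": {"yes": 40, "no": 60}}
# Hypothetical counts: group a answers "yes" 80% of the time, group b only 20%.
associated_table = {"a": {"yes": 40, "no": 10}, "b": {"yes": 10, "no": 40}}

print(looks_independent(independent_table))
print(looks_independent(associated_table))
```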

9

graphs for categorical variables

bar graphs and pie charts

10

area principle

the area taken up by each part of a graph should be proportional to the magnitude of the value it represents

11

graphs for quantitative variables

dotplot, stem & leaf, boxplot, histogram

12

shape descriptors (draw with your finger in the air as you say them)

[image not available in source]

13

Four things to describe about a quantitative distribution

center, shape, spread, unusual features (outliers)

14

2 measures of center

mean and median

15

3 measures of spread

range, IQR, and standard deviation

16

how to describe symmetrical data

mean and standard deviation

17

how to describe skewed data

median and IQR

18

upper fence formula

Q3 + 1.5*IQR

19

lower fence formula

Q1 - 1.5*IQR
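
The two fence formulas can be sketched as a small outlier check. This assumes quartiles are found as the medians of the lower and upper halves of the sorted data (software packages may use other quartile rules), and the data set and function names are made up for illustration.

```python
from statistics import median

def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves (middle value excluded if n is odd)."""
    s = sorted(values)
    half = len(s) // 2
    lower = s[:half]
    upper = s[half:] if len(s) % 2 == 0 else s[half + 1:]
    return median(lower), median(upper)

def fences(values):
    """Return (lower fence, upper fence) using the 1.5*IQR rule."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical data with one unusually large value.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]
low, high = fences(data)

# Any value outside the fences is flagged as an outlier.
outliers = [x for x in data if x < low or x > high]
print(low, high, outliers)
```

Here Q1 = 3 and Q3 = 8, so the IQR is 5, the fences are -4.5 and 15.5, and only 30 falls outside them.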

20

In a right-skewed distribution, the mean is ______ than the median.

greater
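
A quick illustration of why the mean sits above the median when the data are skewed right: the long right tail pulls the mean up, while the median stays with the bulk of the data. The data set is made up.

```python
from statistics import mean, median

# Hypothetical right-skewed data: most values are small, one is very large.
right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]

m = mean(right_skewed)      # pulled toward the large value in the tail
med = median(right_skewed)  # resistant to the large value

print(m, med)
```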

21

standard deviation is ....

the typical distance of the data values from the mean

22

IQR

Interquartile Range = Q3 - Q1

23

percentile

The nth percentile is the value that has n% of the data BELOW it.
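
A minimal sketch of finding the percentile of a value, using the "percent of data below" definition on this card. Some textbooks and software count values at or below instead, so treat this as one convention. The scores and the function name are made up.

```python
def percent_below(values, x):
    """Percent of the data values that are strictly below x."""
    return 100 * sum(v < x for v in values) / len(values)

# Hypothetical test scores.
scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]

# Six of the ten scores are below 85, so 85 is at the 60th percentile.
print(percent_below(scores, 85))
```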

24

variance

standard deviation squared
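
The relationship between variance and standard deviation can be checked with Python's `statistics` module. The data set is made up. `pvariance` and `pstdev` treat the data as an entire population (dividing by n), while `variance` treats the data as a sample (dividing by n - 1).

```python
from statistics import pstdev, pvariance, variance

# Hypothetical data with mean 5.
data = [2, 4, 4, 4, 5, 5, 7, 9]

pop_var = pvariance(data)   # population variance
pop_sd = pstdev(data)       # population standard deviation
samp_var = variance(data)   # sample variance (divides by n - 1)

# The variance is the standard deviation squared.
print(pop_var, pop_sd, pop_sd ** 2)
print(samp_var)
```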

25

z-score

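standardized score: the number of standard deviations a value lies above or below the mean, z = (value - mean) / standard deviation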
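
A z-score is found by subtracting the mean from a value and dividing by the standard deviation. A minimal sketch, with made-up numbers and a hypothetical function name:

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd

# Hypothetical example: a score of 85 on a test with mean 70 and SD 10.
print(z_score(85, 70, 10))  # 1.5 standard deviations above the mean
print(z_score(60, 70, 10))  # 1 standard deviation below the mean
```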

26

Standard normal distribution

The mean is 0 and the SD is 1.

27

Empirical rule

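68-95-99.7: in a normal distribution, about 68%, 95%, and 99.7% of the values fall within 1, 2, and 3 standard deviations of the mean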
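
The 68-95-99.7 figures are rounded areas under the standard normal curve. They can be checked with `statistics.NormalDist` (Python 3.8 or later). The helper name below is made up.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0, sigma=1)

def area_within(k):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

for k in (1, 2, 3):
    print(k, round(100 * area_within(k), 1))
```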

28

continuous variable

a variable (such as age or height) that can take any value within an interval, so it has infinitely many possible values

29

discrete variable

a variable that takes only specific, separate values, with no possible values in between (usually whole numbers)

30

The Greek letter we use for the mean of the entire population.

mu (μ)

31

The Greek letter we use for the standard deviation of the entire population.

sigma (σ)