stat dc - ch 1: the science and art of data

5.0(1)
studied byStudied by 15 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/33

flashcard set

Earn XP

Description and Tags

Statistics

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

34 Terms

1
New cards

Statistics

The science and art of collecting, analyzing, and drawing conclusions from data.

2
New cards

Individual

An object described in a set of data. Individuals can be people, animals, or things

3
New cards

Variable

An attribute that can take different values for different individuals.

4
New cards

Categorical Variable

Assigns labels that place each individual into a particular group, called a category.

5
New cards

Quantitative Variable

Takes a number of values that are quantities - counts or measurements.

6
New cards

Distribution

Distribution of a variable tells us what values the variable takes and how often it takes those values.

7
New cards

Frequency Table

shows the number of individuals having each value

8
New cards

relative frequency

the proportion or percent of individuals having each value

9
New cards

bar graph

shows each category as a bar. The heights of the bars show the category frequencies or relative frequencies

10
New cards

pie chart

a circular chart divided into triangular areas proportional to the percentages of the whole

11
New cards

two-way table

counts that summarizes data on the relationship between two categorical variables for some group of individuals

12
New cards

marginal relative frequency

the percent or proportion of individuals that have a specific value for one categorical variable

13
New cards

joint relative frequency

the percent or proportion of individuals that have a specific value for one categorical variable and a specific value for another categorical variable

14
New cards

conditional relative frequency

gives the percent or proportion of individuals that have a specific value for one categorical variable among individuals who share the same value of another categorical variable

15
New cards

association

Between two variables if specific values of one variable tends to occur in common with specific values of the other

16
New cards

dotplot

simple graph that shows each data value as a dot above its location on a number line

17
New cards

stemplot

simple graphical display for fairly small data sets that gives a quick picture of the shape of distribution while including the actual numerical values in the graph; each observation is separated into a stem and a leaf

18
New cards

histogram

a bar chart representing a frequency distribution; heights of the bars represent observed frequencies

19
New cards

shape

main features of the graph including major peaks, clusters, gaps, outliers, and symmetry or skewness

20
New cards

center

middle of data (mean/median usually)

21
New cards

variability

how spread out a set of data is

22
New cards

outliers

individual value that falls outside the overall pattern of distribution

23
New cards

symmetric

a graph in which the right and left sides are approximately mirror images of each other

24
New cards

skewed to the right

smaller values going towards the right

25
New cards

skewed to the left

smaller values going towards the left

26
New cards

mean

the average of all the individual data value

27
New cards

median

the midpoint of a distribution, the number such that about half of the observations are smaller and about half are larger

28
New cards

range

the distance between the minimum value and the maximum value of a distribution

29
New cards

standard deviation

measures the typical distance of the values in a distribution from the mean

30
New cards

interquartile range (IQR)

Q3-Q1=IQR

31
New cards

five-number summary

min, max, Q1, Q3, median

32
New cards

box plot

visual representation of the five-number summary

33
New cards

interference

make decisions or predictions based on the data

34
New cards

quartiles

divide the ordered data set into four groups having roughly the same number of values