Ch.1 Stats Vocab

41 Terms

1

categorical variable

assigns labels that place each individual into a particular group, called a category

  • cannot take the average of the values

  • distribution description is one-one

2

quantitative variable

takes number values that are counts or measurements

  • can take the average of the values

  • distribution description is SOCS (shape, outliers, center, spread)

3

two-way table

table of counts that summarizes data on the relationship between two categorical variables for some group of individuals

  • try to include the total counts of the rows and columns

4

marginal relative frequency

gives the percent/proportion of individuals that have a specific value for one categorical variable

  • only describes ONE of the variables in the two-way table

  • row or column total ÷ grand total

5

joint relative frequency

gives the percent or proportion of individuals that have a specific value for one categorical variable and a specific value for ANOTHER categorical variable

  • answers information about BOTH variables in the two-way table

  • AND!

  • one cell ÷ grand total

6

conditional relative frequency

gives the proportion of individuals that have a specific value for one categorical variable among individuals who share the same value of another categorical variable

  • proportion of individuals in a certain category within one fixed group (a single row or column)

  • one cell ÷ row or column total
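
The three relative frequencies above differ only in what is divided by what. A minimal Python sketch, assuming a made-up two-way table (the grade levels, snack names, and counts are hypothetical and not from the set):

```python
# Hypothetical two-way table of counts: rows = grade level, columns = preferred snack.
table = {
    "junior": {"chips": 20, "fruit": 30},
    "senior": {"chips": 40, "fruit": 10},
}

# Grand total of all counts in the table (100 here).
grand_total = sum(sum(row.values()) for row in table.values())

def marginal_row(row):
    """Row total / grand total: describes ONE variable."""
    return sum(table[row].values()) / grand_total

def joint(row, col):
    """One cell / grand total: describes BOTH variables (row AND column)."""
    return table[row][col] / grand_total

def conditional_given_row(row, col):
    """One cell / row total: proportion in `col` among individuals in `row`."""
    return table[row][col] / sum(table[row].values())

print(marginal_row("senior"))                    # 50 / 100
print(joint("senior", "chips"))                  # 40 / 100
print(conditional_given_row("senior", "chips"))  # 40 / 50
```

The same cell count (40) gives a joint relative frequency of 0.4 but a conditional relative frequency of 0.8, because only the denominator changes.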

7

side-by-side bar graph

displays the distribution of a categorical variable for each value of another categorical variable

  • bars are grouped by the values of one categorical variable and placed side by side

  • several bars in one category

8

segmented/ribbon bar graph

displays the distribution of a categorical variable as segments of a rectangle, with the area of each segment proportional to the percent of individuals in the corresponding category

  • one rectangle divided up per category

  • separate bars

9

mosaic plot

a modified segmented bar graph in which the width of each rectangle is proportional to the number of individuals in the corresponding category

  • one rectangle divided up per category

  • CONNECTED

  • width of each rectangle is proportional to the number of individuals in that category

10

categorical graphs

  • side-by-side bar graphs

  • mosaic plots

  • segmented/ribbon bar graphs

  • pie charts

11

association

when knowing the value of one variable can help predict the value of the other

  • DOES NOT MEAN CAUSATION

12

no association

when knowing the value of one variable does NOT help predict the value of the other

13

dotplot

shows each data value as a dot above its location on a number line

14

full points for graph

  • title

  • correct graph

  • labeled axes

  • if necessary: subheadings and key

15

stemplot

shows each data value separated into two parts

  1. stem: all digits except the final digit (ex: 10s or 100s)

  2. leaf: the final digit (ex: 1s or 0.1s)

  • kind of like a dot plot on its side + grouped into 10s or 5s
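
The stem/leaf split described above can be sketched in Python. This is only an illustration that assumes non-negative whole numbers with the ones digit as the leaf; the function name `stemplot` and the data values are made up:

```python
from collections import defaultdict

def stemplot(values):
    """Return stemplot lines for non-negative whole numbers.

    Stem = all digits except the last one; leaf = the last (ones) digit.
    """
    leaves = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        leaves[stem].append(leaf)
    lines = []
    # List every stem from smallest to largest, even empty ones, so gaps stay visible.
    for stem in range(min(leaves), max(leaves) + 1):
        row = " ".join(str(leaf) for leaf in leaves.get(stem, []))
        lines.append(f"{stem} | {row}".rstrip())
    return lines

data = [12, 15, 21, 21, 28, 43]  # made-up values
for line in stemplot(data):
    print(line)
print("Key: 2 | 1 = 21")
```

Keeping the empty stem (3 in this data) preserves the gap in the distribution, and the key tells the reader how to put a stem and leaf back together.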

16

splitting stems

splitting each stem into two rows (leaves 0-4 and leaves 5-9); useful when the data is tightly clustered on only a few stems

17

back-to-back stemplot

a stemplot that compares two distributions of the same quantitative variable, with leaves on either side of a shared set of stems

  • requires two keys and subheadings

18

describe the distribution

SOCS in CONTEXT

  • use variable name, not just units

  • write in complete sentences

  • start with capital and end with period

  • NO bullets

19

shape

  • unimodal/bimodal/etc

  • skewed left/skewed right/symmetric

    • use QUALIFIERS (ex: "roughly" symmetric, "slightly" skewed)

    • the skew direction is the side with the TAIL

20

outlier

  • can estimate from a graph, but if possible check EXACTLY with the 1.5 × IQR rule

21

High-End Outlier

any value greater than Q3 + 1.5 × IQR

22

Low-End Outlier

any value less than Q1 - 1.5 × IQR
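
The two outlier cutoffs can be checked with a short Python sketch. The quartiles are passed in as given numbers, and the function names and data are hypothetical:

```python
def outlier_fences(q1, q3):
    """Return (low cutoff, high cutoff) from the 1.5 x IQR rule."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

def find_outliers(values, q1, q3):
    """Return the values that fall below the low cutoff or above the high cutoff."""
    low, high = outlier_fences(q1, q3)
    return [v for v in values if v < low or v > high]

# Made-up example: Q1 = 10 and Q3 = 20, so IQR = 10.
print(outlier_fences(10, 20))                     # (-5.0, 35.0)
print(find_outliers([2, 9, 14, 22, 40], 10, 20))  # [40]
```

A value sitting exactly on a cutoff is not flagged here, since the rule asks for values beyond the cutoffs.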

23

unimodal

one peak

24

bimodal

two distinct peaks (clusters)

25

uniform

frequency about the same for all values

26

mean

the average of all the individual data values

  • better for more symmetric data

  • NOT resistant

27

resistant

not sensitive to extreme values

28

median

midpoint of a distribution

  • resistant
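
To see what "resistant" means for the mean and median, here is a small Python sketch using the standard `statistics` module. The two data lists are made up; the second swaps one value for an extreme one:

```python
from statistics import mean, median

data = [2, 3, 4, 5, 6]           # made-up, roughly symmetric values
with_extreme = [2, 3, 4, 5, 60]  # same list with one extreme high value

print(mean(data), median(data))                  # 4 4
print(mean(with_extreme), median(with_extreme))  # 14.8 4
```

One extreme value pulls the mean from 4 up to 14.8, while the median stays at 4. The extreme value is on the high side, so the mean ends up greater than the median, as in a right-skewed distribution.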

29

symmetric

left and right sides of the graph are roughly mirror images of each other

  • mean = median (approximately)

30

skewed left

tail end is at the left

  • mean < median

31

skewed right

tail end is at the right

  • mean > median

32

range

distance between minimum and maximum values

  • NOT resistant measure of variability

33

standard deviation

the typical distance of the values in a distribution from the mean

  • NOT resistant
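
A rough Python sketch of the calculation behind standard deviation. The card does not say which formula is used, so this assumes the sample version (dividing by n - 1), which matches `statistics.stdev` in the standard library; the data values are made up:

```python
from math import sqrt
from statistics import stdev

def sample_sd(values):
    """Sample standard deviation, assuming the n - 1 divisor."""
    n = len(values)
    m = sum(values) / n                                # the mean
    squared_devs = [(x - m) ** 2 for x in values]      # squared distances from the mean
    return sqrt(sum(squared_devs) / (n - 1))

data = [2, 4, 4, 6, 9]  # mean is 5; squared deviations add up to 28
print(sample_sd(data))  # square root of 28 / 4 = 7
print(stdev(data))      # library version, for comparison
```

Because every deviation is squared, a single extreme value contributes a very large term, which is why standard deviation is not resistant.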

34

quartiles

values that divide the ordered data into four groups of roughly equal size

35

first quartile (Q1)

median of the data values to the LEFT of (LESS THAN) the median in the ordered list

36

third quartile (Q3)

median of the data values to the RIGHT of (GREATER THAN) the median in the ordered list

37

interquartile range (IQR)

distance between first and third quartiles

  • Q3 - Q1

  • resistant to extreme values, unlike the range
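
The quartile and IQR definitions can be sketched in Python using the "median of each half" approach from the cards. The function name `quartiles` and the data are made up, and library quantile functions may follow other conventions and give slightly different values:

```python
from statistics import median

def quartiles(values):
    """Return (Q1, median, Q3) using the median of each half of the ordered data."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]          # values left of the median
    upper = ordered[half + n % 2:]  # values right of the median (skips the median itself when n is odd)
    return median(lower), median(ordered), median(upper)

data = [1, 3, 4, 6, 7, 9, 12]  # made-up values
q1, med, q3 = quartiles(data)
iqr = q3 - q1
print(q1, med, q3, iqr)  # 3 6 9 6
```

Replacing 12 with 120 in this data leaves Q1, Q3, and the IQR unchanged, while the range would jump from 11 to 119.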

38

mode

the data value that occurs most often

39

histogram

groups values into intervals (bins) and draws one bar per interval

  • bin width = size of interval

  • heights = frequency/relative frequency

    • use relative frequency when comparing distributions of different total counts

  • good for large data sets

  • individual data values are not shown

40

how many bins

  1. count data points

  2. take the square root of the number of data points

  3. round up
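
The three steps above can be written as a short Python sketch. The function names are made up, and the bin width line is an extra assumption (equal-width bins across the whole range), not something the card states:

```python
from math import ceil, sqrt

def bin_count(values):
    """Count the data points, take the square root, and round up."""
    return ceil(sqrt(len(values)))

def bin_width(values):
    """Assumed equal-width bins: range of the data divided by the number of bins."""
    return (max(values) - min(values)) / bin_count(values)

data = list(range(30))  # 30 made-up data points: 0, 1, ..., 29
print(bin_count(data))  # square root of 30 is about 5.48, so round up to 6
print(bin_width(data))  # (29 - 0) / 6
```

This is a rule of thumb for a starting point; a different number of bins may show the shape of a particular data set more clearly.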

41

quantitative graphs

  • dotplots

  • histograms

  • stemplots