AP STATS Chapters 1-3

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/70

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

71 Terms

1
New cards

stem & leaf plot

a graph of a distribution of quantitative data in which all but the final digits of data values (written in numerical order) form a column called a stem and the final digit of each data value is written in increasing order outward from the column to form leaves

2
New cards

back-to-back stem & leaf plot

a stem-and-leaf plot or stemplot that is used to compare distributions of quantitative variables for two data sets

3
New cards

dot plot

a graph of a distribution of quantitative data in which each data value is shown as a dot above its location on a number line

4
New cards

histogram

a graph of a distribution of quantitative data in which nearby values are grouped together in what are often called "bins" or "classes"

5
New cards

sample mean

the average of a subset of the population, denoted by x-bar: x̄

6
New cards

population mean

the average of a population, denoted by the Greek letter mu: μ

7
New cards

resistant

a statistic that is not influenced (or influenced very little) by extreme observations

8
New cards

unimodal

describes the shape of a distribution whose graph has one major peak

9
New cards

bimodal

describes the shape of a distribution whose graph has two major peaks

10
New cards

symmetric

describes the shape of a distribution whose graph has roughly mirror images on the left and right sides

11
New cards

skewed

describes the shape of a distribution whose graph has one side that is much longer than the other (name the direction of this based upon where the longer end/tail is located)

12
New cards

quartiles

One quarter of the data values are smaller than the first quartile (Q1) and three quarters of the data values are smaller than the third quartile (Q3)

13
New cards

interquartile range (IQR)

The range for the middle 50% of the data values-- the range between the quartiles (Q3-Q1)

14
New cards

1.5 IQR Rule

used for identifying outliers: any values that are more than 1.5 times the IQR lower than the first quartile or higher than the third quartile are called outliers

Q1 - (1.5 x IQR) 

Q3 + (1.5 x IQR)

15
New cards

5 number summary

for a given data set: the minimum value, the first quartile, the median, the third quartile, and the maximum value (identified in this particular order)

16
New cards

standard deviation

measures the typical distance of values in a distribution from the mean, usually denoted s for a sample or lower case Greek sigma, σ, for a population

17
New cards

variance

the square of the standard deviation (σ^2)

18
New cards

categorical variable

places individuals into one of several categories

19
New cards

quantitative variable

takes numerical values for which it makes sense to find an average

20
New cards

distribution

tells all the possible values of a variable and how often they each occur

21
New cards

two-way table

array that displays counts of two categorical variables with columns indicating the distribution for one variable and rows indicating the distribution for the other variable

22
New cards

marginal distribution

in a two-way table, this counts the distribution of values of one of the categorical variables among all individuals described by the table

23
New cards

conditional distribution

in a two-way table, this describes the values of a variable among individuals who have a specific value of another variable (there's a separate conditional distribution for each value of the other variable)

24
New cards

association

two variables have this quality if knowing the value of one variable helps predict the value of the other

25
New cards

center

a typical value-- could be the median or the mean

26
New cards

spread

how much a data set varies-- could be the range, the IQR, or the standard deviation

27
New cards

shape

a description of the symmetry or asymmetry and the number of modes/peaks

28
New cards

outlier

any data value that is unusually low or unusually high

29
New cards

segmented bar chart

a bar graph that is used to compare distributions of a categorical variable for two or more data sets (each bar is divided in proportion to the distribution of the categorical variable)

30
New cards

bar graph

a graph of a distribution of categorical data in which bars extend to display the frequency of various categories that are placed on an axis

31
New cards

median

a graph of a distribution of categorical data in which bars extend to display the frequency of various categories that are placed on an axis

32
New cards

range

the distance (a single number) between the minimum and maximum values in a data set (max - min)

33
New cards

boxplot

a graph of a distribution of quantitative data in which a central box extends between the quartiles with a central line marking the the median, lines extend to the largest and smallest values that are not outliers, while outliers are marked with individual points/dots

34
New cards

pth percentile

the value with p percent of the observations less than it

35
New cards

cumulative relative frequency graph

for quantitative data, displays the cumulative relative frequency of each class of a frequency distribution

36
New cards

z-score

tells us how many standard deviations from the mean an observation falls, and in what direction

<p><span><span>tells us how many standard deviations from the mean an observation falls, and in what direction</span></span></p>
37
New cards

Effect of Adding (or Subtracting) a Constant c

adds c to (or subtracts c from) measures of center and location (mean, median, quartiles, percentiles)

does not change the shape of the distribution or measures of spread (range, IQR, standard deviation).

38
New cards

Effect of Multiplying (or Dividing) by a Constant b

multiplies (or divides) measures of center and location (mean, median, quartiles, percentiles) by b

multiplies (or divides) measures of spread (range, IQR, standard deviation) by |b|, but does not change the shape of the distribution

39
New cards

density curve

describes the overall pattern of a distribution

is always on or above the horizontal axis, and

has area exactly 1 underneath it

40
New cards

median of a density curve

the equal-areas point, the point that divides the area under the curve in half

<p><span><span>the equal-areas point, the point that divides the area under the curve in half</span></span></p>
41
New cards

mean of a density curve

the balance point, at which the curve would balance if made of solid material

42
New cards

normal distributions

The mean of a Normal distribution is the center of the symmetric Normal curve.

The standard deviation is the distance from the center to the inflection points (steepest locations) on either side.

We abbreviate the Normal distribution with mean µ and standard deviation σ as N(µ,σ).

43
New cards

68-95-99.7 Rule (also called the Empirical Rule)

In the Normal distribution with mean µ and standard deviation σ:

Approximately 68% of the observations fall within σ of µ.

Approximately 95% of the observations fall within 2σ of µ.

Approximately 99.7% of the observations fall within 3σ of µ.

44
New cards

standard Normal distribution

the Normal distribution with mean 0 and standard deviation 1

We abbreviate this as N(0,1).

45
New cards

Normal probability plot

provides a good assessment of whether a data set follows an approximately Normal distribution

If the points lie close to a straight line, the plot indicates that the data are close to Normal.

x-axis: data, y-axis: expected z-scores

46
New cards

uniform distribution

A symmetric distribution closely resembling a rectangle; each value of the quantitative variable occurs with roughly the same frequency

47
New cards

InvNorm

Input: area or probability (to the LEFT)
Output: z-score or variable value

48
New cards

normalcdf

Input: Z-score or variable value
Output: area or probability

49
New cards

Explanatory variable

Independent variable that may help explain or predict changes in a response variable (usually X)

50
New cards

response variable

Dependent variable that measures an outcome of a study (usually Y)

51
New cards

Scatterplot

Graph that shows relationships between 2 quantitative variables measured on the same individuals with each individual represented by an ordered pair

52
New cards

Interpreting scatterplots

DOTS- Direction, Outliers/High-Leverage Points (if any), Trend, and Strength

53
New cards

Strength

weak/moderate/strong

54
New cards

Trend

linear/nonlinear

55
New cards

direction

positive/negative

56
New cards

Correlation doesn't change

x and y are switched or we change units

57
New cards

Correlation (r) is not resistant

Correlation is strongly affected by outliers and high-leverage points, so we say that...

58
New cards

Correlation (r)

Strength and direction of linear relationship (close to 1 or -1 indicates a STRONG linear relationship)

59
New cards

Least Squares Regression Line (LSRL)

The linear equation that describes the relationship between x and y

60
New cards

Extrapolation

Trying to use a regression equation to predict y for values of x that are larger or smaller than those in the original data set (AVOID!)

61
New cards

Residual

observed - predicted
(y - ŷ)

62
New cards

(x̅ , y̅)

This point is always on LSRL

63
New cards

Moving one standard deviation from the x̅ moves...

r standard deviations from the y̅

64
New cards

The linear model is a good fit

The residual plot has no apparent pattern and appears scattered randomly, so we know...

65
New cards

Coefficient of determination
(r^2)

The percent (or proportion) of variation in the y variable that is explained by the linear relationship between the x and y variables.

66
New cards

interpreting slope of a least squares regression line

For each 1 (unit) increase in (x), we predict a (b) (unit) increase/decrease in (y).

67
New cards

interpreting the y-intercept of a least squares regression line

For a (x) of 0 (unit), we predict a (y) of (a) (unit).

68
New cards

residual plot

a graph with the explanatory variable on the horizontal axis and residuals on the vertical axis

69
New cards

standard deviation of residuals, s

the typical error when making predictions with the LSRL

70
New cards

outlier

point with large residual 

71
New cards

high-leverage point

a data point with a much larger or smaller x-value than the rest of the data set