unit 3 stats

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

flashcard set

Earn XP

Description and Tags

12th grade ap statistics

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

univariate data

one variable data set

2
New cards

bivariate data

relationship between two variables

3
New cards

explanatory variable

predict or explain changes in response varaible

4
New cards

response varaible

measures an outcome of a study

5
New cards

scatterplots

- shows relationship/association between two quantitative variables measured on the same individuals
- explanatory variable: x-axis
- response variable: y-axis
- no explanatory variable: either variable can go on x-axis

6
New cards

describing scatterplots

- direction: positive, negative, no association
- form: linear or nonlinear
- strength: weak, moderate, strong
- unusual features: points that fall outside of overall pattern and distinct clusters of points

7
New cards

correlation r

- measures the direction and strength of association for a linear relationship only
- between -1 and 1
- does not equal causation
- does not measure form
- not a resistant measure of strength
- both quantitative variables
- no distinction between explanatory and response variables
- does not change when units change
- no unit of measurement (just a number)

<p>- measures the direction and strength of association for a linear relationship only<br>- between -1 and 1<br>- does not equal causation<br>- does not measure form<br>- not a resistant measure of strength<br>- both quantitative variables<br>- no distinction between explanatory and response variables<br>- does not change when units change<br>- no unit of measurement (just a number)</p>
8
New cards

correlation r interpretation

"the linear relationship between X and Y is STRENGTH and DIRECTION"

9
New cards

coefficient of determination r² interpretation

"the percent of the variation in Y explained by the linear relationship with X"

10
New cards

used to make predictions

ŷ = a + bx

11
New cards

residual

actual - predicted
(difference between the actual value of y and the value of y predicted by the regression line)

12
New cards

residual interpretation

"the actual CONTEXT was RESIDUAL above/below the predicted value for X = #"

13
New cards

ŷ = a + bx interpretations

- "when X = 0 CONTEXT the predicted Y-CONTEXT is Y-INTERCEPT"
- "for each additional X-CONTEXT the predicted Y-CONTEXT increases/decreases by SLOPE"

14
New cards

regression line

summarizes relationship between two variables but only when one variable helps explain the other

15
New cards

extrapolation

- using a regression line to make a prediction for x-values outside (larger/smaller) the x-values used to obtain the data
- don't do it; not accurate

16
New cards

least-squares regression line

the line that makes the sum of the squared residuals as small as possible

17
New cards

residual plot

- scatterplot that displays the residuals on the vertical axis and the explanatory variable on the horizontal axis
- appropriate model: no leftover curved pattern
- not appropriate model: leftover curved pattern

18
New cards

correlation r strength

- strong negative: -1
- moderate negative: -0.5
- weak (no association): 0
- moderate positive: 0.5
- strong positive: 1

19
New cards

properties of correlation r

- unusual value in pattern = strengthens r
- unusual value not in pattern = weakens r

20
New cards

standard deviation of the residuals s

- measures the size of a typical residual
- s measures the typical distance between the actual y values and the predicted y values

21
New cards

coefficient of determination r²

- measures the percent of variability in the response variable that is accounted for by the LSRL
- tells us how much better the LSRL does at predicting values of y than simply guessing the mean y for each value in the data

22
New cards

regression to the mean

for an increase of 1 standard deviation in the value of the explanatory variable x, the LSRL predicts an increase of r standard deviations in the response variable y

23
New cards

high leverage in regression

much larger or smaller x-values than the other points in the data set

24
New cards

outlier in regression

- does not follow the pattern of the data
- large residual

25
New cards

influential point in regression

if removed, big changes to slope, y-intercept, and r values

26
New cards

association does not imply causation

a strong association is not enough to draw conclusions about cause and effect

27
New cards

horizontal outliers

tilt line

28
New cards

vertical outliers

shift line up/down

29
New cards

linear

graph x vs. y

30
New cards

exponential

graph x vs. log y

31
New cards

power (y=axᵖ)

graph log x vs. log y

32
New cards

achieve linearity with power model

- raise value of explanatory variable x to the p power (xᵖ, y)
- take pᵗʰ root of the values of the response variable y (x, ᵖ√y)

33
New cards

linear pattern

scatterplot of logarithms of both variables

34
New cards

roughly linear assoication

scatterplot of logarithm of y against x

35
New cards

choosing the best regression

1. check scatterplot for linear pattern
2. check residual plot for no distinct pattern
3. check for the r² that is closest to 1