unit 3 stats

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 34

flashcard set

Earn XP

Description and Tags

12th grade ap statistics

35 Terms

1

univariate data

one variable data set

New cards
2

bivariate data

relationship between two variables

New cards
3

explanatory variable

predict or explain changes in response varaible

New cards
4

response varaible

measures an outcome of a study

New cards
5

scatterplots

- shows relationship/association between two quantitative variables measured on the same individuals
- explanatory variable: x-axis
- response variable: y-axis
- no explanatory variable: either variable can go on x-axis

New cards
6

describing scatterplots

- direction: positive, negative, no association
- form: linear or nonlinear
- strength: weak, moderate, strong
- unusual features: points that fall outside of overall pattern and distinct clusters of points

New cards
7

correlation r

- measures the direction and strength of association for a linear relationship only
- between -1 and 1
- does not equal causation
- does not measure form
- not a resistant measure of strength
- both quantitative variables
- no distinction between explanatory and response variables
- does not change when units change
- no unit of measurement (just a number)

<p>- measures the direction and strength of association for a linear relationship only<br>- between -1 and 1<br>- does not equal causation<br>- does not measure form<br>- not a resistant measure of strength<br>- both quantitative variables<br>- no distinction between explanatory and response variables<br>- does not change when units change<br>- no unit of measurement (just a number)</p>
New cards
8

correlation r interpretation

"the linear relationship between X and Y is STRENGTH and DIRECTION"

New cards
9

coefficient of determination r² interpretation

"the percent of the variation in Y explained by the linear relationship with X"

New cards
10

used to make predictions

ŷ = a + bx

New cards
11

residual

actual - predicted
(difference between the actual value of y and the value of y predicted by the regression line)

New cards
12

residual interpretation

"the actual CONTEXT was RESIDUAL above/below the predicted value for X = #"

New cards
13

ŷ = a + bx interpretations

- "when X = 0 CONTEXT the predicted Y-CONTEXT is Y-INTERCEPT"
- "for each additional X-CONTEXT the predicted Y-CONTEXT increases/decreases by SLOPE"

New cards
14

regression line

summarizes relationship between two variables but only when one variable helps explain the other

New cards
15

extrapolation

- using a regression line to make a prediction for x-values outside (larger/smaller) the x-values used to obtain the data
- don't do it; not accurate

New cards
16

least-squares regression line

the line that makes the sum of the squared residuals as small as possible

New cards
17

residual plot

- scatterplot that displays the residuals on the vertical axis and the explanatory variable on the horizontal axis
- appropriate model: no leftover curved pattern
- not appropriate model: leftover curved pattern

New cards
18

correlation r strength

- strong negative: -1
- moderate negative: -0.5
- weak (no association): 0
- moderate positive: 0.5
- strong positive: 1

New cards
19

properties of correlation r

- unusual value in pattern = strengthens r
- unusual value not in pattern = weakens r

New cards
20

standard deviation of the residuals s

- measures the size of a typical residual
- s measures the typical distance between the actual y values and the predicted y values

New cards
21

coefficient of determination r²

- measures the percent of variability in the response variable that is accounted for by the LSRL
- tells us how much better the LSRL does at predicting values of y than simply guessing the mean y for each value in the data

New cards
22

regression to the mean

for an increase of 1 standard deviation in the value of the explanatory variable x, the LSRL predicts an increase of r standard deviations in the response variable y

New cards
23

high leverage in regression

much larger or smaller x-values than the other points in the data set

New cards
24

outlier in regression

- does not follow the pattern of the data
- large residual

New cards
25

influential point in regression

if removed, big changes to slope, y-intercept, and r values

New cards
26

association does not imply causation

a strong association is not enough to draw conclusions about cause and effect

New cards
27

horizontal outliers

tilt line

New cards
28

vertical outliers

shift line up/down

New cards
29

linear

graph x vs. y

New cards
30

exponential

graph x vs. log y

New cards
31

power (y=axᵖ)

graph log x vs. log y

New cards
32

achieve linearity with power model

- raise value of explanatory variable x to the p power (xᵖ, y)
- take pᵗʰ root of the values of the response variable y (x, ᵖ√y)

New cards
33

linear pattern

scatterplot of logarithms of both variables

New cards
34

roughly linear assoication

scatterplot of logarithm of y against x

New cards
35

choosing the best regression

1. check scatterplot for linear pattern
2. check residual plot for no distinct pattern
3. check for the r² that is closest to 1

New cards
robot