CDOFS =
Context (explanatory and response variable with units)
Direction (positive/negative)
Obvious unusual features?
Influential points
Outliers
High leverage points
Etc.
Form (linear or nonlinear)
Strength |r| (weak: |r| < 0.5, moderate: |r| between 0.5 and 0.7, strong: |r| > 0.7)
Interpret the correlation coefficient “r”
r =___. There is a (strong/moderate/weak) (positive/negative) relationship between explanatory variable and response variable. (Note: you cannot determine form from a correlation coefficient only)
Interpret the coefficient of determination “r²”
___% of the variation in the response variable can be explained by the linear relationship with explanatory variable.
____% of the prediction error (compared with predicting ȳ for every observation) is accounted for by the LSRL
Interpret the slope “b”
For each increase of 1 ___ in the explanatory variable, the predicted response variable will (increase/decrease) by ___.
Interpret the y-intercept “a”
The predicted response variable when explanatory variable = 0 is ___.
also interpret if this value makes sense in the context
Interpret residual
The residual is ___. This means that the actual response variable is ___
(less than/more than) the value predicted by the LSRL for this observation's explanatory value.
Interpret standard deviation of the residuals “s”
s = ___. When using the least-squares regression line with x = explanatory variable to predict y = response variable, our predictions will typically be off from the actual data by ___.
Interpret residual plot
A linear model (or whatever the current model is) (is/is not) appropriate because there is (not an obvious/an obvious) pattern in the residual plot.
explanatory (independent) variable:
the variable that predicts, explains, or influences a trend in the response variable. This is the x-variable
response (dependent) variable:
The measured outcome. Responds to trends in the explanatory variable. This is the y-variable
positive correlation
as the x-values increase, the y-values also tend to increase
negative correlation
as the x-values increase, the y-values tend to decrease
least-squares regression line (LSRL):
straight line that roughly puts half of your data above it and half below it
makes the sum of the squared residuals as small as possible
strong correlation interpretation:
data is close to the LSRL
the LSRL is a good model for the data
if you used the LSRL to predict new data, you would make close predictions
weak correlation interpretation:
data is far from the LSRL
the LSRL is a “meh” model for the data
if you used the LSRL to predict new data, you may be quite a bit off
correlation coefficient (r):
A number between -1 and 1 that tells you the strength and direction of correlation
(-1 ≤ r ≤ 1)
the closer to -1 or 1, the stronger the relationship
the closer to zero, the weaker the relationship
-1 and 1 only occur in the case of a perfect linear relationship
**only used to describe linear relationships
properties of the correlation coefficient ( r ):
requires that both variables be quantitative
makes no distinction between explanatory and response variables
does not change when we change the units of measurement of x, y or both
has NO UNIT of measurement
residuals
the vertical distance between each actual response value and the value predicted by the linear regression (the y-distance between the point and the line)
The “error” between the prediction and actual
residual = actual - predicted = y - ŷ
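A minimal Python sketch of this calculation, using made-up numbers:

```python
# residual = actual - predicted = y - y-hat (values below are hypothetical)
y_actual = 72        # observed response value
y_predicted = 68.5   # value predicted by the LSRL

residual = y_actual - y_predicted
print(residual)      # 3.5 -> positive residual: the point sits above the line
```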
equation for LSRL and variable meanings
ŷ = a + bx
a = y intercept
b = slope
ALWAYS LABEL VARIABLES
equation for “a”
ȳ - b(x̄)
y mean - slope ( x mean)
equation for “b”
r * (Sy)/(Sx)
correlation coefficient * (standard deviation y)/(standard deviation x)
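A short Python sketch (hypothetical data) that applies these two formulas and checks that the resulting line passes through (x mean, y mean):

```python
import statistics

# hypothetical data for illustration
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.1, 9.8]

r = statistics.correlation(x, y)                      # correlation coefficient (Python 3.10+)
sx, sy = statistics.stdev(x), statistics.stdev(y)     # sample standard deviations
x_bar, y_bar = statistics.mean(x), statistics.mean(y)

b = r * sy / sx         # slope:       b = r * (Sy / Sx)
a = y_bar - b * x_bar   # y-intercept: a = y-bar - b * (x-bar)

print(f"predicted y = {a:.3f} + {b:.3f} * x")
print(abs((a + b * x_bar) - y_bar) < 1e-9)            # True: LSRL passes through (x-bar, y-bar)
```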
pos. residual value =
actual value is higher than model’s prediction
neg. residual value =
actual value is lower than model’s prediction
what should the residual plot look like for a good fit model?
be centered at zero
have no pattern
look random
only random scatter
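A quick matplotlib sketch (made-up data) of what such a residual plot looks like; the points should scatter randomly around the horizontal line at zero:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
x = np.linspace(1, 10, 30)
y = 3 + 2 * x + rng.normal(0, 1.5, size=x.size)   # roughly linear hypothetical data

b, a = np.polyfit(x, y, 1)     # slope and y-intercept of the LSRL
residuals = y - (a + b * x)    # actual - predicted

plt.scatter(x, residuals)
plt.axhline(0, color="gray")   # residuals should center on zero with no pattern
plt.xlabel("explanatory variable (x)")
plt.ylabel("residual")
plt.show()
```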
properties of LSRL
always goes through (x mean, y mean)
is a “see-saw” centered at (x mean, y mean)
low-leverage points:
close to x mean
less pull on the see-saw; don't affect the LSRL much
high-leverage points:
far from x mean
have much larger or much smaller x values than the other points
have a high effect on an LSRL’s slope + y-intercept
influential points:
points that if removed change the slope, y-intercept, correlation, r², or SD of residuals substantially
outliers and high leverage points are often influential in regression calculations
outlier
is any point that does not follow the pattern of the data and has a large residual
changes the correlation
high leverage influential points
unusually high (or low) x-value
changes slope/y-int
not outlier because low residual (close to LSRL)
low leverage influential
central x-value
is an outlier: high residual (far from LSRL)
lowers r (correlation coefficient)
both outlier + high leverage influential point
unusually high (or low) x-value
high residual (far from LSRL)
changes slope/y-int
lowers r (correlation coefficient)
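A small Python sketch (made-up values) of how adding one point that is both an outlier and high leverage changes the slope and lowers r:

```python
import statistics
import numpy as np

# a nearly perfect linear pattern (hypothetical values)
x = [1, 2, 3, 4, 5]
y = [2.0, 4.1, 5.9, 8.2, 9.9]

def lsrl_summary(x_vals, y_vals):
    b, a = np.polyfit(x_vals, y_vals, 1)        # slope, y-intercept
    r = statistics.correlation(x_vals, y_vals)
    return round(a, 2), round(b, 2), round(r, 3)

print(lsrl_summary(x, y))                  # slope near 2, r near 1

# add one point with an unusually large x-value and a large residual
print(lsrl_summary(x + [15], y + [5.0]))   # slope drops sharply and r weakens
```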
Where is the y-intercept in a regression table?
the top value in the Coef column (the "Constant" row)
Where is the slope in a regression table?
the Coef value below the y-intercept (the row for the explanatory variable)
Where is the standard deviation of the residuals in a regression table?
It is the S value
Where is r (correlation coefficient) in a regression table?
take the square root of the R-Sq value (usually given as a percent) and attach the sign of the slope
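A tiny Python sketch (hypothetical output values) of recovering r from R-Sq:

```python
import math

r_sq = 0.846    # R-Sq reported as 84.6%, written as a proportion (hypothetical)
slope = -1.92   # slope coefficient from the same output (hypothetical)

# r is the square root of R-Sq, with the same sign as the slope
r = math.copysign(math.sqrt(r_sq), slope)
print(round(r, 3))   # about -0.92
```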
association
There is an association between 2 variables if knowing the value of one variable helps us predict the value of the other.
Steps for describing scatterplot:
Direction (positive/negative)
Form (linear/nonlinear)
Strength (weak, moderate, strong)
Unusual features (outliers)
**use context
regression line
line that models how a response variable y changes as an explanatory variable x changes
extrapolation
use of a regression line for prediction outside the interval of x-values used to obtain the line
the further we extrapolate, the less reliable the predictions become
How do you label the variables of the LSRL?
put the names in the equation
explicitly define x and y
residual plot
scatterplot that displays the residuals on the vertical axis and the values of the explanatory variable on the horizontal axis
coefficient of determination r²
measures the percent reduction in the sum of squared residuals when using the LSRL to make predictions rather than the mean value ȳ
r² measures the proportion of the variation in the response variable that is accounted for by the explanatory variable in the linear model
standard deviation of residuals “S”
measures the size of a typical residual; that is, S measures the typical distance between the actual y-value and the predicted y-value
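A minimal Python sketch (made-up data) computing both r² as a reduction in squared error and S from the residuals, assuming the usual n - 2 degrees of freedom for regression:

```python
import math
import numpy as np

# hypothetical data for illustration
x = np.array([1, 2, 3, 4, 5, 6])
y = np.array([2.3, 4.1, 5.8, 8.4, 9.7, 12.1])

b, a = np.polyfit(x, y, 1)   # slope and y-intercept of the LSRL
y_hat = a + b * x            # predicted values

sse = np.sum((y - y_hat) ** 2)      # squared error using the LSRL
sst = np.sum((y - y.mean()) ** 2)   # squared error using y-bar alone

r_sq = 1 - sse / sst                # proportion of variation accounted for
s = math.sqrt(sse / (len(x) - 2))   # typical residual size

print(round(r_sq, 3), round(s, 3))
```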
how do you undo log?
raise 10 to that power (compute 10^value)
how do you undo ln?
raise e to that power (compute e^value)
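A quick check of both inverses in Python (the value is hypothetical):

```python
import math

value = 4.7   # a hypothetical logged value

# undo a base-10 log: raise 10 to the value
print(10 ** math.log10(value))     # 4.7 (up to floating-point rounding)

# undo a natural log: raise e to the value
print(math.exp(math.log(value)))   # 4.7
```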
How do you choose the most appropriate model?
choose the model whose residual plot has the most random scatter
if more than one model produces a randomly scattered plot, choose the model with the largest r²
it is not advisable to use S, since the y-values may be on different scales (for example, after a transformation)
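A sketch (made-up data) of comparing a linear model with a log-transformed model by residual pattern and r², in the spirit of the steps above:

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(1, 20, 40)
y = 5 * np.exp(0.15 * x) * rng.normal(1, 0.05, size=x.size)   # curved, exponential-ish data

def residuals_and_r_sq(x_vals, y_vals):
    b, a = np.polyfit(x_vals, y_vals, 1)
    resid = y_vals - (a + b * x_vals)
    r_sq = 1 - np.sum(resid ** 2) / np.sum((y_vals - y_vals.mean()) ** 2)
    return resid, round(r_sq, 3)

# Model 1: y vs x (untransformed)   Model 2: log(y) vs x (transformed)
resid_lin, r_sq_lin = residuals_and_r_sq(x, y)
resid_log, r_sq_log = residuals_and_r_sq(x, np.log10(y))

# the linear model's residuals show a curved pattern; the log model's
# residuals scatter randomly and its r-squared is larger
print(r_sq_lin, r_sq_log)
```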