Home
Explore
Exams
Search for anything
Login
Get started
Home
Math
Statistics
Linear Regression and Correlation
2.1-2.9
0.0
(0)
Rate it
Studied by 0 people
Learn
Practice Test
Spaced Repetition
Match
Flashcards
Card Sorting
1/22
Earn XP
Description and Tags
Statistics
Linear Regression and Correlation
Add tags
Study Analytics
All
Learn
Practice Test
Matching
Spaced Repetition
Name
Mastery
Learn
Test
Matching
Spaced
No study sessions yet.
23 Terms
View all (23)
Star these 23
1
New cards
DUFS
Direction
Unusual
Form
Strength
2
New cards
Direction
\+/-/none
3
New cards
Unusual
clusters, outliers
4
New cards
Form
linear/non-linear
5
New cards
Strength
strong/weak/moderate
6
New cards
Context
the relationship between variable 1 & 2
7
New cards
Interpreting r (correlation coefficient)
use DUFS except for U
strong (-1), moderate (-0.5), weak (0), moderate (0.5), strong (1)
nonresistant to outliers
has no units
as long as pattern of points stays same r is same
8
New cards
y=a+bx
prediction = y-int + slope x
9
New cards
extrapolation
predictions outside data set
10
New cards
residuals
actual - prediction (A-P)
11
New cards
interpret residuals
the actual __context of y__ was __residual__ __higher/lower__ than predicted for __x-value.__
12
New cards
y-int
when __context of x__ is 0, the predicted __context of y__ is __y-int__
13
New cards
slope
a predicted __slope in context__ for every context of x
14
New cards
residual plots
show how good a fit the LSRL is worth the data (better than r!)
we want to see a random scatter with no pattern or large clusters
15
New cards
r^2 (coefficient of determination)
measures the % reduction in Sigma(residual)^2 when using LSRL
16
New cards
interpret r^2
About (r^2)% of the variability in context of y is accounted for by the LSRL
17
New cards
standard deviation of the residuals (s)
= square root (sigma r^2)/(n-2)
n-2 because 2 points makes a line
no variation from LSRL
18
New cards
interpret s
The actual context of y is typically about __s__ away from the number predicted by the LSRL.
19
New cards
why not just use residuals?
Any LSRL has sigma residuals about equal to 0, so r^2 tells us more!
r^2 = sigma (residuals)^2 = (correlation)^2
20
New cards
(slope) b =
r sy/sx
S.D. of y residuals
S.D. of x residuals
21
New cards
(y-int) a =
y-bar = b x-bar
mean y & x
22
New cards
To transform data, use inverses
function I graph
linear I x vs. y
exponent I x vs. logy
power I logx vs. logy
predictions/residuals
substitute into the transferred LSRL may need to use algebra to solve for y
inverses → log vs. 10^x ; lnx vs. e^x ; x^3 vs. cubic root x ; etc.
23
New cards
choosing the best regression
1) check transformed scatterplot for linear pattern
2) check residual plot for no leftover pattern
3) if deciding between several option
r close to +-1
r^2 close to 1
s small as possible
last two are better than r