correlation and regression


24 Terms

1
S²
symbol for variance
2
S
symbol for standard deviation
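A quick illustration of how S² and S relate: the standard deviation is the square root of the variance. The sketch below assumes the sample versions (dividing by n - 1), which the cards do not state, and uses made-up data values.

```python
import statistics

# Made-up data values, for illustration only.
data = [4.0, 7.0, 9.0, 10.0, 15.0]

# Assumption: sample variance and sample standard deviation (divide by n - 1).
s_squared = statistics.variance(data)
s = statistics.stdev(data)

print("S^2 =", s_squared)
print("S   =", s)
```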
3
Simplest general pattern
linear
4
Direction
positive (uphill), negative (downhill)
5
Strength
how closely the data follows the pattern (the line)
6
r
symbol for correlation coefficient, between -1 and 1
7
Correlation
describes the strength and direction of the linear relationship between x and y
8
Perfect positive linear relationship
r = +1
9
Perfect negative linear relationship
r = -1
10
Weak correlation
r between -0.3 and 0.3
11
Strong correlation
r between -1 and -0.7, or between 0.7 and 1
12
Moderately weak
r between -0.5 and -0.3, or between 0.3 and 0.5
13
Moderately strong
r between -0.7 and -0.5, or between 0.5 and 0.7
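The strength and direction cards can be turned into a small lookup. This is only a sketch: the function names are hypothetical, and the cards do not say which label applies when r is exactly 0.3, 0.5, or 0.7, so putting each boundary value in the stronger category is an assumption.

```python
def describe_strength(r: float) -> str:
    """Label the strength of r using the cutoffs from the cards."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must be between -1 and 1")
    size = abs(r)  # strength ignores the sign
    # Assumption: a boundary value (0.3, 0.5, 0.7) gets the stronger label.
    if size >= 0.7:
        return "strong"
    if size >= 0.5:
        return "moderately strong"
    if size >= 0.3:
        return "moderately weak"
    return "weak"


def describe_direction(r: float) -> str:
    """Label the direction of r: positive is uphill, negative is downhill."""
    if r > 0:
        return "positive"
    if r < 0:
        return "negative"
    return "none"


print(describe_strength(-0.85), describe_direction(-0.85))
print(describe_strength(0.4), describe_direction(0.4))
```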
14
Properties of correlation
applies to 2 quantitative variables only; measures linear relationships only; r has no units; switching x and y doesn’t affect r; r is affected by outliers and skewness
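Three of these properties can be checked numerically. The sketch below uses NumPy's `corrcoef` on made-up data; the data values and the choice of outlier are assumptions for illustration.

```python
import numpy as np

# Made-up data with a nearly perfect positive linear pattern.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Switching x and y does not affect r.
r_xy = np.corrcoef(x, y)[0, 1]
r_yx = np.corrcoef(y, x)[0, 1]

# r has no units: rescaling or shifting the variables leaves it unchanged.
r_rescaled = np.corrcoef(x * 100.0, y + 32.0)[0, 1]

# r is affected by outliers: one point far from the pattern pulls r down.
x_out = np.append(x, 6.0)
y_out = np.append(y, 0.0)
r_outlier = np.corrcoef(x_out, y_out)[0, 1]

print("r(x, y)        =", r_xy)
print("r(y, x)        =", r_yx)
print("r after rescale =", r_rescaled)
print("r with outlier  =", r_outlier)
```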
15
b0
symbol for y-intercept
16
b1
symbol for slope
17
SSE
sum of squared errors
18
Ybar
mean of all y values
19
Coefficient of determination
percentage of the variability in y that is explained by the linear relationship with x. Notation: R² = r²
20
Y hat
predicted y in terms of x
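The symbols b0, b1, SSE, Ybar, Y hat, and R² fit together in one short calculation. This is a sketch on made-up data using the standard least-squares formulas b1 = r(Sy / Sx) and b0 = Ybar - b1(Xbar), which the cards do not spell out. The name `sst` (total sum of squares about Ybar) is also not from the cards.

```python
import numpy as np

# Made-up data, for illustration only.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.1, 5.9, 8.2, 9.8])

r = np.corrcoef(x, y)[0, 1]

# Least-squares slope and y-intercept (sample standard deviations, ddof=1).
b1 = r * y.std(ddof=1) / x.std(ddof=1)
b0 = y.mean() - b1 * x.mean()

# Y hat: predicted y for each observed x.
y_hat = b0 + b1 * x

# SSE: sum of squared errors about the line.
sse = np.sum((y - y_hat) ** 2)

# sst (name is an assumption): sum of squared deviations of y about Ybar.
sst = np.sum((y - y.mean()) ** 2)

# Coefficient of determination; for a line with one x it equals r squared.
r_squared = 1.0 - sse / sst

print("b0 =", b0, "b1 =", b1)
print("R^2 =", r_squared, "r^2 =", r ** 2)
```

Computing R² from SSE rather than by squaring r makes the "variability explained" reading visible: the smaller SSE is relative to the total variability about Ybar, the closer R² is to 1.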
21
Notes about interpreting y-intercept
X = 0 needs to make sense in context, and the data need to include X values near 0
22
Extrapolation
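predicting y for x values outside the range of the observed data; risky because there is no data out there to show the pattern still holds (think about temperatures)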
23
How to calculate residuals
observed y – predicted y (residual = y – ŷ)
24
If line fits well
residuals should have no pattern (random scatter about the regression line), no systematic change as X increases, no unusually large residuals (outliers in the y direction), and no influential points (outliers in the x direction)
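Residuals are easy to compute once a line has been fitted. The sketch below uses made-up data and NumPy's `polyfit` to get the least-squares line, then forms observed y minus predicted y for each point. Checking the fit still means looking at the residuals against x for a pattern; the two sums printed at the end are zero for any least-squares line, so they do not replace that check.

```python
import numpy as np

# Made-up data, for illustration only.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.2, 3.8, 6.1, 8.0, 9.9, 12.1])

# Least-squares line: polyfit with degree 1 returns (slope, intercept).
b1, b0 = np.polyfit(x, y, 1)

# Residual = observed y - predicted y.
y_hat = b0 + b1 * x
residuals = y - y_hat

for xi, ei in zip(x, residuals):
    print(f"x = {xi:.0f}  residual = {ei:+.3f}")

# Both sums are zero (up to rounding) for a least-squares line with an intercept.
print("sum of residuals     =", residuals.sum())
print("sum of x * residuals =", np.sum(x * residuals))
```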