Linear Regression

studied byStudied by 0 people
0.0(0)
Get a hint
Hint

Intercept (a)

1 / 35

flashcard set

Earn XP

36 Terms

1

Intercept (a)

starting value in y-units. The y-value when x is zero

New cards
2

slope(b)

For every 1 (x unit) increase in (x variable) there is a (slope) (y unit) increase in mean (y variable). slope = SD of x/SD of y

New cards
3

Correlation Coefficient

There appears to be a (weak/moderate/strong) (positive/negative) (linear/nonlinear) relationship between (x variable) and (y variable)

New cards
4

Coefficient of Determination

About R²% of the variability in (y variable) can be explained by variability in (x variable)

New cards
5

Standard Error of residuals

on average, the actual (y variable) values vary about (standard error of slope, sb/SEb with units) from the predicted values (find using LinRegTTest and select sign from alt. hypothesis)

New cards
6

Residual = Actual - preducted value (how far vertically from line of best fit)

pos: underestimated

neg: overestimated

New cards
7

Explanatory variable

cause, independent, x-axis

New cards
8

Response variable

effect, dependent, y-axis

New cards
9

Association (any form)

Direction: positive/negative, Form: straight/curved Strength: weak/moderate/strong or combo

New cards
10

Correlation

cannot be greater than one. If given r², square root r²

New cards
11

Outliers

can either have large residual or high leverage

New cards
12

Leverage

high leverage if x value is far from mean of x-values, works like a lever if it’s influential

New cards
13

Quantitative variables condition

both variables are quantitative

New cards
14

Straight enough condition

scatter plot looks reasonably straight

New cards
15

Outliers condition

outliers either arent obvious or have a large enough sample to proceed with caution with

New cards
16

Correlation of 0

no linear association

New cards
17

Correlation

measures strength of linear association between two variables, which can be strongly associatied but still have small correlation if said association isnt linear

New cards
18

Linear model

y = a+b(x)

New cards
19

Residual

observed-predicted

New cards
20

Turn scatter plot on

stat diagnostic on in mode, stat edit, L1 = X, L2 = Y, 2nd y= on, window 9, graph

New cards
21

Get linear model on calc

stat-calc-8, store regEq, vars-y-vars-function-y1

New cards
22

residuals

use l3 to 2nd-stat-resid

New cards
23

Outliers

horizontal outliers (leverage) more influential than vertical outliers (residuals)

New cards
24

A residual scatter plot with a cluster and one “stray point:”

The point has high/low leverage and a large/small residual. this point is/isn’t influential/ If the point were removed the correlation would become weaker/stronger, and removing it would strengthen/weaken the association. The slope would increase/decrease/remain the same, since the point is/isn’t influential.

New cards
25

Null hypothesis

Ho: There is no linear relationship between —- and —-. (B = 0.)

New cards
26

Alternative hypothesis

Ha: there is a linear relationship between —— and ——. (B doesnt equal 0)

New cards
27

Assumptions for inference. IN ORDER

Straight enough, Independence, Spread, Nearly Normal (SEISNN, Sally Eats Icees Stealthily Nearing Normandy)

New cards
28

Straight Enough

Scatter plot of data points is straight enough to try a linear model

New cards
29

Independence

residual plot is scattered

New cards
30

Spread

spread of residuals is consistent

New cards
31

Nearly Normal condition

histogram of residuals is unimodal and symmetric. If possible outlier: with one possible outlier, with the large sample size however, it should be okay to proceed

New cards
32

After conditions

since the conditions for inference have been met, the sampling distribution of the regression slope can be modeled by a Student’s t-model with — degrees of freedom. We’ll use a regression slope t-test. The equation of the line of best fit of these data points is y = a+bx where —- are measured in — units.

New cards
33

P-value is less than alpha

the value of t = ____. The P-value of less than alpha means that the association we see in the data is unlikly to occur by chance. Since our P-value is below our signifcance level of —, we reject the null hypothesis and conclude there is strong evidence of a linear relationship between —— and —-. As —— increases, —— (increases/decreases)

New cards
34

P value is greater than alpha

the value of t = ____. The P-value of greater than alpha means that the association we see in the data is likely to occur by chance. Since our P-value is above our significance level of —-, we fail to reject the null hypothesis and conclude theres weak evidence of a linear relationship between —- and —-.

New cards
35

conf interval

a GIVEN PERCENT confidence regression slope t-interval: ind. coeficcient +- (invT(conf level, Dof (remember it’s -2!)(SE coefficient of independent variable) equals about (—-,—--)i

New cards
36

interpret confidence interval

we are GIVEN PERCENT confident that the mean increase/decrease is in an interval between about —- and about —-

New cards

Explore top notes

note Note
studied byStudied by 51 people
... ago
5.0(1)
note Note
studied byStudied by 10 people
... ago
5.0(1)
note Note
studied byStudied by 14 people
... ago
5.0(1)
note Note
studied byStudied by 19 people
... ago
5.0(1)
note Note
studied byStudied by 10 people
... ago
5.0(1)
note Note
studied byStudied by 33 people
... ago
5.0(1)
note Note
studied byStudied by 18 people
... ago
5.0(1)
note Note
studied byStudied by 113 people
... ago
4.0(1)

Explore top flashcards

flashcards Flashcard (102)
studied byStudied by 6 people
... ago
5.0(1)
flashcards Flashcard (45)
studied byStudied by 5 people
... ago
5.0(1)
flashcards Flashcard (40)
studied byStudied by 2 people
... ago
5.0(1)
flashcards Flashcard (28)
studied byStudied by 7 people
... ago
5.0(1)
flashcards Flashcard (52)
studied byStudied by 3 people
... ago
5.0(1)
flashcards Flashcard (27)
studied byStudied by 135 people
... ago
5.0(3)
flashcards Flashcard (110)
studied byStudied by 18 people
... ago
5.0(1)
flashcards Flashcard (42)
studied byStudied by 1 person
... ago
5.0(1)
robot