Linear Regression

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/20

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 1:53 PM on 4/8/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

21 Terms

1
New cards

predicted value (y hat)

The estimate made from the model

2
New cards

Residual

The difference between the observed value (the one that’s given) and the predicted value (using model) and it tells us how far off the model’s prediction is at that point

3
New cards

To find residuals

residual = observed - predicted (y- y hat)

4
New cards

If there’s a patter in the residual plot..

then the model is not appropriate

5
New cards

a line that fits well has..

very small residuals

6
New cards

How to asses how well the line fits with residuals?

squaring the residuals (makes them all positive and emphasizes large residuals)) and then adding them all up (the smaller the sum the better)

7
New cards

Least Squares Line or the Least squares regression line (LSRL)

the unique line of a data set that makes the sum of the squared residuals as small as it can be

8
New cards

Correlation and Line

Correlation and the line are related. The correlation leads us to the equation we need to model the data

9
New cards

What r tells us about the regression

  • For each SD of x that we are away from the x mean, we expect to be r SDs of y away from the mean

  • Because r is always between -1 and +1, each predicted y is fewer SDs away from its mean than the corresponding x was (regression to the mean)

  • The slope of the line is based on the correlation (slope = r times the quotient of SD of y divided by SD of x)

10
New cards

Regression to the mean

each predicted y tends to be closer to its mean (in SDs) than its corresponding x was.

11
New cards

Line of best fit/ LSRL formula

y hat= bo +b1x

<p>y hat= bo +b1x </p>
12
New cards

slope formula

b1 = r times sy divided by sx

<p>b1 = r times sy divided by sx </p>
13
New cards

y-intercept formula

b0 = average y - b1 times average x

<p>b0 = average y - b1 times average x </p>
14
New cards

slope

as your explanatory variable increases by ___ the response variable has been associated by an increase of b1 (NOT CAUSE AND EFFECT)

15
New cards

y-intercept

when explanatory variable is 0, the response variable is b0

16
New cards

Typical Residual Size/ Standard Error

one way to evaluate how good a model is, a rough estimate of how much our model is typically off by when we use it to predict the y-values from the x-values

17
New cards

Standard Error formula

Standard error= standard error of residuals= difference between y and y hat, squared and summed up, then divide by n-2, then square root it

18
New cards

Coefficent of Determination (r²)

in general, when the LSRL is used to fit bivariate data, it represents the fraction of the variation between 0-1 (always positive because it’s squared); this tells you the % of the variation in the responses (= to how the % that the model explains the data), so the closer it is to 1 the better/more robust the model is.

19
New cards

A tale of 2 regressions

you CANNOT PREDICT X FROM Y IN A MODEL because they have different equations (but same correlation, but need to switch variables to get new regression)

20
New cards

Before starting a regression, check the following things:

  • Quantitative Variable Condition (are both variables quantitative?)

  • Straight Enough Condition (is r and r² close to 1/ is the data straight and linear enough to run a regression)

  • Outlier Condition

  • Does the plot thicken? (make sure there’s a relatively consistent spread (also for residuals)

  • Check the residuals with residual plot (if there’s a pattern = linear model is NOT APPROPRIATE even with high r/r²)

21
New cards

What can go wrong?

  • Don’t force a linear relationship

  • Don’t say x causes y

  • Don’t invert regression

  • Don’t choose based off r²/scatterplot alone (check EVERYTHING to make sure that the model is appropriate)

Explore top notes

note
Social Stratification
Updated 1372d ago
0.0(0)
note
AP Biology Course Review Part 4
Updated 1628d ago
0.0(0)
note
1.1 Understanding Social Problems
Updated 1100d ago
0.0(0)
note
APUSH Exam Review
Updated 677d ago
0.0(0)
note
The Cell
Updated 1268d ago
0.0(0)
note
The Odyssey Summary
Updated 1191d ago
0.0(0)
note
APUSH
Updated 692d ago
0.0(0)
note
Social Stratification
Updated 1372d ago
0.0(0)
note
AP Biology Course Review Part 4
Updated 1628d ago
0.0(0)
note
1.1 Understanding Social Problems
Updated 1100d ago
0.0(0)
note
APUSH Exam Review
Updated 677d ago
0.0(0)
note
The Cell
Updated 1268d ago
0.0(0)
note
The Odyssey Summary
Updated 1191d ago
0.0(0)
note
APUSH
Updated 692d ago
0.0(0)

Explore top flashcards

flashcards
Test 1
334
Updated 392d ago
0.0(0)
flashcards
histo cell structure final nmu
92
Updated 620d ago
0.0(0)
flashcards
LESSON 1: LITERARY CRITICISM
20
Updated 786d ago
0.0(0)
flashcards
ap psych final
106
Updated 1185d ago
0.0(0)
flashcards
Into and W. Study
46
Updated 1196d ago
0.0(0)
flashcards
Exploring the Bible Exam 3
66
Updated 853d ago
0.0(0)
flashcards
Etymology (in Literature)
21
Updated 1031d ago
0.0(0)
flashcards
Test 1
334
Updated 392d ago
0.0(0)
flashcards
histo cell structure final nmu
92
Updated 620d ago
0.0(0)
flashcards
LESSON 1: LITERARY CRITICISM
20
Updated 786d ago
0.0(0)
flashcards
ap psych final
106
Updated 1185d ago
0.0(0)
flashcards
Into and W. Study
46
Updated 1196d ago
0.0(0)
flashcards
Exploring the Bible Exam 3
66
Updated 853d ago
0.0(0)
flashcards
Etymology (in Literature)
21
Updated 1031d ago
0.0(0)