R-squared in Linear Regression (Notes)

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/26

flashcard set

Earn XP

Description and Tags

Flashcards to review the concept of R-squared, explained/unexplained variation, and the worked example from the notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

27 Terms

1
New cards

What does R-squared measure in linear regression?

The percentage of variability of y that is explained by the model.

2
New cards

R-squared can be written as 1 minus the ratio of what two quantities?

Unexplained variation to total variation (SSE/SST).

3
New cards

What is the formula for the unexplained variation (SSE) in the notes?

SSE = (1/n) sum (y - y_hat)^2.

4
New cards

Why is the average of prediction errors not included in SSE?

Because the average prediction error is zero.

5
New cards

How is the total variation of y expressed?

As the variance of y: (1/n) sum (y - y_mean)^2.

6
New cards

What are the two variance conventions mentioned, and when do they apply?

1/n vs 1/(n-1); 1/n is population variance, 1/(n-1) is sample variance.

7
New cards

Do 1/n and 1/(n-1) affect the value of R-squared?

No; they cancel between numerator and denominator.

8
New cards

What is the x-values in the example?

x = 1, 2, 3, 4.

9
New cards

What are the corresponding y-values in the example?

y = 5, 6, 6, 9.

10
New cards

What is the regression model used in the example?

y_hat = 3.5 + 1.2 x.

11
New cards

What is SSE in the example?

SSE = 1.8.

12
New cards

How is the mean of y computed in the example?

Mean y = (5+6+6+9)/4 = 6.5.

13
New cards

What is SST in the example?

SST = 9.

14
New cards

How is R-squared calculated in the example?

R^2 = 1 - SSE/SST = 0.8.

15
New cards

What is the numerical value of R-squared in the example?

0.8 (or 80%).

16
New cards

What does an R-squared of 0.8 mean?

80% of the variance in y is explained by the model.

17
New cards

What does SSE measure conceptually?

Unexplained variation; the squared prediction errors.

18
New cards

What does SST measure conceptually?

Total variation of y around its mean.

19
New cards

What is y_hat?

The predicted value of y from the regression model.

20
New cards

How do you compute the predicted values for x=1..4 in the example?

Plug into y_hat = 3.5 + 1.2 x to get 4.7, 5.9, 7.1, 8.3.

21
New cards

What is the broader interpretation of R-squared?

The proportion of total variability in y explained by the model.

22
New cards

How does the example illustrate the R-squared comparison?

It compares how close y is to yhat versus to ymean.

23
New cards

What is the mean of y in the dataset?

6.5.

24
New cards

What is the predicted value when x equals 1 in the example?

4.7.

25
New cards

How many data points are used in the example?

Four.

26
New cards

What does an 80% explanation imply about model fit?

The model explains most but not all of the variability.

27
New cards

In summary, how are SSE and SST related to R-squared?

R^2 = 1 - SSE/SST; decreasing SSE relative to SST increases R^2 toward 1.