Pitfalls of Regression (slides 43-52)

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/6

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

7 Terms

1
New cards

What is collinearity?

Two predictor variables are highly correlated, or even perfectly related to one another. This messes up the model.

2
New cards

What is extrapolation?

The model is used to make predictions outside the observed range of data.

  • If this happens with the intercept, we can use “mean centering” to make it more interpretable or meaningful.

    • Subtract the mean of the X variable from every individual of X

3
New cards

What are outliers?

Values that are far away from most of the other values in the dataset.

4
New cards

What does leverage describe?

Describes an X value that is far away from its peers.

5
New cards

What does discrepancy describe?

Describes a Y value that is far away from its peers.

6
New cards

What happens when a data point has both high leverage and high discrepancy?

  • Takes on a high “influence

7
New cards

What is heteroscedasticity?

The model is more accurate for some values (ex. low ones) than others (ex. high ones)

  • We want to see homoscedasticity, meaning “constant variance”

  • We expect the range of errors to be the same regardless of whether we’re dealing with high values or low ones. We expect the variability or variance in the errors to be constant.