STA Class 5: Regression Assumptions

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/17

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

18 Terms

1
New cards
We use regression for __________ to predict Y given a particular set of values.
prediction
2
New cards
Regression is used for __________ to understand the relationship between Y and a particular predictor after holding constant other predictors.
adjustment
3
New cards
In regression, we use it for __________ to infer something about the population relationships between Y and the predictors based on the sample.
inference
4
New cards

Y is a linear function of the X’s, which means we expect __ _____ ___ to represent the overall pattern of the data.

a linear function
5
New cards

A violation of linearity indicates a trend or _____ _____ in the residuals.

discernible pattern
6
New cards
The prediction errors (residuals) in regression should be __________ distributed to satisfy independence.
normally
7
New cards
A violation of independence would indicate __________ amongst observations.
correlation
8
New cards

For predictions to be accurate within ± 2*RSE, the residuals must be ____ ______.

normally distributed
9
New cards
In a normal Q-Q plot, if the dots line up in a straight line, __________ is satisfied.
Normality
10
New cards
The variance of Y should be the same for any value of X, known as __________.
homoscedasticity
11
New cards

A violation of equal variance would indicate a ____/______ shape in the plots.

fan/funnel
12
New cards
All four line assumptions in regression are required for __________.
inference
13
New cards
If only __________ is satisfied in regression, we cannot put reliable Confidence Intervals on predictions.
linearity
14
New cards
Influential observations in a regression context are outliers that have __________ leverage.
high
15
New cards
For an observation to be influential, it needs both a large __________ and high leverage.
residual
16
New cards
Leverage refers to when an observation has a very unusual __________ value.
x
17
New cards
Outliers are points with large residuals, meaning they are unusual __________ for the given x.
y
18
New cards

High leverage points can inflate ____-______ and provide a false sense of confidence in the model.

R-squared