EXS 407 exam 2: multiple regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/24

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

What does a coefficient of determination (r²) of 0.73 indicate?

It indicates that 73% of the variance in the dependent variable is explained by the independent variables.

2
New cards

What is multiple correlation?

It quantifies the degree of relationship/association between a function of independent variables and one dependent variable

-represented by a coefficient (R) between 0.00 and 1.00.

3
New cards

What does an R value of 0.00 signify?

It signifies no correlation or relationship between independent variables and the dependent variable.

4
New cards

What does an R value of 1.00 signify?

It signifies a perfect correlation where independent variables completely explain the dependent variable.

5
New cards

What is partial correlation?

It quantifies the relationship between an independent variable and a dependent variable after removing the effect of another variable (covariate).

r^2yx1,x2=partial coefficient of determination: variance in Y explained by X1 after removing the effects of X2 on both

6
New cards

What are the assumptions for using multiple/partial correlation?

1. Continuous (interval or ratio) scale for both variables

2. Independence of variable pairs

3. Approximate normal distribution of both variables

4. Linear relationship between variables

5. No outliers

7
New cards

What is the purpose of multiple linear regression?

To predict one dependent variable from multiple predictor variables and find the most satisfactory equation for prediction of the DV (lowest SEe)

-more predictor variables can increase or not change the predictive power

8
New cards

What is the general form of a multiple linear regression equation?

Y = a + b1X1 + b2X2 + ... + bkXk, where Y is the dependent variable, a is the Y-intercept, and b's are the slope coefficients.

9
New cards

What is hierarchical multiple regression?

-the researcher controls the model equation and which predictors are included

-often used for hypothesis testing (not DV predictions)

10
New cards

What is forward selection in multiple regression?

A method that starts with the intercept only and adds predictors one-by-one based on their contribution to the model.

11
New cards

What is backward elimination in multiple regression?

A method that starts with all predictors and eliminates them one-by-one based on their contribution to the model.

12
New cards

What is stepwise regression?

A method that starts with just the intercept, then combines forward selection and backward elimination, allowing previously entered variables to be removed.

13
New cards

process of forward selection

IV with the strongest correlation (highest r) to the DV is added first

-the IV that increases r^2 the most is added next (explains the most unique variance)

-continue with the next strongest r^2

14
New cards

process of backward elimination

-all IV are added initially

-IVs are eliminated in order that their removal decreases explained variance the least, stop when theres a significant reduction in explained variance

15
New cards

Process of Stepwise Regression

-IVs are added as in forward selection, but they can be eliminated if the r^2 is not affected

-con: need a larger sample size

16
New cards

assumptions of multiple regression

1. Linear relationships

2. Independent variables

3. Approx normally distributed

4. The variance of the residuals must be consistent across the range of IV values (homoscedasticity)

5. Independent variables should not be correlated with each other (multicollinearity)

17
New cards

What is multicollinearity?

It occurs when independent variables are correlated with each other= inflated confidence intervals and unstable coefficient estimates (B1 is meaningless)

-no threshold of acceptable multicollinearity, but VIF>10 is suspicious

-ideally there is no correlation of IVs (explain unique amounts of variability)

<p>It occurs when independent variables are correlated with each other= inflated confidence intervals and unstable coefficient estimates (B1 is meaningless)</p><p>-no threshold of acceptable multicollinearity, but VIF&gt;10 is suspicious</p><p>-ideally there is no correlation of IVs (explain unique amounts of variability)</p>
18
New cards

What is the variance inflation factor (VIF)?

A measure used to detect multicollinearity; a VIF greater than 10 is considered suspicious.

19
New cards

singularity

two IVs are perfectly related, r=1, usually because one was derived from the other

20
New cards

What is cross-validation in regression analysis?

A process of testing regression equations on a separate sample to ensure accuracy in predictions.

-can accomplish by splitting dataset in half and running model on the other half

21
New cards

What is the expected outcome when applying models to different samples?

Higher prediction errors are expected because the model is optimal for the training data.

-the multiple correlation coefficient undergoes shrinkage

22
New cards

What statistic is used to make decisions about adding or removing predictors in regression?

r^2

23
New cards

What is the significance of the slope coefficients in a regression equation?

They represent the weight or impact of each independent variable on the dependent variable.

24
New cards

What does the term 'unexplained variance' refer to in regression analysis?

It refers to the portion of variance in the dependent variable that is not accounted for by the independent variables.

25
New cards

What is the importance of checking for outliers in regression analysis?

Outliers can distort correlation coefficients and affect the reliability of the regression model.