7. Multiple Linear Regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
full-widthPodcast
1
Card Sorting

1/25

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts and vocabulary related to multiple linear regression, fundamental for understanding the material focused on in Chapter 7.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

26 Terms

1
New cards

Multiple Linear Regression

A statistical technique that models the relationship between a dependent variable and multiple independent variables.

2
New cards

Explanatory Variable

An independent variable that is used to explain variations in the dependent variable.

3
New cards

Response Variable

The dependent variable that researchers are trying to predict or explain.

4
New cards

Confounding Variable

An outside influence that changes the effect of a dependent and independent variable.

5
New cards

ANOVA (Analysis of Variance)

A statistical method used to test differences between two or more group means.

6
New cards

Interaction in Regression

Occurs when the effect of one independent variable on the dependent variable changes depending on the level of another independent variable.

7
New cards

Residuals

The differences between observed and predicted values in a regression model.

8
New cards

R-squared (R)

A statistical measure that represents the proportion of variance for the dependent variable that's explained by the independent variables.

9
New cards

Adjusted R-squared

A modified version of R-squared that adjusts for the number of predictors in the model, providing a more accurate measure for multiple regression.

10
New cards

Hypothesis Testing in Regression

A method used to determine whether there is enough statistical evidence in a sample to infer that a certain condition holds true for the entire population.

11
New cards

Statistical Significance

A determination that the observed data would be very unlikely under the null hypothesis.

12
New cards

Model Selection

The process of selecting a statistical model from a set of candidate models based on how well they explain the data.

13
New cards

Categorical Predictors

Variables that represent categories, often used in regression to assess the impacts of different groups.

14
New cards

Least Squares Estimation

A method to estimate the coefficients of a linear regression model which minimizes the sum of the squares of the residuals.

15
New cards

Multicollinearity

A statistical phenomenon where two or more predictors in a model are highly correlated, leading to unreliable estimates.

16
New cards

Standard Error of Coefficient

An estimate of the variability of a regression coefficient, indicating how much the coefficient varies from the actual average value.

17
New cards

F-statistic

A ratio used in ANOVA and regression analysis to compare model fits, indicating the overall quality of the model.

18
New cards

Prediction Interval

An estimate of how far observed outcomes deviate from predicted values in a regression model.

19
New cards

Confidence Interval

A range of values derived from a data set that is likely to contain the true value of an unknown population parameter.

20
New cards

Assumptions of Multiple Linear Regression

Key conditions that should be met for the model to be valid: linearity, independence of residuals, normality of residuals, and homoscedasticity.

21
New cards

Dummy Variables

Binary variables (0 or 1) used to represent categorical predictors with more than two levels in a regression model.

22
New cards

Heteroscedasticity

A violation of regression assumptions where the variance of the residuals is not constant across all levels of the independent variables.

23
New cards

Variance Inflation Factor (VIF)

A measure used to detect multicollinearity by quantifying how much the variance of an estimated regression coefficient is inflated due to correlation with other predictors.

24
New cards

P-values in Hypothesis Test

P < 0.05 : reject H0

P > 0.05 : fail to reject H0

25
New cards

Residual Confounders

Possible predictors that may be confounders but have not been examined - can be other variables in a dataset that have not been examined or measured.

26
New cards

Scatterplot Matrix

Useful for visualizing the relationship between the predictor and response variables, as well as the relationship between predictors