Regression Analysis and Linear Regression

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/36

flashcard set

Earn XP

Description and Tags

Flashcards covering key vocabulary and concepts from the lecture notes on Regression Analysis and Linear Regression.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

37 Terms

1
New cards

Response Variable (Y)

The variable you are trying to predict or explain.

2
New cards

Predictive Variables (Xs)

Variables used to predict the response variable. There can be one or more.

3
New cards

Generalizability

Evaluating if a model is applicable to a new dataset, not just the one it was trained on.

4
New cards

Linear Relationship

A straight-line relationship between variables.

5
New cards

Non-linear Relationship

A relationship between variables that is not a straight line.

6
New cards

Simple Linear Regression

A regression with one predictive variable and one response variable.

7
New cards

Multiple Linear Regression

A regression with multiple predictive variables and one response variable.

8
New cards

Reasons to do Regression

Want to determine if there is a relationship between predictor and response, estimate new values of the response, and test hypotheses.

9
New cards

Line of Best Fit

A line that best represents the data points in a scatter plot.

10
New cards

Residual

The distance between the actual observed data point and the line of best fit.

11
New cards

Sum of Squared Residuals

The sum of the squared distances between each data point and its predicted value on the regression line. Regression aims to minimize this.

12
New cards

Galton and Pearson

Credited with inventing regression and putting together the theory of least squares.

13
New cards

Pearson's Correlation Coefficient

A numerical method to check for linearity between variables, ranging from -1 to +1.

14
New cards

Intercept

The intercept point where the regression line crosses the y-axis

15
New cards

Slope

The rate of change in the dependent variable for every one-unit increase in the independent variable.

16
New cards

Residual (Error)

The difference between the actual value and the predicted value in a regression model.

17
New cards

Method of Least Squares

A method of estimating the parameters in a statistical model by minimizing the sum of the squares of the residuals.

18
New cards

Y-hat (ŷ)

Used to denote a predicted value in regression analysis.

19
New cards

Lm Function

Function used to fit linear models, including both simple and multiple linear regression.

20
New cards

Assumptions of Linear Regression (LINE)

Linearity, Independence, Normality, Equal Variance.

21
New cards

Linearity (Assumption)

The assumption that there is a straight-line relationship between the predictor and response variables.

22
New cards

Independence (Assumption)

The assumption that observations are independent of one another.

23
New cards

Normality (Assumption)

The assumption that the residuals are normally distributed.

24
New cards

Equal Variance (Homoscedasticity) (Assumption)

The assumption that the variance of the residuals is constant across all levels of the predictor variables.

25
New cards

par(mfrow = c(rows, columns))

Arrange plots in R.

26
New cards

Cook's Distance

The distance from the data which shifts the line; when seen transformation may be needed.

27
New cards

Hat PC

Used for Linear Regression models.

28
New cards

Collinearity

The degree to which the independent variables in a multiple regression model are correlated.

29
New cards

ANOVA (Analysis of Variance)

A statistical test used to assess how well the model fits the data; partitioning variance into sums of squares for the residual and for the error.

30
New cards

R-squared

A measure of the proportion of variance in the dependent variable that can be predicted from the independent variable(s).

31
New cards

Adjusted R-squared

A modified version of R-squared that adjusts for the number of predictors in the model.

32
New cards

Predict Function

A function in R used to generate predictions from a fitted model.

33
New cards

Transformation

Altering the mathematical scale of a variable to better meet the assumptions of a statistical model or improve the relationship between variables.

34
New cards

Back Transformation

Converting data back to its original scale after a transformation has been applied.

35
New cards

Natural Log Transformation

A type of transformation often used to address issues of non-linearity or non-constant variance.

36
New cards

Multiple Linear Regression Equation

A linear regression equation with multiple independent variables.

37
New cards

Partial Regression Coefficient

The change in the response variable associated with a one-unit increase in a specific predictor, holding all other predictors constant.