Basic of multiple regression

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/32

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

33 Terms

New cards

the difference between Simple and multiple regression

simple regression compares if line fits data, multiple regression compares if multiple variance is worth the trouble compared to simple regression

New cards

Assumption of Multiple Regression: the relationship between dependent variable and independent variable is

liner

New cards

Assumption of Multiple Regression:is the independent variable random?

New cards

Assumption of Multiple Regression: relationship between two or more independent variable

there is no define liner relationship between them

New cards

Assumption of Multiple Regression: Expected value of the error term equals to

New cards

autocorrelation

when a time series model next value is determine by the previous value. Model accuracy is reduced, estimated standard error is overrated

New cards

which test tests for auto correlation

durbin watson (DW)

New cards

multicollinarity, and how is t test and standard error

2 or more independent variables are highly correlated , high standard error low t test

New cards

Heteroskedasticity

variance of the error term is not constant

New cards

Logistic regression predicts

true or false; and fits a S shape progression ; can use continues data like (size, length) and also descrete date like true false

New cards

The normal Q-Q plot is useful for

exploring whether the residuals are normally distributed (it can be because of heteroskedety, but it could also be not normal because of outliers and other factors)

New cards

A pairwise scatterplot is used to detect whether

there is a linear relationship between the dependent and independent variables

New cards

AIC and BIC, lower better or higher

lower

New cards

AIC and BIC is each used for

AIC is used if the goal is to have a better forecast. BIC is used if the goal is a better goodness of fit.

New cards

what does it mean if adjusted R2 is lower

meaning adding the last variable does not make the model better

New cards

difference in T test and F test

T test compares mean of two group to see if they are different, F test compares Variance of two group to see if they are different . ie check if two stock price is different (t test) vs if the volatility of two stock is different

New cards

can R2 detect the statistical difference of coefficient

New cards

a poor model can have high R2 because of

overfitting

New cards

adjusted R2 penalizes extra

factor added to the model

New cards

R2 increase when t test

>\1\

$<p>>\1\ </p>$

New cards

f test if all independent variable explain dependent variable

New cards

reject the null if F value

> critical value

New cards

what does rejecting null mean

at least one of the variable is doing a good job

New cards

p value is

how confident you are at you at you model

New cards

p value ranges from —- lower better or higher better

0-1 , lower the better

New cards

a p value of 0.05 which is commonly used threshold means

if we run a bunch of experiment, 5% at a time it is wrong (false positive)

New cards

can adjusted r2 be negative or decline

When a new independent variable is added, adjusted R² can decrease if adding that variable results in only a small increase in R². In fact, adjusted R² can be negative, although R² is always nonnegative. Adjusted R² can be negative as well as decline.