Quant

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/104

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:07 AM on 4/30/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

105 Terms

1
New cards

P-Value for Slopes

use the p-value to evaluate the null hypothesis that a slope coefficient is equal to zero.

The p-value is the smallest level of significance for which the null hypothesis can be rejected. We test the significance of coefficients by comparing the p-value to the chosen significance level:

  • If the p-value is less than the significance level, the null hypothesis can be rejected.

  • If the p-value is greater than the significance level, the null hypothesis cannot be rejected.

2
New cards

Assumptions underlying a multiple regression model include

  • A linear relationship exists between the dependent and independent variables.

  • The residuals are normally distributed.

  • The variance of the error terms is constant for all observations.

  • The residual for one observation is not correlated with that of another observation.

  • The independent variables are not random, and there is no exact linear relation between any two or more independent variables.

3
New cards

Goal of QQ plot

Check if residuals are normally distributed

Points deviating from the Q–Q line indicate that residuals are not normally distributed. The plot assesses normality, not correlation. Points below the line indicate that observed values are more negative than expected under normality, while points above the line indicate more positive values than expected.

4
New cards

Functional Form Model Misspecifications

Misspecification

Description

Effect

Omission of important independent variable(s)

Based on economic theory, one or more variables that should have been included are omitted.

Biased and inconsistent regression parameters

May lead to serial correlation or heteroskedasticity in the residuals

Inappropriate variable form

The relationship between the dependent and independent variables may be non-linear.

May lead to heteroskedasticity in the residuals

Inappropriate variable scaling

Variables may need to be transformed before estimating the regression.

May lead to heteroskedasticity in the residuals or multicollinearity

Data improperly pooled

Sample has periods of dissimilar economic environments (that should not be pooled).

May lead to heteroskedasticity or serial correlation in the residuals

5
New cards

More on Misspecifications

Problem

What goes wrong

Why it matters

Omitted variable (correlated)

Bias + inconsistency (other variables take credit)

Worst case

Omitted variable (uncorrelated)

Intercept bias only

Less severe

Wrong transformation

Nonlinear → forced linear

Poor fit

Bad scaling

Units distort meaning

Misleading comparisons

Pooling data

Different regimes mixed

Wrong conclusions

6
New cards

robust standard errors

Correcting Heteroskedasticity

To correct for conditional heteroskedasticity of regression residuals, we can calculate robust standard errors (also called White-corrected standard errors or heteroskedasticity-consistent standard errors). These robust standard errors are then used to recalculate the t-statistics using the original regression coefficients for hypothesis testing.

7
New cards

Serialcorrelation

Serialcorrelation, also known as autocorrelation, refers to a situation in which regression residual terms are correlated with one another; that is, not independent. Serial correlation can pose serious problem with regressions using time series data.

8
New cards

Positiveserial correlation

Positiveserial correlation exists when a positive residual in one time period increases the probability of observing a positive residual in the next time period.

9
New cards

Negativeserial correlation

Negativeserial correlation occurs when a positive residual in one period increases the probability of observing a negative residual in the next period.

10
New cards

lagged values and positive serial correlation

lagged values
- if lagged values is inaccurate and mess up the whole model and lead to inconsistency

  • if not lagged values and uses external variables then can never be inconssitent

postiive serial correlation

  • Type I error - false positive

11
New cards

Durbin-Watson (DW) statistic

Residual serial correlation at a single lag can be detected using the Durbin-Watson (DW) statistic.

12
New cards

Breusch-Godfrey (BG) test

more general test (which can accommodate serial correlation at multiple lags) is the Breusch-Godfrey (BG) test.

13
New cards

robuststandard errors

To correct for serial correlation in regression residuals, we can calculate robuststandard errors (also called Newey--West corrected standard errors or heteroskedasticity-consistent standard errors). These robust standard errors are then used to recalculate the t-statistics using the original regression coefficients.

14
New cards

most common sign of multicollinearity

The most common sign of multicollinearity is when t-tests indicate that none of the individual coefficients are significantly different than zero, but the F-test indicates that at least one of the coefficients is statistically significant, and the R2 is high. This suggests that the variables together explain much of the variation in the dependent variable, but the individual independent variables don't. This can happen when the independent variables are highly correlated with each other—so while their common source of variation is explaining the dependent variable, the high degree of correlation also "washes out" the individual effects.

15
New cards

What test to check Heteroskedasticity?

Breusch Pagan Chi²

  • regress residuals² back into the formula

  • BP = n * R² for residuals

    • df = k

  • H0: Homo Ha: Hetero

16
New cards

How to fix Hetero?

White-Correlated/Robert Standard error

17
New cards

What test for serial correlation

Durbin Watson - single

Breusch Godfrey - multi

  • regress residuals but with lagged variables

    • test if lagged variable slopes is stat sig

  • H0: no serial Corr Ha: serial corr

18
New cards

How to fix serial corr?

Newey-West/Robust SE

19
New cards

Multicollinearity

inflated SE

Type II

regress the variables with each other removing 1 at a time

VIF = 1/1-R²

low is good high is bad

20
New cards

high-leverage points

high-leverage points are the extreme observations of the independent (or 'X') variables.

The sum of the individual leverages for all observations is k + 1 (with intercept). If a variable's leverage is higher than three times the average, [3(k + 1) / n], it is considered potentially influential.

21
New cards

studentized residuals

We can identify outliers using the studentized residuals. The following steps outline the procedure:

  1. Estimate the regression model using the original sample of size n. Delete one observation and re-estimate the regression using (n - 1) observations. Perform this sequentially, for all observations, deleting one observation at a time.

  2. Compare the actual Y value of the deleted observation i to the predicted Y-values using the model parameters estimated with that observation deleted.

    e*i = Yi -∧Y*i

  3. The studentized residual is the residual in Step 2 divided by its standard deviation.

    t*i = e*is*e

  4. We can then compare this studentized residual to critical values from a t-distribution with n – k – 2 degrees of freedom (because we now only have n – 1 observations), to determine if the observation is influential.

22
New cards

qualitative dependent variable

Financial analysis often calls for the use of a model that has a qualitative dependent variable—a categorical variable, usually a binary variable, which takes on a value of either zero or one. An example of an application requiring the use of a qualitative dependent variable is a model that attempts to estimate the probability of default for a bond issuer. In this case, the dependent variable may take on a value of one in the event of default and zero in the event of no default.

23
New cards
24
New cards
25
New cards
26
New cards
27
New cards
28
New cards
29
New cards
30
New cards
31
New cards
32
New cards
33
New cards
34
New cards
35
New cards
36
New cards
37
New cards
38
New cards
39
New cards
40
New cards
41
New cards
42
New cards
43
New cards
44
New cards
45
New cards
46
New cards
47
New cards
48
New cards
49
New cards
50
New cards
51
New cards
52
New cards
53
New cards
54
New cards
55
New cards
56
New cards
57
New cards
58
New cards
59
New cards
60
New cards
61
New cards
62
New cards
63
New cards
64
New cards
65
New cards
66
New cards
67
New cards
68
New cards
69
New cards
70
New cards
71
New cards
72
New cards
73
New cards
74
New cards
75
New cards
76
New cards
77
New cards
78
New cards
79
New cards
80
New cards
81
New cards
82
New cards
83
New cards
84
New cards
85
New cards
86
New cards
87
New cards
88
New cards
89
New cards
90
New cards
91
New cards
92
New cards
93
New cards
94
New cards
95
New cards
96
New cards
97
New cards
98
New cards
99
New cards
100
New cards