LDV definition
A dependent variable is limited if the values it can take are restricted
LDV in practice
Most economic variables are limited, but they do not all need special treatment – we may treat them simply as continuous.
• e.g. many cannot be negative (GDP)
It is only when the restriction is substantial that applying OLS (or another linear estimator) leads to problems
Why LDVs cause problems
The leading case is a binary (dummy) dependent variable: the fitted values from a linear model are then probabilities, and nothing constrains them to lie between 0 and 1
How to correct LDV problems
linear probability model (LPM)
a model in which the probability that y (a dummy) takes the value 1 is a linear function of the x's
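In equation form, the standard LPM specification (estimated by OLS) is:
p(y = 1 | x) = β0 + β1x1 + … + βkxk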
Interpreting OLS results for the LPM
Each coefficient shows the gain / reduction in the probability that y = 1 (y is a dummy) from a one-unit change in the corresponding x
Don't bother looking at R²
How is goodness of fit measured for LPM (R² not used)
goodness of fit is measured by the percent of within-sample predictions that are “correct”
Goodness of fit = (203 + 350) / 753 ≈ 0.734 (e.g. 203 of the observations with y = 1 and 350 with y = 0 predicted correctly, out of 753)
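A minimal sketch of this calculation in Python (simulated data for illustration; the names and numbers are not from the deck):

```python
import numpy as np
import statsmodels.api as sm

# Simulate a binary outcome; n = 753 to echo the example above
rng = np.random.default_rng(0)
n = 753
x = rng.normal(size=n)
y = (0.5 + 0.8 * x + rng.logistic(size=n) > 0).astype(int)

# Linear probability model: OLS with a dummy dependent variable
lpm = sm.OLS(y, sm.add_constant(x)).fit()

# Predict y = 1 whenever the fitted probability is >= 0.5,
# then take the share of correct within-sample predictions
y_pred = (lpm.fittedvalues >= 0.5).astype(int)
print("Percent correctly predicted:", (y_pred == y).mean())
```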
Heteroscedasticity and LPM
The variance of the error term depends on x → heteroscedasticity: for the LPM, Var(u|x) = p(x)[1 − p(x)], where p(x) is the response probability
• The estimated coefficients are unbiased and consistent, but our inferences may be wrong owing to biases in the estimated standard errors
• Work using simulations suggests that these biases in the s.e.s are rarely large
• In Stata we can adjust the s.e.s for heteroscedasticity, so in practice this isn't usually a problem
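The same robust adjustment is available outside Stata; a minimal statsmodels sketch on simulated data (illustrative, not the deck's own example):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = rng.normal(size=500)
y = (0.3 + 0.6 * x + rng.logistic(size=500) > 0).astype(int)

# LPM with heteroscedasticity-robust (White / HC1) standard errors,
# the analogue of Stata's ", robust" option
lpm_robust = sm.OLS(y, sm.add_constant(x)).fit(cov_type="HC1")
print(lpm_robust.bse)  # robust s.e.s; the coefficients themselves are unchanged
```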
Why we use probit / logit models
Preferably, we want the predicted conditional response probability to always fall between 0 and 1 (never negative, never greater than 1)
We ensure this by using a nonlinear function
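Concretely, the linear form is replaced by
p(y = 1 | x) = G(β0 + β1x1 + … + βkxk), with 0 < G(z) < 1 for all z,
so the predicted probability can never leave the unit interval.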
Logit / probit vs LPM
Unlike the LPM, the predicted probabilities always lie strictly between 0 and 1 and the marginal effects vary with x instead of being constant; the cost is that estimation requires MLE rather than OLS
Logit model
p(y = 1 | x) = G(β0 + β1x1 + … + βkxk) with G(z) = exp(z) / [1 + exp(z)], the logistic CDF
Probit model
p(y = 1 | x) = G(β0 + β1x1 + … + βkxk) with G(z) = Φ(z), the standard normal CDF
Deriving Logit / Probit models from underlying latent variable models
Graphical representation of probabilities and logit / probit models
Why y (our dependent variable) is a function of xj
We are interested in the effects of changes in the explanatory variables, xj , on the conditional response probability
the latent variable y* is a linear function of xj
y is a nonlinear function of y*, so it is also a nonlinear function of the xj
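The standard derivation, assuming the error e is independent of the x's and symmetrically distributed with CDF G:
y* = β0 + β1x1 + … + βkxk + e, with y = 1 if y* > 0 and y = 0 otherwise
p(y = 1 | x) = P(e > −xβ) = 1 − G(−xβ) = G(xβ)
Taking G to be the logistic CDF gives the logit model; taking G = Φ gives the probit.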
Effect of ∆xj on prob of y = 1
G is a monotonically increasing function → anything that causes y* to increase also causes p(y = 1) to increase
The marginal effect or partial effect of a continuous variable xj on the probability of y = 1
given by the partial derivative of our probit or logit model with respect to xj (same as in linear models)
Magnitude of marginal effect
depends on the values of all the x's at which we evaluate it
because g(z) > 0 for all z, the marginal effect always has the same sign as βj
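Written out, with g(z) = dG(z)/dz (the density corresponding to G):
∂p(y = 1 | x)/∂xj = g(β0 + β1x1 + … + βkxk) · βj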
Marginal effect of a continuous variable in the LPM
∂p(y = 1 | x)/∂xj = βj, a constant that is the same at every x
Marginal effect of a continuous variable in a logit / probit model
∂p(y = 1 | x)/∂xj = g(β0 + β1x1 + … + βkxk) · βj, which varies with the x's
Binary X’s - effect of switching from 0→1
The partial effect is the difference in the predicted probability at xj = 1 and at xj = 0; it depends on the values of x2, x3, etc.
What to report with linear and nonlinear models
Usually, one reports:
• the marginal effects of the continuous explanatory variables evaluated at the means of all the explanatory variables
• the partial effects of binary explanatory variables switching from 0 to 1 evaluated at the means of all the other explanatory variables
With the LPM it doesn't matter where you start: the marginal effect is constant and does not depend on x
With a nonlinear function it matters where we start, since the effect varies across values of x and isn't constant
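A useful reference point: at xβ = 0, where the predicted probability is 0.5, g(0) = 0.25 for the logit and g(0) = φ(0) ≈ 0.40 for the probit, so the marginal effect there is roughly βj/4 (logit) or 0.4·βj (probit), and it shrinks as we move into the tails.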
Can we use OLS for nonlinear models?
No, we need to use maximum likelihood estimation (MLE)
maximum likelihood estimation (MLE)
Find the values of the βj that maximize the likelihood of the observed yi for our sample, given the x for our sample
Max the probability that our observed y's come from the model, given the observed x's
We maximize the sum of the log-likelihoods, L
L ≤ 0 because likelihoods are always between 0 and 1 so their logs are always negative
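For a binary response the function being maximized is
L(β) = Σi [ yi · ln G(xiβ) + (1 − yi) · ln(1 − G(xiβ)) ]
where each term is the log of a probability, hence ≤ 0.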
Logit / Probit estimators
No closed-form formula, unlike OLS
Consistent (converging to the true values as n grows large) + asymptotically normal + asymptotically efficient + the heteroscedasticity is built into the model, so no correction is needed
Likelihood ratio test
a measure of how the maximized log-likelihood changes when we drop some of the x's
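The statistic is LR = 2(Lur − Lr), where Lur and Lr are the unrestricted and restricted log-likelihoods; under H0 it is distributed χ² with q degrees of freedom, q being the number of x's dropped.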
Goodness of fit for MLE - percent correctly predicted
We can use “percent of observations correctly predicted” as our goodness of fit measure (just like for the LPM)
Goodness of fit for MLE options
Percent correctly predicted
Pseudo R²
Goodness of fit for MLE - pseudo R²
pseudo-R² = 1 − Lur / L0, where Lur is the log-likelihood of the full model and L0 that of a model with only an intercept; since both are negative, it lies between 0 and 1
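In statsmodels this is reported as McFadden's pseudo-R²; a minimal sketch on simulated data (illustrative only):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=500)
y = (0.2 + 0.9 * x + rng.logistic(size=500) > 0).astype(int)

logit = sm.Logit(y, sm.add_constant(x)).fit(disp=0)
print(logit.llf)        # Lur: log-likelihood of the fitted model
print(logit.llnull)     # L0: log-likelihood with only an intercept
print(logit.prsquared)  # 1 - llf/llnull, McFadden's pseudo-R²
```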
Interpreting logit / probit estimates - Sign
The sign of an estimated coefficient indicates the sign of the marginal or partial effect of a change in the corresponding variable
• Can interpret it as usual
Interpreting logit / probit estimates - Significance
β̂j / se(β̂j) = z-statistic
An estimated coefficient β̂j divided by its standard error yields the z-statistic corresponding to the null hypothesis that xj has no effect on y
Just like for OLS, can interpret it as usual (like a t-statistic)
Interpreting logit / probit estimates - Magnitude
The partial or marginal effects depend on x
The same is true of the estimated effect of any change in an xj
Default option: reporting marginal effects
The marginal effects of the continuous explanatory variables evaluated at the sample means of all the explanatory variables
Why this is the default option
The magnitudes of the probit / logit estimates show the effect of changes in xj on y
But those effects depend on x, so we must choose values of x at which to evaluate them
By default we evaluate the effects at the sample means of all the explanatory variables
Partial (or marginal) effects at the average (PEA)
The estimated marginal / partial effects for the sample-average individual
We take a fictional individual whose characteristics equal the sample means of all the explanatory variables and compute the marginal effects for that person
Average partial effects (APE)
each individual's partial effect is computed at that individual's own observed values of the explanatory variables → then averaged across the sample
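Both are one call in statsmodels via get_margeff (a sketch; the API calls are real, the data are simulated for illustration):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
x1 = rng.normal(size=500)
x2 = rng.normal(size=500)
y = (0.5 * x1 - 0.7 * x2 + rng.normal(size=500) > 0).astype(int)

X = sm.add_constant(np.column_stack([x1, x2]))
probit = sm.Probit(y, X).fit(disp=0)

# PEA: effects evaluated at the sample means of the explanatory variables
print(probit.get_margeff(at="mean").summary())

# APE: each individual's effect at their own observed x's, averaged
print(probit.get_margeff(at="overall").summary())
```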