Applied econometrics lecture 15 causality


23 Terms

New cards

Non-random sampling

where the sample is not drawn randomly from the population of interest

We assume random sampling

New cards

Causal questions: splitting the problem into 2

Identification

Statistics

New cards

splitting the problem into 2 - identification

What could we learn about the parameters we care about (causal effects) if we had the observable data for the entire population

Need to make assumptions about how observed outcomes relate to outcomes that would have been realized under different treatments

New cards

splitting the problem into 2 - stats

What can we learn about the full population that we care about from the finite sample that we have?

Need to understand the process by which our data is generated from the full population

New cards

Indicator for treatment

D_i

New cards

Outcome under treatment

Y_i(1)

New cards

Outcome under control

Y_i(0)

New cards

observed outcome

Y_i = D_i Y_i(1) + (1 - D_i) Y_i(0)

When D_i = 1 we only observe Y_i(1); when D_i = 0 we only observe Y_i(0)
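
A minimal sketch of the observed-outcome rule, using made-up numbers. The arrays `y0`, `y1`, and `d` and the constant effect of 3 are illustrative assumptions, not values from the lecture.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5

# Hypothetical potential outcomes (illustration only, not lecture data)
y0 = rng.normal(loc=20.0, scale=2.0, size=n)  # Y_i(0): outcome under control
y1 = y0 + 3.0                                 # Y_i(1): outcome under treatment
d = np.array([1, 0, 1, 0, 0])                 # D_i: treatment indicator

# Observed outcome: Y_i = D_i * Y_i(1) + (1 - D_i) * Y_i(0)
y_obs = d * y1 + (1 - d) * y0

# The individual treatment effect needs both potential outcomes,
# so it cannot be computed from (y_obs, d) alone
te = y1 - y0
```

For each unit only one of `y0[i]` and `y1[i]` ends up in `y_obs`, which is why `te` is known here only because the data are simulated.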

New cards

Example estimator

<img src="https://knowt-user-attachments.s3.amazonaws.com/13a96e5c-4f97-40cc-a935-330aafe8403c.png" data-width="100%" data-align="center" alt="knowt flashcard image"><p></p>

New cards

Example estimand

<img src="https://knowt-user-attachments.s3.amazonaws.com/d5b218cc-633a-453b-bca7-b179ed1697f3.png" data-width="100%" data-align="center" alt="knowt flashcard image"><p></p>

New cards

Example target parameter

We can't assume the 2nd term is equal to the earnings of NTU grads at NTU, because of selection bias / omitted variables

(other factors that affect earnings regardless of where they went to uni)

<img src="https://knowt-user-attachments.s3.amazonaws.com/170239e9-c0e3-4ada-bf3c-23b48e6cc138.png" data-width="100%" data-align="center" alt="knowt flashcard image">

New cards

Individual treatment effect (TE)

Y_i (1) - Y_i (0)

New cards

Average treatment effect (ATE)

average causal effect of treatment over the whole population, regardless of which group the unit is in

(effect of attending UoN instead of NTU)

E( Y_i (1) - Y_i (0) )

New cards

Effect of treatment on treated (TOT)

average causal effect of treatment among those who actually received the treatment

(of attending UoN among those who attended UoN)

E( Y_i (1) - Y_i (0) | D_i = 1) = E( Y_i (1) | D_i = 1) - E( Y_i (0) | D_i = 1)

New cards

Selection bias

<img src="https://knowt-user-attachments.s3.amazonaws.com/e16c6f90-1f3e-40fa-ae7c-e694746bdfac.png" data-width="100%" data-align="center" alt="knowt flashcard image"><p></p>
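
The card image is not available as text. In the notation of the cards above, the standard decomposition of the naive treated-versus-untreated comparison is the following (an assumption about what the image shows, not a transcription of it):

```latex
\begin{aligned}
E(Y_i \mid D_i = 1) - E(Y_i \mid D_i = 0)
  &= \underbrace{E(Y_i(1) - Y_i(0) \mid D_i = 1)}_{\text{TOT}} \\
  &\quad + \underbrace{E(Y_i(0) \mid D_i = 1) - E(Y_i(0) \mid D_i = 0)}_{\text{selection bias}}
\end{aligned}
```

The identity follows by adding and subtracting E( Y_i (0) | D_i = 1) on the right-hand side of the observed difference in means.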

New cards

Does having health insurance make you healthier

As the extra characteristics are statistically different between men with HI and without HI, it makes us believe that the outcome will be different for the 2 populations

The observed difference could come from the selection bias term


New cards

Random assignment and selection bias

When treatment status is randomly assigned, selection bias disappears

The average potential outcome without treatment is the same for those who are not treated as for those who are treated: E( Y_i (0) | D_i = 0) = E( Y_i (0) | D_i = 1)
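
A small simulation sketch of this claim. The data-generating process (an unobserved `ability` trait, a constant treatment effect of 1) is a hypothetical setup chosen for illustration, not something taken from the lecture.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 200_000

ability = rng.normal(size=n)                     # hypothetical unobserved trait
y0 = 10.0 + 2.0 * ability + rng.normal(size=n)   # Y_i(0)
y1 = y0 + 1.0                                    # Y_i(1): constant effect of 1

def naive_diff(d):
    """Difference in observed mean outcomes, treated minus untreated."""
    y = d * y1 + (1 - d) * y0
    return y[d == 1].mean() - y[d == 0].mean()

def selection_bias(d):
    """E(Y_i(0) | D_i = 1) - E(Y_i(0) | D_i = 0), computable only in a simulation."""
    return y0[d == 1].mean() - y0[d == 0].mean()

# Self-selection: higher-ability units are more likely to take the treatment
d_self = (ability + rng.normal(size=n) > 0).astype(int)
# Random assignment: a coin flip that ignores ability
d_rand = rng.integers(0, 2, size=n)

print("self-selected:", naive_diff(d_self), selection_bias(d_self))
print("randomized:   ", naive_diff(d_rand), selection_bias(d_rand))
```

Under self-selection the naive difference overstates the true effect by the selection bias term; under the coin flip the bias term is close to zero and the naive difference is close to 1.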

New cards

Random assignment and the Uni example

Suppose that UoN & NTU administration randomized who got into which uni

Since university is randomly assigned, the only thing that differs between UoN and NTU students is the university they went to

<img src="https://knowt-user-attachments.s3.amazonaws.com/7bbec1ec-6c3b-4763-9757-c83464eeea62.png" data-width="100%" data-align="center" alt="knowt flashcard image">

New cards

Random sampling vs Random assignment

Random sampling

facilitates statistical inference about population parameters, causal or otherwise
We generally assume it holds

Random assignment

supports causal inference, that is, comparisons of potential outcomes free of selection bias
achieved with RCTs (randomized controlled trials)
- RCTs would be immoral in many cases

New cards

Omitted variable bias (OVB)

<img src="https://knowt-user-attachments.s3.amazonaws.com/dfd0b46d-c10a-47ae-be36-5115b7e8a580.png" data-width="100%" data-align="center" alt="knowt flashcard image"><img src="https://knowt-user-attachments.s3.amazonaws.com/05430884-d773-4a19-9c92-ef1b47df38ac.png" data-width="100%" data-align="center" alt="knowt flashcard image"><img src="https://knowt-user-attachments.s3.amazonaws.com/9a5e90b6-e811-4f4f-9aef-93f9e3d9b7a0.png" data-width="100%" data-align="center" alt="knowt flashcard image"><p></p>

New cards

using OVB formula to investigate selection bias

The OVB formula is true for any short vs long regression comparison

Why would we want to run a long regression as opposed to a short regression:

Because the potential outcomes ( Y_i (0)’s ) are likely different between the treated and the untreated
Regression with the right controls reduces, and maybe even eliminates, selection bias arising from imbalances in Y_i (0)’s
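
A sketch of the short versus long comparison on simulated data. The OVB cards are images, so the notation here is an assumption: `b_short` and `b_long` are the coefficients on D_i in the short and long regressions, `gamma` is the long-regression coefficient on the omitted variable, and `delta` is the coefficient from regressing the omitted variable on D_i. The helper `ols` and all numbers are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 10_000

x = rng.normal(size=n)                            # hypothetical omitted variable
d = (x + rng.normal(size=n) > 0).astype(float)    # treatment correlated with x
y = 1.0 + 2.0 * d + 3.0 * x + rng.normal(size=n)  # true effect of d is 2

def ols(y, *regressors):
    """OLS coefficients with an intercept first, via least squares."""
    X = np.column_stack([np.ones(len(y)), *regressors])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

b_short = ols(y, d)[1]            # short regression: y on d
_, b_long, gamma = ols(y, d, x)   # long regression: y on d and x
delta = ols(x, d)[1]              # auxiliary regression: x on d

# OVB formula: short = long + (effect of omitted) * (omitted regressed on included)
print(b_short, b_long + gamma * delta)
```

The two printed numbers agree up to floating-point error because the formula is an in-sample algebraic identity, while `b_long` sits near the true effect of 2 only because the simulated `x` is the sole omitted variable.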

New cards

Removing selection bias by controlling for OVB

By controlling for the only omitted variable (correlated with our explanatory variable), we can recover the effect of treatment on the treated → we remove selection bias


New cards

Conditional independence assumption

When we have many variables, it is sufficient that D_i is ‘randomly assigned’ conditional on X_i to recover the causal parameter of interest

D_i ⊥ Y_i (…) | X_i

⊥ - independent of; | - conditional on

Sometimes (rarely in reality) controlling for the right X_i’s is sufficient to recover causal parameters of interest
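
A sketch of what conditional independence buys, under strong simulated assumptions: a single binary covariate `x`, treatment probabilities that depend only on `x`, and a constant effect of 1. None of these come from the lecture. Comparing treated and untreated units within each value of `x`, then averaging by the share of each value, is one simple way to use the assumption.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 200_000

x = rng.integers(0, 2, size=n)            # hypothetical binary covariate X_i
p_treat = np.where(x == 1, 0.8, 0.2)      # treatment probability depends only on X_i
d = (rng.random(n) < p_treat).astype(int)
y0 = 5.0 + 4.0 * x + rng.normal(size=n)   # Y_i(0) depends on X_i
y1 = y0 + 1.0                             # constant effect of 1
y = d * y1 + (1 - d) * y0

# Naive comparison mixes the treatment effect with differences in X_i
naive = y[d == 1].mean() - y[d == 0].mean()

# Within each X_i cell, treatment is as good as randomly assigned
within = [
    y[(d == 1) & (x == v)].mean() - y[(d == 0) & (x == v)].mean()
    for v in (0, 1)
]
weights = [np.mean(x == v) for v in (0, 1)]  # population share of each cell
adjusted = float(np.dot(within, weights))

print("naive:", naive, "adjusted:", adjusted)
```

The adjusted estimate lands near the true effect of 1 while the naive one does not. This works here only because the simulation guarantees that `x` is the one variable driving both treatment and Y_i (0).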