Notes designed to complement ECON20110/30370 lectures.
Focus on mathematical aspects of econometrics.
Lecture slides emphasize application and intuition.
Report typos or errors via email.
7.1 Causal Models and Endogeneity
Distinction between descriptive and causal analysis.
Causal analysis essential for evaluating interventions.
Counterfactual approach to identify causal relationships.
Regression can aim for prediction or causal relationships.
Prediction Focus:
OLS used as a descriptive tool.
Orthogonality condition holds, enabling consistent estimators.
Issues like omitted variable bias do not affect predictions.
Causal Analysis Focus:
Identify and measure causal influences.
Essential to control for confounding factors (ceteris paribus).
Key model where outcome variable Y is influenced by multiple variables.
Identify if E(ε) = 0 and cov(X_l, ε) = 0.
Exogeneity condition crucial for consistent estimation of causal effects.
Endogeneity arises when explanatory variables are correlated with the error term.
Challenges conventional regression assumptions.
Biased OLS estimates when orthogonality fails.
Common Sources:
Omitted variable bias.
Wrong functional form.
Measurement error.
Simultaneous causality bias.
Sample selection bias.
OVB occurs when a key determinant is omitted, affecting observed relationships.
OVB Formula:
Derived under conditions of OVB; critical to understand the direction of bias.
Example: Representing wages as a function of years of education and ability.
Impact of omitting ability on estimated effect of education.
Arises when the model fails to summarize the true causal relationship.
Need for careful evaluation of model specifications.
Distortion from inaccuracies in data collection (e.g., survey errors).
Impacts reliability of regression analyses.
Measurement error in X affects estimates through bias.
Underestimates true effects due to measurement error, particularly concerned in regression models.
Measurement error in Y increases error variance without introducing bias if uncorrelated with X.
Occurs when independent variables influence each other.
Example: Demand and supply equations; challenges the traditional causal model assumptions.
Different effects of missing data:
Missing Data Completely at Random: No bias induced.
Missing Data Conditional on X: Unbiased but less precise.
Sample Selection Bias: Correlation with Y leads to biased estimates.
Strategies include excluding omitted variables, using instrumental variables (IV), and performing RCTs.
OLS implications when variables exhibit endogeneity.
Necessary conditions for IV to be effective include relevance, exogeneity, and exclusion.
Method for obtaining consistent estimates in the presence of endogeneity.
Used for estimating causal models using the IV technique.
The quality of IV selection and the model structure affects results.
Tests to validate the instruments used in IV analysis, emphasizing the need for theoretical grounding alongside empirical results.