1/95
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
If we say E(y|x) = Bo + B1x, where Bo and B1 solve the population least-squares problem, then the CEF is the population regression BLANK and Bo and B1 are population regression BLANK
function, coefficients
the population regression function provides the best BLANK to the CEF
linear approximation
(simple regression model) the coefficient B1 measures the BLANK in y BLANK with a BLANK in x1, holding all of the unobservables constant
change, associated, unit change
(simple regression model) if Bo and B1 solve the population least-squared problem their values BLANK the expected value of the BLANK difference between the dependent variable and the CEF
minimize, squared
the value of B1 that solves the population least-squares problem is:
cov(xi yi)/ var(xi)
the OLS estimator for B1 can be obtained by plugging in the BLANK of xi and yi in BLANK and plugging in another BLANK for each outer expectations
sample averages, population average, sample average
if there were more than one x in (1), then the formula for B1 would be the BLANk, except xi1 would be replaced with the BLANk from a regression of xi1 on the other xs
same, residual
the BLANK theorem says you can control for other explanatory variables in estimating the effect of an x on y by either including the other variables directly or regressing y on the BLANK from a regression of x on the other variables
FWL, residuals
when the PRF includes more than one x, we say that B1 measures the BLANK effect of x1 (w/o necessary giving a casual interpretation)
partial
if E(ui | xi1) = 0 in (1), xi1 is BLANK of ui and the sampling error of B1 hat equals Blank on average, which implies that B1 hat is BLANK
mean independent, 0, unbiased
(if E(ui | xi1,xi2) = 0) if you omit xi2 from (2), B1 hat will be biased BLANK if B2 and cov(xi1,xi2) have the same BLANK
upward, sign
if yi1 is log wage, xi1 is education and xi2 is labor market experience, and you omit xi2 from (2), then B1 hat will be biased BLANk because B2 is BLANK and cov(xi1,xi2) are BLANK correlated
downward, positive, negatively
let’s say you don’t omit xi2, but it is measures with error. Then B2 hat will be
biased down
R² measures how much the variance of the BLANK variable is accounted for by the BLANK variables
dependent, explanatory
true or false: R² is centrally important for doing casual inference
false
basic OLS inference is grounded in the application of the BLANK, which says that the BLANK of the OLS estimator can be regarded as approximately BLANK for large samples
CLT, sampling distribution, normal
the modern approach to regression inference allows for the variance of the errors depends on the BLANK variables
explanantory
the modern approach means we should always report BLANK standard errors and test statistics
robust
the R function lm gives the wrong standard errors, test statistics, and confidence intervals because it ignores
heteroscedasticity
if E(ui|xi1)=0 in (1), the sampling error of B1 hat converges to 0 and B1 hat is BLANK
consistent
the test statistic for whether a explanatory variable has a statistically significant association with the dependent variable is the ratio of the explanatory variable’s BLANK to its BLANK
estimated coefficient, standard error
in (2), the test statistic for the null hypothesis that B2=1 is BLANK
(B2 hat -1)/se(B2)
larger BLANK statistics and smaller BLANK values indicate stronger evidence BLANK the null hypothesis
test, p, against
suppose yi= Bo + B1xi1+ B2xi2 + B3xi3 + B4xi4 + ui. To test the null that B3=B4=0, you can use a BLANK test, which compares the fit of a short regression that BLANK x3 and x4 with the fit of a longer regression that BLANK them
F, excludes, includes
true or flase. if corr(x,) = 0, y does not depend on x
false
true or false. if x causes y, the conditional distribution of y given x must depend on x
true
in the above DAG, z is a BLANK
confounder
you can’t observe the effect of a treatment on an individual bc you can’t observe their BLANK outcome. In this sense, causal inference is fundamentally a BLANK data problem
counterfactual, missing
while individual treatment effects are not observable, you may able to identify the average treatment effect (ATE), which is the difference in average BLANK outcomes
potential
Using the difference in sample average outcomes for treated and untreated individuals generally won’t work for estimating the ATE because potential outcomes are not independently of treatment assignment, which results in what kind of bias?
selection
term 1 in (1) is E(y1i|Di = 1) − E(y0i|Di = 1)
the average treatment on the treated
term 2 in (1) is E(y1i|Di = 1) − E(y0i|Di = 0)(1)
selection bias
if treatment assigned is randomized, then term 2 E(y0i|Di = 1) − E(y0i|Di = 0)(1) equals BLANK and term 1 E(y1i|Di = 1) − E(y0i|Di = 1) equals BLANK
0, ATE
if the potential outcomes are BLANK of treatment assignment, the assignment mechanism is BLANK and the difference in sample average outcomes for treated and untreated individuals will identify the ATE
independent, ignorable
potential outcomes will be BLANK of treatment assignment if individuals are BLANK assigned to treated and untreated groups
independent, randomly
the conditional independence assumption (CIA) is a claim that there is a set of covariates that once you control for them, you can consider the potential outcomes to be BLANK of treated assignment. The CIA is a claim of unBLANK and is untestable
independent, confoundedness
to estimate the ATE under a CIA, you also need overlap, which is the ability to observe BLANK and BLANK units for any set of covariate values
untreated, treated
if you have a set of control variables for which a CIA holds, you can identify the average effect of the treatment on the outcome by running a regression of the outcome on the BLANK from a regression of the treatment dummy on the controls
residuals
unlike in standard regression analysis, in RD designs there is no BLANK in treated and control units because individuals with different values of D, the treatment, have different values of the covariate by construction
overlap
in a sharp RD design, the conditional BLANK assumption holds automatically because treatment assignment is determined solely by the cutoff value of the BLANK variable
independence, running
in a fuzzy RD design, the cutoff value of the running variable determines the BLANK of treatment
probability
the key identifying assumption of an RD design is that the average BLANK outcomes are BLANK through the cutoff
potential, continuous
under the assumptions of a sharp RD design, you identify an
average treatment effect on the treated
the black lines are linear regression approximations to the CEFs for the BLANK outcomes
potential
select the regression specification that is consistent with the black lines
yi = Bo + B1xi + tDi + ui
under the key identifying assumption of a sharp RD design, the model in question 7 identified
t=E(y1i-yoi|xi=c)
the basis for an RD analysis should be apparent in a binned BLANK plot of the outcome and BLANK variable
scatter, running
in general, the RD specification should include a low-order BLANK in the running variable and an interaction of the running variable with the BLANK indicator
polynomial, treatment
the distribution of the running variable should show
no evidence of manipulation because it is smooth throughout the cutoff
an RD analysis of baseline BLANK should show no evidence of BLANK among them
covariates, discontinuities
including the baseline BLANK in the regression model BLANK affect the estimated treatment effect
covariates, should not
the ldurat difference in differences is
0.20
the benefit difference in differences is
88
the high-earner group is BLANK male and BLANK married, but the male and married shares BLANK change over time for either group
more, more, do not
the high-earner groups is BLANK likely to work in manufacturing and BLANK likely to work in construction, and the share of high earners in construction BLANK by BLANK points after the WBA increase
less, more, falls, 4
based on table 1, average time out of work rose BLANK % because of the WBA increase
20
column (1) indicates that time out of work (BLANK) rise for low earners
did not
column (1) indicates that average time out of work was BLANK % BLANK for high earners
25.6, higher
the results in column (1) suggest that time out of work rose BLANK % in Kentucky because of the WBA increase
19.1
the standard error for the estimated DD coefficient is BLANK, which implies that the result is significant at the BLANK % level
.069, 1
controlling for gender, industry affiliation and injury type BLANK the DD coefficient estimate for KY by BLANK percentage points
increases, 4
controlling for gender, industry affiliation and injury type BLANK the overall fit of the regression by BLANK percentage points
increases, 2
still, the overall fit reported in column (2) is too low for the regression results to be trustworthy
false
the results in column (3) suggest that time out of work rose BLANK % in Michigan because of the WBA incease
19.2
the t statistic for the estimated DD coefficient in column (3) is BLANK, which implies you BLANK reject the null at the 5% level
1.25, cannot
the value of the BLANK test for the null that the coefficients of the control are jointly zero is BLANK, so the null is BLANK
F, 9.8, rejected
the metric that we use to compare prediction models is BLANK or MSPE
mean squared prediction error
mean squared error of (uhat)=
1
meansquarederrorof(uhat)=
0.75
E(uhat) - u = BLANK, which implies uhat is BLANK
0, unbiased
although uhat is BLANK, it has a lower mean squared error
biased
R² penalizes the inclusion of an additional explanatory variable if its associated t-statistics is less than
1
machine learning that involved predicting an outcome with a set of explanatory variables is called BLANK learning
supervised
choosing the best-performing ML model involved empirically tuning model complexity through
cross-validation
cross-validation beings by dividing the data into BLANK and BLANK samples
training, testing
the training sample is divided into BLANK, one of which is held out for BLANK while the others are used to BLANK the model
folds, validation, estimate
cross-validation involves computing the BLANK for each fold and BLANK them over all folds
MSPE, averaging
cross-validation is repeated for different values of the BLANK parameter, which determines the strength of the BLANK imposed by the regularizer
tuning, penalty
LASSO is a BLANK estimator that also performs variables BLANK by forcing the coefficients of the least releavant variables to be equal to BLANK relevant
shrinkage, selection, zero
the 2×2 DD analysis compares the difference in average outcomes for the BLANK observations before and BLANK treatment with the difference in mean outcomes for the control observations BLANK and BLANK treatment
treated, after, before, after
a DD analysis targets the average treatment effect on the BLANK
treated
the target estimand cannot be estimated directly because E(yo|g=1,t=1) is BLANK
unobserved
the key identifying assumption in a DD analysis is that the treated and untreated outcomes would follow BLANK trends in the BLANK of the treatment
parallel, absence
a simple before vs after comparison of treated observations misses the BLANK in the outcome not associated with treatment
trends
a simple comparison of treated vs control observations after treatment misses factors that cause non-random BLANK into treatment
selection
the parameter Y reflects the average difference between BLANK and BLANK outcomes before treatment
treated, untreated
the parameter N reflects the average differences in outcomes BLANK and BLANK treatment for the untreated group
before, after
the parameter N also reflects the BLANK average difference in outcomes between periods 0 and 1 for the BLANK group
counterfactual, treated
if N varied by group, the BLANK assumption would not hold
parallel trends
the parameter S represents the BLANK
difference in differences
the standard 2×2 DD analysis can be carried out by regressing the outcome on a BLANK dummy, a period BLANK, and their BLANK
group, dummy, interaction
a formal expression of the DD regression consistent with table 2 is:
y=u + Ytreat +Nafter + Streat *after + ua
a regression formulation of DD design is appealing because it
all of the above
we described a TWFE model as a regression model for data with both a BLANK and time dimension
group
estimating a TWFE model with data on multiple groups and variation in treatment timing can identify the ATE if the treatment effect is
homogeneous
computing the correct standard errors for TWFE estimates usually requires BLANK at the group level to account for BLANK and BLANK correlation
clustering ,heteroscedasticity, serial