How do discrete and continuous random variables differ?
A discrete random variable takes a countable set of distinct values, while a continuous random variable can take any value within a range.
2
New cards
Which of these is not an unbiased estimator of the population mean?
The geometric mean of the observations.
3
New cards
What statistical meaning does a p-value convey in hypothesis testing?
The probability of obtaining test results at least as extreme as those observed, if the null hypothesis were true.
4
New cards
What does the R-squared statistic reveal about a regression model?
The proportion of variation in the dependent variable that is predictable from the regressors.
5
New cards
Which assumption must hold for OLS estimators to be interpreted as measuring causal effects?
The expectation of the error term conditional on X must be zero: E[u|X] = 0.
6
New cards
Which is a consequence of heteroskedasticity in regression analysis?
Unreliable calculated standard errors from standard formulae.
7
New cards
In the multiple regression model Yi = β0 + β1X1i + β2X2i + ui, what specific interpretation applies to β1?
The marginal effect of X1 on Y when X2 remains unchanged.
8
New cards
What causes omitted variable bias in econometric analysis?
Excluding a relevant explanatory variable that correlates with included variables.
9
New cards
Which of the following is the most important statistical problem caused by imperfect multicollinearity?
The coefficient standard errors are large, reducing statistical precision.
10
New cards
What should researchers conclude if a confidence interval for a difference between two population means contains zero?
The observed difference is not large enough to reject the null hypothesis of equal means.
11
New cards
What statistical property ensures that an estimator converges to the true parameter value as sample size approaches infinity?
Consistency.
12
New cards
In the wage equation Salaryi = β0 +β1Femalei +ui, if β1 = −8000, what does this imply?
On average, females earn $8000 less than males, all else equal.
13
New cards
Which regression diagnostic statistic is expressed in the same units as the dependent variable?
The standard error of the regression (root MSE).
14
New cards
What is the primary difference between testing individual hypotheses and joint hypotheses in regression analysis?
Joint tests account for correlations between estimators, while individual tests examine coefficients in isolation.
15
New cards
What does the null hypothesis H0 : β1 = β2 = β3 = 0 test in a regression model with three education variables?
Whether education as a whole has any significant effect on the dependent variable.
16
New cards
Which test measures the improvement in model fit when comparing nested models?
F-test.
17
New cards
What might occur during hypothesis testing if two highly correlated variables are included in a regression model?
The variables might be jointly significant (significant F-test) but individually insignificant (insignificant t-tests).
18
New cards
What does a low p-value for the F-statistic indicate in a joint hypothesis test of H0 : β1 = β2 = β3 = 0?
There is strong evidence against the hypothesis that all three coefficients are simultaneously zero.
19
New cards
In a regression of salary on years of education, what would be the most likely direction of bias if ability is omitted?
Upward bias because ability is positively correlated with both education and salary.
20
New cards
What is the reason economists report multiple specifications of the same model in research papers?
To test whether key results are robust to changes in model formulation.
21
New cards
In a regression model with a quadratic term, how does the effect of X on Y differ from a linear model?
The effect varies with the value of X and equals β1 + 2β2X.
22
New cards
What is a key advantage of using natural logarithms in regression models?
They compress wide-ranging data and reduce the influence of extreme values.
23
New cards
In a log-linear model ln(Y) = β0 + β1X + u, if β1 = 0.03, what is the effect of a 1-unit increase in X on Y?
Y increases by approximately 3%.
24
New cards
In a regression model with an interaction term, what does β3 measure?
How the effect of X1 on Y changes with different values of X2.
25
New cards
In a wage equation, if β1 = 2.5, β2 = −5, and β3 = −0.5, what is the gender wage gap for someone with 10 years of experience?
−15.
26
New cards
What is a limitation of the Linear Probability Model for binary outcomes?
It can predict probabilities outside the [0,1] range.
27
New cards
In binary response models like logit and probit, what is a 'latent variable'?
An unobserved continuous variable that determines the binary outcome when it exceeds a threshold.
28
New cards
What is the key difference between logit and probit models?
They assume different distributions for the error term in the latent variable model.
29
New cards
In maximum likelihood estimation for binary response models, what are we trying to maximize?
The probability of observing our exact sample pattern of ones and zeros given the model parameters.
30
New cards
What can make a coefficient estimate statistically significant but economically trivial?
A very large sample size combined with a small effect size.
31
New cards
What does the interaction term in a regression model indicate?
It indicates how the relationship between one independent variable and the dependent variable changes at different levels of another independent variable.
32
New cards
Why might a researcher choose to use a log-linear model?
To interpret coefficients as elasticities, which represent percentage changes.
33
New cards
What does the term 'indeterminate bias' refer to in econometric models?
It refers to bias that depends on other factors in the model.
34
New cards
What is the effect of a quadratic term in a regression model?
It allows for a non-linear relationship between the independent variable and the dependent variable.
35
New cards
What is the significance of β1 in a log-linear model?
It represents the percentage change in Y for a one-unit change in X.
36
New cards
What does a confidence interval that includes zero indicate about a coefficient?
It suggests that the coefficient may not be statistically significant.
37
New cards
What is the role of maximum likelihood estimation in econometrics?
To estimate the parameters of a model by maximizing the likelihood of the observed data.
38
New cards
What does it mean if a model predicts probabilities outside the [0,1] range?
It indicates a limitation of the model, specifically the Linear Probability Model.
39
New cards
In the context of econometrics, what is an interaction term?
A term in a regression model that represents the combined effect of two variables on the dependent variable.
40
New cards
What is the implication of having a large standard error in a coefficient estimate?
It suggests that the estimate is less precise and may not be statistically significant.
41
New cards
What is the primary focus of causal inference in econometrics?
Determining whether X causes Y, rather than just being associated with
42
New cards
What is the difference between a discrete and a continuous random variable?
A discrete random variable takes a finite number of values while a continuous random variable takes values on a continuum.
43
New cards
In hypothesis testing, what does the p-value represent?
The probability of observing data at least as extreme as the sample data assuming the null hypothesis is true
44
New cards
What is the interpretation of a 95% confidence interval for a parameter?
If we repeated sampling many times, approximately 95% of similarly constructed intervals would contain the true parameter
45
New cards
In the regression model Yi = β0+β1Xi+ui, what does the coefficient β1 represent?
The change in Y associated with a one-unit increase in X
46
New cards
What is the computational objective of Ordinary Least Squares (OLS) estimation?
To find the line that minimizes the sum of squared residuals
47
New cards
What does the R-squared value in a regression analysis represent?
The fraction of variance in the dependent variable explained by the independent variable(s)
48
New cards
Which of the following is a necessary assumption for the OLS estimator to have a causal interpretation?
The error term has mean zero given X: E[u|X] =0
49
New cards
What is heteroskedasticity in the context of regression analysis?
When the variance of the error term is not constant across all values of the independent variables
50
New cards
If heteroskedasticity is present in a regression model, which of the following statements is true?
The standard errors of the coefficients are incorrect, affecting hypothesis tests and confidence intervals
51
New cards
In a multiple regression model Yi = β0 + β1X1i + β2X2i + ui, what does the coefficient β1 represent?
The change in Y associated with a one-unit increase in X1, holding X2 constant
52
New cards
Consider a crop yield model: Y ieldi = β0 + β1Rainf alli + β2T emperaturei + UI. If β1 = 0.5 and β2 = −0.3, how would you interpret these coefficients?
Each additional inch of rainfall is associated with a 0.5 bushel/acre increase in yield, holding temperature constant; each additional degree Fahrenheit is associated with a 0.3 bushel/acre decrease in yield, holding rainfall constant
53
New cards
What is omitted variable bias in regression analysis?
The bias in coefficient estimates that occurs when a relevant variable is excluded from the model and is correlated with included variables
54
New cards
How do R-squared and adjusted R-squared differ in multiple regression?
R-squared never decreases when adding variables, while adjusted R-squared penalizes for additional regressors and can decrease
55
New cards
What happens under perfect multicollinearity in a regression model?
Unique coefficient estimates cannot be determined because one regressor is an exact linear function of others
56
New cards
When imperfect multicollinearity exists in a regression model, what is the primary consequence?
The standard errors of the coefficients increase, making estimates less precise
57
New cards
When interpreting a confidence interval for the difference between two means, if the interval includes zero, what can be concluded?
There is no statistically significant difference between the means
58
New cards
Which of the following is an example of an biased estimator of the population mean?
The root mean square (RMS)
59
New cards
What property describes an estimator gets closer to the true parameter value as the sample size increases?
Consistency
60
New cards
What is perfect multicollinearity in a regression model?
When one regressor is an exact linear function of other regressors
61
New cards
In a regression model Yi = β0 + β1Di with a Di being binary explanatory, what is the interpretation of β0?
The expected value of Y when D = 0
62
New cards
In a simple wage regression model W agei = β0 + β1M alei + ui, if β1 = 5000, what is the correct interpretation?
Males earn $5000 more than females, on average
63
New cards
Which measure of regression fit maintains the same units as the dependent variable?