Adv Econometrics III

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/54

flashcard set

Earn XP

Description and Tags

Key concepts & common STATA commands.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

55 Terms

1
New cards

random variable

a numerical summary of a random outcome

2
New cards

outcome

the mutually exclusive result of a random process

3
New cards

variable

a measurable characteristic of a population

4
New cards

sample space

the set of all possible outcomes of a random process

5
New cards

event

a subset of the sample space

6
New cards

estimator

the function of the data in the sample derived to infer the estimand

7
New cards

estimand

the true value in the observable population which is to be estimated

8
New cards

target/structural parameter

the specific unknown population parameter that is to be estimated

9
New cards

central limit theorem

when N is sufficiently large, the distribution of the estimated mean becomes more normal

10
New cards

Gauss-Markov OLS Assumptions

  1. Exogeneity

  2. No multicolinearity

  3. Linear relationship between dependent var & independent var

  4. Homoskedasticity

  5. No autocorrelation

  6. Normally distributed error term

11
New cards

properties of OLS under Gauss-Markov assumptions

B-L-U-E

Best linear unbiased estimator

12
New cards

exogeneity

the x-variables and the error term are not correlated: E(εi | X) = 0. Therefore, neither the error term nor the dependent variable influence the explanatory variables since they are determined outside of the model.

13
New cards

multicollinearity

a breach in Gauss-Markov correlation between ≥2 explanatory variables, probably because they measure a similar trait

14
New cards

homoskedasticity

the variance (σ) of the error term (ε) is constant throughout the sample. Therefore, the dispersion of residuals is similar for all X

15
New cards

heteroskedasticity

the variance (σ) of the error term (ε) is not constant throughout the sample. Therefore, the dispersion of residuals is dissimilar for all X. This violates Gauss-Markov assumptions of OLS.

16
New cards

autocorrelation / serial correlation

the correlative relationship between an independent variable and its own past values. This violation of OLS assumptions often occurs in time series data.

17
New cards

endogeneity

correlation between explanatory vars and the error term such that there is a bilateral causal relationship between the X and Y variables.

18
New cards

residuals

the differences between observed (actual) values and the estimated values predicted by the model

19
New cards

grounds to reject the null hypothesis (Ho) and propose the alternative hypothesis (Ha) / statistical significance

p-value < critical value

20
New cards

insufficient grounds to reject the null hypothesis / statistical insignificance

p-value > critical value

21
New cards

p-value

the probability of observing a z-stat, t-stat, F-stat, etc. with an absolute value ≥ the observed results

22
New cards

Triple S method of analysing variable coefficients

sign, size & significance of the a variable’s estimated coefficient

23
New cards

dummy variable

a numerical var expressed as 0 or 1 to represent categorical data, often gender, race, union membership, etc.

24
New cards

elasticity

the % change of the dependent var due to 1% change in the independent var

25
New cards

linear-linear model (Y = f[X])

change Y = beta change X

26
New cards

linear-log model (Y=f[logX])

change Y = beta/100 % change X

27
New cards

log-linear model (logY = f[X])

% change Y = (100)beta change X

28
New cards

log-log model (logY = f[logX])

% change Y = beta% change X

29
New cards

internal validity

a regression that successfully yields inferences applicable to the chosen population

30
New cards

external validity

a regression whose inferences made from a sample can also be applied to other populations

31
New cards

Variance Inflation Factor (VIF)

a method of identifying multicollinearity by quantifying how much correlation between predictor variables inflates the variance of a regression coefficient. This index = 1/(1-R²). To run this in STATA, use command

32
New cards

Breusch-Pagan test for heteroskedasticity

Ho: constant variance/hetsked (σ1 = σ2, etc.)

Ha: inconstant variance/homosked (σ1 ≠ σ2, etc).

To run this test in Stata, use the command ESTAT HETTEST

33
New cards

robust standard errors

standard errors adjusted for heteroskedastiicity. HOW TO CALC IN STATA

34
New cards

standard errors

= variance / square root( # of observations)

35
New cards

Type I neoclassical measurement error

the error is uncorrelated with the true value of the variable (eg: independent inaccuracies in reporting one’s weight)

36
New cards

Type II neoclassical measurement error

the error is correlated with the true-value or with other variables (eg: many observations intentionally misrepresent a characteristic like income)

37
New cards

conditions for instrumented regression

  1. relevance: the instrument must correlate with the problematic endogenous variable.

  2. exclusive restriction: the instrument only affects the outcome through the endogenous x-variable

38
New cards

rule of thumb for weak instrument identification

F-stat < 10 for a significance test for

39
New cards

Hausman test

A test to determine if the estimator is consistent & efficient (adheres to BLUE)

Ho: the regressor is exogenous (E(Xiεi) = 0)

Ha: the regressor is endogenous (E(Xiεi) ≠ 0)

40
New cards

linear probability model

a OLS model following the binomial distribution that uses limited dependent variables. These models can suffer from issues like predicted probabilities outside the 0-1 range and heteroskedasticity.

41
New cards

probit model

the cumulative distribution function of independent variables which models the probability of an event’s occurrence. This model follows the standard normal distribution. Use STATA command LOGIT.

42
New cards

logit model

the log of the probability of an event’s occurrence. this model follows the logistic distribution and is interpreted as the “odds” of an event happening. Use STATA command LOGIT

43
New cards

latent variable

a variable that cannot be observed, but can be inferred from other observable variables

44
New cards

maximum likelihood estimation

estimating the parameters of an assumed probability distribution based on some observed data to maximise a likelihood function so that, under the assumed statistical model, the observed data is most probable.

45
New cards

MARGINS command

finds marginal effects

46
New cards

multinomial regressions

regressions for categorical data with no order/ranking

47
New cards

cross-sectional data

48
New cards

time-series data

49
New cards

panel data

a combination of cross-sectional data and time series data

50
New cards

Chow Test for structural change

Ho: coefficients are the same for every Y

Ha: coefficients are different for every Y

51
New cards

Difference-in-Diffferences

Causal estimator method of using control and treatments groups to examine trends in 2 groups pre- and post-intervention. This method addresses biases from pre-existing differences between the two groups and omitting time trends that would have occurred regardless of the intervention.

This method assumes:

  • parallel trend

  • exogeneity

  • conditional independence

52
New cards

balanced panel data

panel data with an equal number of observations in each cross-section and time period

53
New cards

unbalanced panel data

panel data with an unequal number of observations in each cross-section and time period

54
New cards

Fixed Effect Model

55
New cards

STATA command XTSET

organises panel data properly