Linear Regression - Practice Questions (1&2 EMF)

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/29

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

30 Terms

1
New cards

What does β₁ mean in

y = β₀ + β₁x + u

Change in y for a 1-unit change in x, ceteris paribus

2
New cards

What does β₁ mean in

wage = β₀ + β₁mast + u?

(mast is a dummy variable, where 1= have master, 0=no master)

Average wage difference between those with and without master’s degree

3
New cards

If wage = 1500 + 500mast, what is wage with master? Any issue?

  • Predicted wage = 2000

  • Issue: may omit factors correlated with mast → bias risk

4
New cards

What is a dummy variable in regression?

  • Variable coded 0 or 1 to indicate membership in a category

  • Example: mast = 1 if a person has a master’s degree, 0 otherwise

5
New cards

Does unbiasedness require high R²?

No. Unbiasedness relies on assumptions, not on fit

R² measures fit (Proportion of variation in y explained by the model)

6
New cards

How to read coefficients when the dependent variable is ln(y)?

  • Continuous x: +1 in x → ≈ 100·β% change in y

  • Dummy D (0/1): D=1 vs 0 → ≈ 100·δ% difference in y

  • Quick feel: β = 0.05 ⇒ ≈ 5% (exact 5.13%)

7
New cards

What is the base group in a regression with dummies?

The group where all dummy variables = 0

  • Example: wage = β₀ + β₁IQ + β₂south + β₃(IQ×south), the base group is non-southerners (south=0)

8
New cards

What does a p-value measure in regression output?

  • Probability of seeing an estimate as extreme as β^​ if the true β = 0

  • Small p (<0.05): strong evidence against H₀ → effect significant

  • Large p: no strong evidence → cannot reject H₀ → not significant

  • p-value does not measure size or importance of effect, only evidence against H₀

9
New cards

What is the formula for the t-test on a regression coefficient?

Usually test H₀: βⱼ = 0, so denominator is the standard error of βⱼ

<p>Usually test H₀: βⱼ = 0, so denominator is the standard error of βⱼ</p>
10
New cards

How do you test whether a group of regressors has no effect?

  • Use the F-test

  • Null hypothesis: all coefficients in the group = 0

  • If F is large and p-value is small, reject the null

11
New cards

Why does multivariate regression provide a better ceteris paribus interpretation than univariate regression?

  • In univariate models, β₁ may capture both x’s effect and effects of omitted correlated variables

  • In multivariate models, including controls separates effects, so β₁ reflects only the effect of x

12
New cards

What is the population regression model in simple linear regression?

  • y = β₀ + β₁x + u

  • β₀ = intercept, β₁ = slope, u = error term

13
New cards

Interpret β₁ in:

wage = β₀ + β₁Female + u

while: β₁ = –5 and female is dummy (1=female, 0=male)

β₁ = –5 → women earn 5 less on average than men, cetris paribus

14
New cards

What happens if you rescale dependent variable (income € → thousands €)?

  • Coefficients and SE shrink by factor 1000

  • t-statistics unchanged

15
New cards

Two algebraic properties of OLS residuals

  • Residuals sum to 0: Σû=0

  • Residuals uncorrelated with regressors: Σx·û=0

16
New cards

Population Regression Function (PRFs) for

y = β₀ + β₁D + u

  • D=0: E[y|D=0]=β₀

  • D=1: E[y|D=1]=β₀+β₁

17
New cards

Interpret R²=0.25 in exam scores vs hours studied

25% of variation in scores explained by hours studied

18
New cards

Interpret β₁ in y = β₀ + β₁ln(x) + u

  • β₁/100 = change in y for 1% ↑ in x

  • Example: β₁=2 → 1% ↑ in x raises y by 0.02

19
New cards

State variance decomposition

TSS = ESS + RSS (total = explained + residual variation)

20
New cards

fe = β₀ + β₁Tech + u

β̂₁=0.025

se=0.010

=> Interpret & test

  • Tech firms’ forecast error 2.5% higher

  • t=2.5 → significant at 5%

21
New cards

R²=0.02. Interpret & relevance

  • Only 2% of y variation explained

  • Still fine for causal inference if regressors exogenous

22
New cards

Why log(y) can reduce heteroskedasticity?

  • Compresses scale of y

  • Stabilizes variance of errors

23
New cards

Compare log vs level wage regressions

  • log(wage)=β₀+β₁Education+u → β₁ ≈ % effect of education

  • wage=β₀+β₁Education+u → β₁ = absolute wage change per education year

24
New cards

Reading regression output

  • Variable significant if p<α (1%,5%,10%)

  • Significant coefficient → effect different from 0

  • Non-significant → cannot reject H₀

25
New cards

Why high R² may be useless

Spurious regression: x and y both trend but no causality

26
New cards

What is an exogenous variable?

  • A regressor uncorrelated with the error term (E[u|x]=0)

  • Ensures OLS is unbiased and consistent

  • Example: randomized treatment in an experiment

27
New cards

What is an endogenous variable?

  • A regressor correlated with the error term (E[u|x]≠0)

  • Causes OVB and biased OLS estimates

  • Sources: omitted variables, simultaneity, measurement error

28
New cards

Why can a high R² be useless?

  • R² only measures % of y’s variation explained by x

  • It does not prove unbiasedness or causality

  • Example: time-series data where GDP and global temperature both trend upward → regression shows high R² but relationship is spurious (driven by common trend, not causal link)

29
New cards

Give two endogeneity threats.

  • Omitted variable (e.g. ability with education)

  • Simultaneity (e.g. price & demand)

30
New cards

What are omitted variables?

  • Relevant factors affecting y but excluded from regression

  • If correlated with regressors → E[u|x]≠0 → OLS biased

  • Example: Ability omitted in wage–education regression biases education effect