L3: The multiple regression model

3.1 Multiple Regression Model

A dependent variable is influenced by multiple explanatory variables (X1, X2, X3, …)
Example scenarios:
- Earnings: Determined not only by education but also by experience.
- Demand: Influenced by price, income, price of substitutes, etc.
- Output: Relies on various inputs of production and other environmental factors.

Mathematical Representation

The general formula for the multiple regression model: $Yi = \beta1 + \beta2 X{1i} + \beta3 X{2i} + u_i$
Where:
- $\beta_1$ = intercept
- $\beta2, \beta3$ = partial regression coefficients
- The coefficients represent the marginal effects on Y of each X variable. The term "partial" is key here: it means we are looking at the effect of one X variable while carefully "holding all other X variables constant" (this is known as ceteris paribus).
- This isolation allows us to understand the unique contribution of each independent variable to the dependent variable, preventing confusion from intertwined effects.
If both $X1$ and $X2$ change, then $\delta Y = \beta2 \cdot \delta X1 + \beta3 \cdot \delta X2$

Practical Examples

Example 1: Family consumption as a function of income and number of children:
- $Ci = \beta1 + \beta2 INi + \beta3 CHi + u_i$
- Interpretation of coefficients:
- $\beta2$ = effect on family consumption of an additional pound of income (e.g., $\beta2 = 0.8$ means an extra £1 increases weekly consumption by 80 pence).
- $\beta3$ = effect on family consumption of an additional child (e.g., $\beta3 = 12.5$ gives an increase of £12.5).
- Example: If $Ci = 45.98 + 0.8 INi + 12.5 CH_i$
Example 2: Manufacturing output as a function of labor and capital inputs:
- Log-linear function: $Y = 2.89 + 0.37 \ln Li + 0.52 \ln Ki$
- Coefficients represent partial elasticities:
- A 1% increase in labor input yields a 0.37% increase in output.
- A 1% increase in capital input yields a 0.52% increase in output.
- Total returns to scale (TRS):
- $TRS = 0.37 + 0.52 = 0.89$ (indicates decreasing returns to scale since TRS < 1).
Example 3: Production function with three inputs:
$Y = 3.44 + 0.32 \ln X1 + 0.41 \ln X2 + 0.38 \ln X*3$
- Coefficient interpretation:
- A 1% increase in $X*1$ increases output by 0.32%.
- A 1% increase in $X*2$ increases output by 0.41%.
- A 1% increase in $X*3$ increases output by 0.38%.
- Total returns to scale:
- $TRS = 0.32 + 0.41 + 0.38 = 1.11$ (indicates increasing returns to scale).

Estimation and Goodness of Fit

The Core Idea: Finding the Best Fit

The goal of multiple regression is to estimate the relationship between a dependent variable (Y) and multiple independent variables ( $X1, X2, \dots$ ).
We aim to find a line, or more accurately, a hyperplane (a multi-dimensional line), that best represents the trend in our data. This "best fit" helps us predict Y based on the X variables.

Ordinary Least Squares (OLS) Theory

The most common method to find this "best-fit" line is called Ordinary Least Squares (OLS).
What is a "residual" (or error term)? It's the difference between the actual observed value of Y for a data point and the value of Y predicted by our regression model. It tells us how far off our model's prediction is from the reality for each point ( $ui = Yi - \hat{Y_i}$ ).
Why "Least Squares"? OLS works by choosing the regression coefficients ( $\beta$ s) that minimize the sum of the squared residuals (RSS).
- We square the residuals ( $u_i^2$ ) to ensure that positive and negative errors don't cancel each other out, and to penalize larger errors more heavily.
- Minimizing this sum means finding the line that, on average, is closest to all the data points, ensuring the best possible fit.
The coefficients calculated through OLS are called OLS estimators ( $\hat{\beta}$ s). These estimators are unbiased and consistent under certain assumptions, meaning they provide reliable estimates of the true population parameters.
If error terms are distributed normally, the estimates will also be normally distributed:
- $\hat{\beta1} \sim N(\beta1, \sigma^2_1)$ ,
- $\hat{\beta2} \sim N(\beta2, \sigma^2_2)$ ,
- $\hat{\beta3} \sim N(\beta3, \sigma^2_3)$ .
The goodness of fit is measured using the coefficient of determination, $R^2$ :
- Total Sum of Squares (TSS) = Explained Sum of Squares (ESS) + Residual Sum of Squares (RSS)
- $R^2 = \frac{ESS}{TSS}$
- This indicates the proportion of total variation in Y explained by the model.

Hypothesis Testing: One Parameter

Example: Earnings as a function of age and education:
$Wi = \beta1 + \beta2 Ai + \beta3 Ei + u_i$
- Where:
- $W_i$ = yearly salary
- $A_i$ = age
- $E_i$ = education years
Testing the significance of coefficients:
- Null hypothesis for age: $H0: \beta2 = 0$ vs alternate H1: \beta2 > 0.
- Critical value from the t-distribution used with degrees of freedom $n - k$ ; in this case, $n = 73$ and $k = 3$ .
Conducting the test:
- If the calculated t-statistic exceeds critical value, reject $H_0$ .
- In this case:
- $F^*$ value calculated and compared against critical $F$ value from the F-distribution for multiple parameters.
- Example Calculation: If $F^* = 2.5$ from $n = 34, RSS = 73$ , and ESS = 176.66, can infer if the parameters are significant collectively.

Conclusion and Implications

For the regression model to be statistically significant, at least one variable must provide explanatory power.
When conducting tests involving marginal factors, significance provides insight into the relationship between the dependent variable and selected independent variables, aiding in decision-making and comprehension of underlying patterns.