topic 4

Overview

Topic 4 covers essential concepts in econometrics, focusing on different ways to model relationships between economic variables (functional forms), how we measure those variables (units of measurement), and what these choices mean for understanding data and its real-world implications. These concepts are crucial for accurately interpreting regression results and making informed economic decisions.

Chapter References

Relevant chapters to refer to include:
- Chapter 6.2: Slope coefficients interpretation
- Chapter 6.4: Semi-log (or log-lin) model: semi-elasticities
- Chapter 6.5: Lin-log model: semi-elasticities
- Chapter 6.6: Polynomial equations
- Chapter 4.1: Scaling and units of measurement

Slope Coefficients Interpretation (Section 4.1)

The marginal effect of one variable (X) on another (Y) is simply the slope of the relationship between them. This slope tells us how much Y changes for a small change in X. Mathematically, it's calculated as a derivative, $\frac{dY}{dX}$ , which measures the instantaneous rate of change.
It is crucial to observe how variables are entered into the equation: either as their original values (natural units) or transformed using a natural logarithm (ln).
- Linear Model Definition: This is the simplest model where variables are in their natural units.
 $Yi = \beta1 + \beta2 Xi + u_i$
 - Here:
 - $Y_i$ represents the dependent variable (the outcome we are trying to explain) for observation $i$ .
 - $X_i$ represents the independent variable (the factor we believe influences Y) for observation $i$ .
 - $\beta_1$ is the intercept, indicating the value of $Y$ when $X$ is zero.
 - $\beta_2$ is the slope coefficient, which is the primary focus of interpretation.
 - $u_i$ represents the error term, accounting for unobserved factors or random noise.
- Interpretation of $\beta2$ : In this linear model, $\beta2$ tells us the actual number of units that $Y$ is expected to change for every one-unit increase in $X$ , assuming all other factors remain constant.

Semi-log (Log-Lin) Model (Section 4.2)

Log-Lin Model Definition: In this model, both the dependent variable (Y) and the independent variable (X) are transformed using their natural logarithms.
$\ln Yi = \beta1 + \beta2 \ln Xi + u_i$
```
- The natural logarithm (often written as `ln`) helps us interpret changes in terms of percentages rather than absolute units.
```
- Interpretation of $\beta2$ : Here, $\beta2$ is called the elasticity of $Y$ with respect to $X$ . This means it represents the approximate percentage change in $Y$ that results from a one percent change in $X$ . For example, if $\beta_2$ is 0.5, a 1% increase in $X$ leads to a 0.5% increase in $Y$ .
The transformation from absolute changes to relative (percentage) changes can be understood as follows:
- When we talk about changes in variables:
  - Change in $Y$ : $dY$ refers to a change in $Y$ expressed in its original units.
  - Change in $X$ : $dX$ refers to a change in $X$ expressed in its original units.
  - Change in $\ln(Y)$ : $d\ln(Y)$ is approximately equal to $\frac{dY}{Y}$ which represents the relative change (or percentage change if multiplied by 100) in $Y$ .
  - Change in $\ln(X)$ : $d\ln(X)$ is approximately equal to $\frac{dX}{X}$ which represents the relative change (or percentage change if multiplied by 100) in $X$ .
- Therefore, when both sides of the equation are in logarithms, the coefficient $\beta_2$ directly captures the ratio of the relative change in $Y$ to the relative change in $X$ .

Interpretation of Semi-elasticity in Semi-log Models

In the semi-log model, only the dependent variable (Y) is transformed using its natural logarithm, while the independent variable (X) remains in its natural units.
- Model Definition:
 $\ln(Yt) = \beta1 + \beta2 Xt + u_t$
- Interpretation for $\beta2$ : In this case, $\beta2$ represents the approximate relative change in $Y$ (as a decimal) for a one-unit increase in $X$ . To express this as a percentage, you multiply $\beta_2$ by 100.
 - So, $\beta2 \times 100$ gives the percentage change in $Y$ due to a one-unit increase in $X$ . For example, if $\beta2$ is 0.04, a one-unit increase in $X$ leads to an approximately 4% increase in $Y$ .

Practical Example of Semi-log Model

Consider a model used to analyze the growth of an economy's Gross Domestic Product (GDP) over time:
- Model Used for GDP: $\ln(GDPt) = \beta1 + \beta2 t + ut$
- Here, $t$ denotes time, typically measured in years.
- Interpretation of $\beta2$ : In this context, $\beta2$ is directly interpreted as a growth rate. For example, if $\beta_2$ is 0.04:
 - An increase of one year ( $t$ increases by 1) results in an approximate $(0.04 \times 100)\% = 4\%$ growth rate in GDP. This means GDP is expected to increase by 4% each year.

Lin-log Model (Section 4.3)

This model is the reverse of the semi-log model: the dependent variable (Y) is in natural units, but the independent variable (X) is transformed using its natural logarithm.
- Lin-Log Model Definition:
 $Yi = \beta1 + \beta2 \ln(Xi) + u_i$
- Interpretation of $\beta2$ : Here, $\beta2$ measures the units change in $Y$ resulting from a one percent increase in $X$ . Specifically, for a 1% increase in $X$ , $Y$ is expected to change by $\frac{\beta_2}{100}$ units.
Example: Let's look at expenditure on services as a function of total personal consumption expenditure:
- $\text{Expenditure on services}t = \beta1 + \beta2 \ln(\text{Total personal consumption}t) + u_t$
- Empirical Findings from 1975-2006: If a study found that $\beta2$ was 18.44, then a 1% increase in total personal consumption is associated with an increase of $\frac{18.44}{100} = 0.1844$ units in expenditure on services. If Y is measured in billions of dollars, this would mean an increase of $0.1844 billion, or $184.4 million, in expenditure on services, not $18.44 billion as implied in the original note's example. To clarify for the note, if $\beta2$ is 1844 (for example) then a 1% increase in consumption increases expenditure by $18.44 billion.

Polynomial Equations (Section 4.4)

Polynomial functions are used when the relationship between variables is not a simple straight line but rather curves. They are frequently employed in economic contexts such as modeling cost functions (how total cost changes with production) or production functions (how output changes with input).
Quadratic Function Definition: This involves $X$ and $X$ squared ( $X^2$ ), creating a single bend or curve.
$Yi = \beta1 + \beta2 Xi + \beta3 Xi^2 + u_i$
Cubic Function Definition: This includes $X$ , $X$ squared ( $X^2$ ), and $X$ cubed ( $X^3$ ), allowing for two bends in the curve.
$Yi = \beta1 + \beta2 Xi + \beta3 Xi^2 + \beta4 Xi^3 + u_i$
Insights on the effect of $X$ on $Y$ when using quadratic terms (only consider $\beta_3$ for the quadratic relationship):
- If \beta_3 > 0: The model exhibits a U-shape (convex). This means $Y$ initially decreases as $X$ increases, reaches a minimum point, and then starts to increase. A common example is an average cost curve, which initially falls due to economies of scale and then rises due to diminishing returns.
- If \beta_3 < 0: The model exhibits an inverted U-shape (concave). This means $Y$ initially increases as $X$ increases, reaches a maximum point, and then starts to decrease. An example is an average product curve, where adding more input initially boosts productivity but eventually leads to reduced average output.
The stationary point (maximum or minimum of $Y$ ) is determined by the first-order condition (F.O.C). This is found by setting the first derivative of the function with respect to $X$ to zero:
- $\frac{\delta Y}{\delta X} = 0$
- In simpler terms, we are finding the point on the curve where the slope is perfectly flat (zero). This flat point indicates either the highest (maximum) or lowest (minimum) value of $Y$ along that curve, given the values of $X$ .

Implications of Scaling and Units of Measurement (Section 4.5)

Regression analysis is flexible and allows data to be expressed in various measurement units, such as individual units, thousands, millions, or billions. While convenient, it has important implications for interpreting regression results.
Important implications regarding Regression results:
- The estimated coefficients (intercepts and slopes) will change if the units of measurement for $X$ or $Y$ are changed. However, the fundamental interpretation of the underlying economic relationship, the R-squared (which measures how well the model fits the data), and the statistical significance of the coefficients remain the same.
- Example: Analyzing pints of beer sold ( $Y$ ) based on price ( $X$ ).
 - Suppose a model gives us the following coefficients:
 - $\beta_2 = -85$ : This means if the price ( $X$ ) increases by $1 (assuming $X$ is in dollars), the consumption of beer ( $Y$ ) is expected to decrease by 85 pints.
 - $\beta_1 = 1200$ : This is the consumption in pints when the price is zero (though this might not be economically relevant, it's the intercept).
 - Conversion of price to pence: If we re-estimate the model but measure price in pence instead of dollars ($1 = 100$ pence), the new slope coefficient ( $\gamma_2$ ) would adjust.
 - The new model might look like: $Yi = \gamma1 + \gamma2 X^pi + u_i$ (where $X^p$ is price in pence).
 - The new slope $\gamma2$ would be calculated as $\frac{\beta2}{100}$ . So, $\gamma_2 = \frac{-85}{100} = -0.85$ . This means if the price increases by 1 penny, consumption would decline by 0.85 pints. The impact is the same; it's just scaled correctly for the new unit.
 - Changing $Y$ from pints to half-pints: If we decide to measure consumption in half-pints ( $Y^2i$ ) instead of pints ( $Yi$ ), where $Y^2i = 2 \times Yi$ , the coefficients would again adjust.
 - The new model might be: $Y^2i = \delta1 + \delta2 Xi + ui$ . Here, $\delta1 = 2 \times \beta1$ and $\delta2 = 2 \times \beta_2$ . So, an increase of $1 in price would now lead to a decrease of $2 \times 85 = 170$ half-pints.
These examples demonstrate how changes in the units of measurement for either the dependent or independent variables lead to corresponding adjustments in the regression coefficients. While the numerical values of the coefficients change, the underlying economic relationship and the interpretation of the impact (e.g., that an increase in price reduces consumption) remain consistent. This understanding is crucial for correctly interpreting models, especially when dealing with large numbers like government expenditures over time, where units (pounds, millions, billions) can drastically alter coefficient values but not the fundamental economic insight like growth metrics or the trend in governmental expenditure analysis.