topic 4

Overview
  • Topic 4 covers essential concepts in econometrics, focusing on different ways to model relationships between economic variables (functional forms), how we measure those variables (units of measurement), and what these choices mean for understanding data and its real-world implications. These concepts are crucial for accurately interpreting regression results and making informed economic decisions.

Chapter References
  • Relevant chapters to refer to include:

    • Chapter 6.2: Slope coefficients interpretation

    • Chapter 6.4: Semi-log (or log-lin) model: semi-elasticities

    • Chapter 6.5: Lin-log model: semi-elasticities

    • Chapter 6.6: Polynomial equations

    • Chapter 4.1: Scaling and units of measurement

Slope Coefficients Interpretation (Section 4.1)
  • The marginal effect of one variable (X) on another (Y) is simply the slope of the relationship between them. This slope tells us how much Y changes for a small change in X. Mathematically, it's calculated as a derivative, dYdX\frac{dY}{dX}, which measures the instantaneous rate of change.

  • It is crucial to observe how variables are entered into the equation: either as their original values (natural units) or transformed using a natural logarithm (ln).

    • Linear Model Definition: This is the simplest model where variables are in their natural units.

      Y<em>i=β</em>1+β<em>2X</em>i+uiY<em>i = \beta</em>1 + \beta<em>2 X</em>i + u_i

      • Here:

        • YiY_i represents the dependent variable (the outcome we are trying to explain) for observation ii.

        • XiX_i represents the independent variable (the factor we believe influences Y) for observation ii.

        • β1\beta_1 is the intercept, indicating the value of YY when XX is zero.

        • β2\beta_2 is the slope coefficient, which is the primary focus of interpretation.

        • uiu_i represents the error term, accounting for unobserved factors or random noise.

    • Interpretation of β<em>2\beta<em>2: In this linear model, β</em>2\beta</em>2 tells us the actual number of units that YY is expected to change for every one-unit increase in XX, assuming all other factors remain constant.

Semi-log (Log-Lin) Model (Section 4.2)
  • Log-Lin Model Definition: In this model, both the dependent variable (Y) and the independent variable (X) are transformed using their natural logarithms.

    lnY<em>i=β</em>1+β<em>2lnX</em>i+ui\ln Y<em>i = \beta</em>1 + \beta<em>2 \ln X</em>i + u_i

    - The natural logarithm (often written as `ln`) helps us interpret changes in terms of percentages rather than absolute units.
    
    • Interpretation of β<em>2\beta<em>2: Here, β</em>2\beta</em>2 is called the elasticity of YY with respect to XX. This means it represents the approximate percentage change in YY that results from a one percent change in XX. For example, if β2\beta_2 is 0.5, a 1% increase in XX leads to a 0.5% increase in YY.

  • The transformation from absolute changes to relative (percentage) changes can be understood as follows:

    • When we talk about changes in variables:

      • Change in YY: dYdY refers to a change in YY expressed in its original units.

      • Change in XX: dXdX refers to a change in XX expressed in its original units.

      • Change in ln(Y)\ln(Y): dln(Y)d\ln(Y) is approximately equal to dYY\frac{dY}{Y} which represents the relative change (or percentage change if multiplied by 100) in YY.

      • Change in ln(X)\ln(X): dln(X)d\ln(X) is approximately equal to dXX\frac{dX}{X} which represents the relative change (or percentage change if multiplied by 100) in XX.

    • Therefore, when both sides of the equation are in logarithms, the coefficient β2\beta_2 directly captures the ratio of the relative change in YY to the relative change in XX.

Interpretation of Semi-elasticity in Semi-log Models
  • In the semi-log model, only the dependent variable (Y) is transformed using its natural logarithm, while the independent variable (X) remains in its natural units.

    • Model Definition:

      ln(Y<em>t)=β</em>1+β<em>2X</em>t+ut\ln(Y<em>t) = \beta</em>1 + \beta<em>2 X</em>t + u_t

    • Interpretation for β<em>2\beta<em>2: In this case, β</em>2\beta</em>2 represents the approximate relative change in YY (as a decimal) for a one-unit increase in XX. To express this as a percentage, you multiply β2\beta_2 by 100.

      • So, β<em>2×100\beta<em>2 \times 100 gives the percentage change in YY due to a one-unit increase in XX. For example, if β</em>2\beta</em>2 is 0.04, a one-unit increase in XX leads to an approximately 4% increase in YY.

Practical Example of Semi-log Model
  • Consider a model used to analyze the growth of an economy's Gross Domestic Product (GDP) over time:

    • Model Used for GDP: ln(GDP<em>t)=β</em>1+β<em>2t+u</em>t\ln(GDP<em>t) = \beta</em>1 + \beta<em>2 t + u</em>t

    • Here, tt denotes time, typically measured in years.

    • Interpretation of β<em>2\beta<em>2: In this context, β</em>2\beta</em>2 is directly interpreted as a growth rate. For example, if β2\beta_2 is 0.04:

      • An increase of one year (tt increases by 1) results in an approximate (0.04×100)%=4%(0.04 \times 100)\% = 4\% growth rate in GDP. This means GDP is expected to increase by 4% each year.

Lin-log Model (Section 4.3)
  • This model is the reverse of the semi-log model: the dependent variable (Y) is in natural units, but the independent variable (X) is transformed using its natural logarithm.

    • Lin-Log Model Definition:

      Y<em>i=β</em>1+β<em>2ln(X</em>i)+uiY<em>i = \beta</em>1 + \beta<em>2 \ln(X</em>i) + u_i

    • Interpretation of β<em>2\beta<em>2: Here, β</em>2\beta</em>2 measures the units change in YY resulting from a one percent increase in XX. Specifically, for a 1% increase in XX, YY is expected to change by β2100\frac{\beta_2}{100} units.

  • Example: Let's look at expenditure on services as a function of total personal consumption expenditure:

    • Expenditure on services<em>t=β</em>1+β<em>2ln(Total personal consumption</em>t)+ut\text{Expenditure on services}<em>t = \beta</em>1 + \beta<em>2 \ln(\text{Total personal consumption}</em>t) + u_t

    • Empirical Findings from 1975-2006: If a study found that β<em>2\beta<em>2 was 18.44, then a 1% increase in total personal consumption is associated with an increase of 18.44100=0.1844\frac{18.44}{100} = 0.1844 units in expenditure on services. If Y is measured in billions of dollars, this would mean an increase of $0.1844 billion, or $184.4 million, in expenditure on services, not $18.44 billion as implied in the original note's example. To clarify for the note, if β</em>2\beta</em>2 is 1844 (for example) then a 1% increase in consumption increases expenditure by $18.44 billion.

Polynomial Equations (Section 4.4)
  • Polynomial functions are used when the relationship between variables is not a simple straight line but rather curves. They are frequently employed in economic contexts such as modeling cost functions (how total cost changes with production) or production functions (how output changes with input).

  • Quadratic Function Definition: This involves XX and XX squared (X2X^2), creating a single bend or curve.

    Y<em>i=β</em>1+β<em>2X</em>i+β<em>3X</em>i2+uiY<em>i = \beta</em>1 + \beta<em>2 X</em>i + \beta<em>3 X</em>i^2 + u_i

  • Cubic Function Definition: This includes XX, XX squared (X2X^2), and XX cubed (X3X^3), allowing for two bends in the curve.

    Y<em>i=β</em>1+β<em>2X</em>i+β<em>3X</em>i2+β<em>4X</em>i3+uiY<em>i = \beta</em>1 + \beta<em>2 X</em>i + \beta<em>3 X</em>i^2 + \beta<em>4 X</em>i^3 + u_i

  • Insights on the effect of XX on YY when using quadratic terms (only consider β3\beta_3 for the quadratic relationship):

    • If \beta_3 > 0: The model exhibits a U-shape (convex). This means YY initially decreases as XX increases, reaches a minimum point, and then starts to increase. A common example is an average cost curve, which initially falls due to economies of scale and then rises due to diminishing returns.

    • If \beta_3 < 0: The model exhibits an inverted U-shape (concave). This means YY initially increases as XX increases, reaches a maximum point, and then starts to decrease. An example is an average product curve, where adding more input initially boosts productivity but eventually leads to reduced average output.

  • The stationary point (maximum or minimum of YY) is determined by the first-order condition (F.O.C). This is found by setting the first derivative of the function with respect to XX to zero:

    • δYδX=0\frac{\delta Y}{\delta X} = 0

    • In simpler terms, we are finding the point on the curve where the slope is perfectly flat (zero). This flat point indicates either the highest (maximum) or lowest (minimum) value of YY along that curve, given the values of XX.

Implications of Scaling and Units of Measurement (Section 4.5)
  • Regression analysis is flexible and allows data to be expressed in various measurement units, such as individual units, thousands, millions, or billions. While convenient, it has important implications for interpreting regression results.

  • Important implications regarding Regression results:

    • The estimated coefficients (intercepts and slopes) will change if the units of measurement for XX or YY are changed. However, the fundamental interpretation of the underlying economic relationship, the R-squared (which measures how well the model fits the data), and the statistical significance of the coefficients remain the same.

    • Example: Analyzing pints of beer sold (YY) based on price (XX).

      • Suppose a model gives us the following coefficients:

        • β2=85\beta_2 = -85: This means if the price (XX) increases by $1 (assuming XX is in dollars), the consumption of beer (YY) is expected to decrease by 85 pints.

        • β1=1200\beta_1 = 1200: This is the consumption in pints when the price is zero (though this might not be economically relevant, it's the intercept).

      • Conversion of price to pence: If we re-estimate the model but measure price in pence instead of dollars ($1 = 100$ pence), the new slope coefficient (γ2\gamma_2) would adjust.

        • The new model might look like: Y<em>i=γ</em>1+γ<em>2Xp</em>i+uiY<em>i = \gamma</em>1 + \gamma<em>2 X^p</em>i + u_i (where XpX^p is price in pence).

        • The new slope γ<em>2\gamma<em>2 would be calculated as β</em>2100\frac{\beta</em>2}{100}. So, γ2=85100=0.85\gamma_2 = \frac{-85}{100} = -0.85. This means if the price increases by 1 penny, consumption would decline by 0.85 pints. The impact is the same; it's just scaled correctly for the new unit.

      • Changing YY from pints to half-pints: If we decide to measure consumption in half-pints (Y2<em>iY^2<em>i) instead of pints (Y</em>iY</em>i), where Y2<em>i=2×Y</em>iY^2<em>i = 2 \times Y</em>i, the coefficients would again adjust.

        • The new model might be: Y2<em>i=δ</em>1+δ<em>2X</em>i+u<em>iY^2<em>i = \delta</em>1 + \delta<em>2 X</em>i + u<em>i. Here, δ</em>1=2×β<em>1\delta</em>1 = 2 \times \beta<em>1 and δ</em>2=2×β2\delta</em>2 = 2 \times \beta_2. So, an increase of $1 in price would now lead to a decrease of 2×85=1702 \times 85 = 170 half-pints.

  • These examples demonstrate how changes in the units of measurement for either the dependent or independent variables lead to corresponding adjustments in the regression coefficients. While the numerical values of the coefficients change, the underlying economic relationship and the interpretation of the impact (e.g., that an increase in price reduces consumption) remain consistent. This understanding is crucial for correctly interpreting models, especially when dealing with large numbers like government expenditures over time, where units (pounds, millions, billions) can drastically alter coefficient values but not the fundamental economic insight like growth metrics or the trend in governmental expenditure analysis.