Chapter 11: Multiple Regression and Correlation Flashcards

1. Introduction to Multivariate Relationships

  • Bivariate regression: relationship between two quantitative variables.

  • Multivariate regression: includes multiple explanatory variables to improve prediction and control for confounding.

  • Purpose: Identify whether associations change or disappear when controlling for other variables.

2. Multiple Regression Function

  • General form (two predictors):
    E(y) = α + β₁x₁ + β₂x₂

  • Each β coefficient = partial regression coefficient (effect of a predictor while holding others constant).

  • With more predictors: add more βx terms.

3. Interpretation of Parameters

  • α (intercept): Mean of y when all predictors = 0.

  • βᵢ: Change in mean of y for a one‑unit increase in xᵢ, controlling for all other predictors.

4. Example: Education & Crime in Florida

  • Bivariate model:
    E(y) = –51.3 + 1.5x₁ (crime ↑ as education ↑)

  • But education and urbanization are highly correlated → spurious correlation.

  • Multiple regression model:
    E(y) = 58.9 – 0.6x₁ + 0.7x₂

  • Controlling for urbanization reverses the effect → Simpson’s Paradox.

5. Prediction Equation & Residuals

  • Prediction:
    ŷ = a + b₁x₁ + b₂x₂ + … + bₖxₖ

  • Residual: y – ŷ

  • SSE: Σ(y – ŷ)²

  • Least squares chooses coefficients that minimize SSE.

6. Case Study: Mental Health in Alachua County

  • Predictors:

    • x₁ = Life events

    • x₂ = SES

  • Regression equation:
    ŷ = 28.230 + 0.103x₁ – 0.097x₂

  • Example: For x₁ = 46, x₂ = 84 → ŷ = 24.8; residual = –7.8.

7. R and R²

  • R: Correlation between observed y and predicted ŷ.

  • R²: Proportion of variance in y explained by all predictors.

  • Formula:
    R² = (TSS – SSE) / TSS

  • Properties:

    • Between 0 and 1

    • Cannot decrease when adding predictors

8. Multicollinearity

  • Predictors highly correlated with each other → unstable coefficients.

  • Ideal: predictors strongly related to y but weakly related to each other.

  • Rule of thumb: sample size ≥ 10 × number of predictors.

9. Inference for Regression Coefficients

  • Global F‑test: tests all βᵢ = 0.

  • Individual t‑tests: test βᵢ = 0.

  • df = n – (k + 1)

10. ANOVA Components

  • Conditional SD:
    s = √(SSE / [n – (k + 1)])

  • MSE = Residual Mean Square

  • F = Regression MS / Residual MS

  • Relationship: F = t² when df₁ = 1.

11. Interaction

  • Interaction model:
    E(y) = α + β₁x₁ + β₂x₂ + β₃x₁x₂

  • Slope of x₁ depends on x₂:
    β₁ + β₃x₂

  • Centering predictors improves interpretability.

12. Comparing Models

  • Complete model vs. reduced model

  • F‑test for comparing SSE values:
    F = [(SSEᵣ – SSE꜀)/df₁] / [SSE꜀/df₂]

13. Partial Correlation

  • Measures association between y and x₁ controlling for x₂.

  • Formula:
    rᵧₓ₁·ₓ₂ = (rᵧₓ₁ – rᵧₓ₂ rₓ₁ₓ₂) / √[(1 – rᵧₓ₂²)(1 – rₓ₁ₓ₂²)]

14. Standardized Coefficients (Beta Weights)

  • β* = bᵢ × (sₓᵢ / sᵧ)

  • Allows comparison across predictors with different units.

  • Standardized prediction equation has no intercept.


What is the purpose of a multivariable regression model?
To improve prediction of y and analyze relationships while controlling for other variables.

What is the general form of a multiple regression function with two predictors?
E(y) = α + β₁x₁ + β₂x₂.

What does α represent in a regression model?
The population mean of y when all predictors equal 0.

What does a β coefficient represent?
The change in mean of y for a one‑unit increase in a predictor, holding all others constant.

What is Simpson’s Paradox in the education–crime example?
The effect of education reverses direction when controlling for urbanization.

What is a residual?
The difference between observed y and predicted ŷ.

What is SSE?
The sum of squared residuals: Σ(y – ŷ)².

What is the prediction equation for the mental health study?
ŷ = 28.230 + 0.103x₁ – 0.097x₂.

What does R measure?
The correlation between observed y and predicted ŷ.

What does R² measure?
The proportion of variance in y explained by all predictors.

Can R² decrease when adding predictors?
No, R² cannot decrease when a predictor is added.

What is multicollinearity?
A condition where predictors are highly correlated with each other.

What is the global F-test used for?
Testing whether all β coefficients equal 0.

What is the t-test used for in regression?
Testing whether an individual β coefficient equals 0.

What is the formula for conditional standard deviation s?
s = √(SSE / [n – (k + 1)]).

What is the relationship between F and t for a single predictor?
F = t².

What is an interaction term?
A term like β₃x₁x₂ indicating that the effect of one predictor depends on another.

Why are variables centered in interaction models?
To make coefficients interpretable at the mean of other predictors.

What is a complete model?
A model containing all predictors.

What is a reduced model?
A simpler model nested within the complete model.

What is partial correlation?
The association between y and one predictor while controlling for others.

What is a standardized regression coefficient?
The effect of a one‑SD increase in a predictor on y, in SD units.

Why does the standardized regression equation have no intercept?
Because standardized variables have mean 0, so predicted z is 0 when all predictors are 0.