PSCH 443 Multiple Regression Overview and Example
Overview of Multiple Regression
Definition: Multiple regression aims to explain the variation in an outcome variable (Y) using multiple predictor variables (X).
Goal: Similar to bivariate regression, it seeks to explain why some individuals score high or low on the Y variable.
Overlap Consideration: It is essential to account for the unique contributions of each predictor as well as the overlap between predictors themselves concerning their influence on the outcome variable.
Model Parameters and Techniques
Estimate Model Parameters:
Use least squares techniques for estimating model parameters.
Include an intercept, multiple B parameters, and regression weights for each predictor.
Evaluate Model Fit:
Assess overall model fit using R-squared.
Examine individual predictors' contributions to ascertain if they add meaningful value to the model.
Structure of the Multiple Regression Equation
Generic Structure: Similar to a simple regression equation, including:
A parameter: Intercept (value of Y when all X's are zero).
B weights: Slope parameters for each predictor variable.
Error term: Accounts for variability not explained by the predictors.
Example of Multiple Regression Analysis
Context
Scenario: Assess whether GRE scores can predict graduate school performance.
Dataset Details:
36 participants with variables including undergraduate GPA, GRE scores, and years to complete a master's degree.
Predictors:
GRE scores (verbal sub-score)
Undergraduate GPA
Outcome: Years taken to complete a master's program.
Initial Steps
Examine Bivariate Relationships:
Look at the correlation matrix between variables to identify expected associations (e.g., higher GPA correlates with higher GRE scores).
Check for potential issues such as unexpected correlation directions or strong correlations between predictor variables (multicollinearity).
Model Summary Statistics
Multiple Correlation (R): Correlation of all predictor variables with the outcome Y variable.
R-squared Value:
Represents the proportion of variance explained by predictors; in this example, about 26% of the variance in completion years is explained.
Adjusted R-squared:
Adjusts R-squared based on sample size and number of predictors; relevant when considering generalizability of the results.
Standard Error of Estimate: Reflects average prediction error, relevant for model accuracy.
ANOVA and Degrees of Freedom
ANOVA Overview: Ratio of effect sums of squares to error sums of squares to test the overall significance of the model.
Degrees of Freedom: Determines total degrees of freedom based on the number of participants.
Total DF: Total participants - 1 (e.g., 35 for 36 participants).
Regression DF: Coefficients estimated - 1 (e.g., 2 predictors + intercept = 3 - 1 = 2).
Coefficient Interpretation
Constant (Intercept): Found in the table as unstandardized coefficients.
B Weights:
Indicate the change in Y for a one-unit change in X.
Example: Holding GPA constant, each GRE point results in a 0.002 years reduction in completion time; a full GPA grade point increase results in a 0.634 years reduction.
Standardized Coefficients (Beta Weights): Reflect changes in Y in standard deviation units, allowing comparison of predictor effects.
GRE: A standard deviation increase predicts a 0.201 standard deviation reduction in years to completion.
GPA: A standard deviation increase predicts a 0.393 standard deviation reduction in years to completion.
Statistical Significance of Coefficients
T Statistics: Evaluate significance of B parameters (unstandardized coefficients).
GRE scores significance: P = 0.233 (not significant).
Undergraduate GPA significance: P = 0.023 (significant).
Practical Importance of Effects: Consider the size of coefficients in relation to real-world applications beyond p-values; significant testing alone does not confirm practical importance.
Assumptions and Next Steps
Importance of Assumptions: Results interpretation depends on meeting regression assumptions, types of regression used, and the overall purpose of the analysis.
Critical Evaluation: Emphasize understanding beyond statistical significance for practical application and insights.