Wk 6: Multi-Factor Between-Participant Designs - Regression
Correlation, Z-Scores, and Variance Explained
Correlation and variance explained are two variables that are fundamentally linked; stronger correlations imply that more variance is explained in the criterion.
This connection allows for the estimation of statistics in regression without solving for regression coefficients, predicting data, or manually summing squared residuals. Instead, sets of correlation coefficients can be used to calculate .
Correlation is a standardized measure of the degree of association between two variables, which can be calculated in two ways:
Calculating the covariance first and then standardizing.
Standardizing the variables into Z-scores first and then calculating the covariance.
Standardization before calculating covariance is often easier. If systematic variance exceeds unsystematic variance, the vector will correlate with the criterion.
This means positive Z-scores on the vector will correspond with positive Z-scores on the criterion, and negative Z-scores will correspond similarly.
The average product of Z-scores results in the correlation coefficient ().
Multi-Factor Between-Participant Designs
A multi-factor design involves the manipulation of more than one independent variable (IV) or factor.
e.g. medication x2, and therapy type x2
It is termed a factorial design if all possible conditions created by crossing the factors are included in the study.
In a factorial design, researchers examine two types of effects:
Main Effect: The effect of one factor considered separately from the effect of the other factor.
Interaction: The effect of one factor in combination with another factor. This occurs when the effect of Factor A differs across different levels of Factor B (and vice versa).
Example of a 2-factor design:
Main effect of Factor A (averaged across levels of Factor B).
Main effect of Factor B (averaged across levels of Factor A).
An interaction effect ().
Factorial Designs in Regression and Vector Coding
Estimating main effects and interactions in regression follows the same process as one-way designs: coding vectors to capture between-groups variability.
Vector Counts:
In one-way designs, there is one set of vectors containing one less vector than the number of conditions.
In factorial designs, sets of vectors are created for each main effect and each interaction.
In a design, there is 1 vector for the first factor, 1 vector for the second factor, and 1 vector for the interaction.
Coding Schemes:
Dummy-coding: Participants in one condition are coded as and others as .
Contrast-coding: Participants in one condition are coded as and others as .
3-Level Factors: These require two vectors ( and ) to capture the variability among three groups.
Coding the Interaction:
Interaction vectors are created by multiplying the vectors coding for the main effects.
For a design with (Factor A) and (Factor D), the interaction vector () is calculated as .
For a design with Factor A (, ) and Factor D (), there are two interaction vectors: and .
Regression Equations for Factorial Designs
The linear regression equation adds terms for every coded vector.
Design Equation:
is the slope for Factor A; is the slope for Factor D; is the slope for the interaction.
Design Equation:
and represent the slopes for the two vectors of Factor A; represents Factor D; and represent the interaction vectors.
Interpretation of Intercept () and Slopes ():
Dummy-coding: is the mean of the category coded as across all vectors (a single cell). for a main effect vector is the difference between the relevant cell in the same row/column as and the value of .


Contrast-coding:
is the grand mean.
= difference between mean of category coded 1 on that vector and grand mean

green: A2 marginal mean vs grand mean
blue: A3 marginal mean vs grand mean
Vector Sets and Calculating
To derive values for each effect (Main Effect A, Main Effect D, Interaction AXD), separate values must be calculated: , , and .
variance explained by the whole model
variance explained by factor A
variance explained by factor D
variance explained by interaction

If a set contains only a single vector, is simply the square of the correlation between that vector and the criterion ().
If a set contains more than one vector (e.g., Factor A in a design), the inter-correlation between those vectors must be controlled using the formula:
Where and are correlations with the criterion and is the correlation between the vectors in the set.
Inference and Significance Testing
Residual Variance: .
Degrees of Freedom ():
(number of vectors for Factor A).
(number of vectors for Factor D).
(product of the degrees of freedom of the main factors).
.
Mean Squares ():
F-Ratios:
.
If F_{obs} > F_{crit}, reject the null hypothesis ().

Example 1: Factorial Design
Design: 2 (Animal Type: Approach/Avoid) 2 (Mood: Positive/Negative) Between-Participants.
Animal Groups:
Approach: dog, cat, rabbit, beaver, quokka.
Avoid: lion, tiger, bear, rhino, bison.
Dependent Variable (DV): Number of animals remembered from a list.
Data Recap:
Grand mean () = .
(Approach mean - Grand mean) = .
(Positive mean - Grand mean) = .
Step-by-Step Results:
Correlation () with criterion: , , .
, , .
.
Degrees of freedom: , , , .
Mean Squares: , , , .
F-ratios:
Animal Type: (p < 0.001).
Mood: (p < 0.001).
Interaction: (p < 0.001).
Interpretation: Significant main effects (greater memory for avoid animals and negative moods) and a significant interaction.
Example 2: Factorial Design
Design: 3 (Animal Type: Approach/Avoid/Neutral) 2 (Mood: Positive/Negative).
Contrast Coding Scheme:
V1 (Avoid vs. Others): Avoid = , Approach = , Neutral = .
V2 (Neutral vs. Others): Neutral = , Approach = , Avoid = .
V3 (Mood): Positive = , Negative = .
Calculations:
Grand Mean () = .
(Avoid Mean - Grand Mean) = .
(Neutral Mean - Grand Mean) = .
(Positive Mean - Grand Mean) = .
Correlations and :
Animal Type Set (): , , . .
Mood Set (): . .
Interaction Set (): , , . .
F-Statistics:
, , .
Animal Type: (p < 0.001).
Mood: (p < 0.001).
Interaction: (p < 0.001).
Effect Sizes
While Significance testing (-values) helps decide whether to reject , effect sizes describe the magnitude.
: The proportion of total criterion variance explained by an effect. However, it does not account for the overlap or presence of other effects.
Partial : The proportion of residual criterion variance explained by an effect, controlling for other effects.
Formula: .
Cut-offs for and Partial :
Small: >0.01 to .
Medium: >0.06 to .
Large: >0.14.
Comparison Example ():
Animal Type: ; Partial .
Mood: ; Partial .
Interaction: ; Partial .