Repeated-Measures ANOVA Notes
Repeated-Measures ANOVA (Within-Subjects ANOVA)
Learning Objectives
Describe repeated-measures ANOVA (within-subjects ANOVA).
Understand the rationale of repeated-measures ANOVA, including its benefits and potential issues.
Conduct and interpret ANOVA.
Report ANOVA results.
Experimental Designs
Between-subjects design: Two or more independent groups of subjects are tested (e.g., males vs. females, depressed vs. controls, young vs. old).
Repeated-measures design (aka within-subjects design): The same subjects are tested under all experimental conditions.
Example: One group of muppets receives treatment 1, then treatment 2, and then treatment 3.
Example of Between-Subjects Experiment
Research question: Does weather affect math problem solving?
27 undergrads assigned to 1 of 3 conditions (groups):
Raining outside
Snowing outside
Sunny outside
Subjects took 30 minutes to solve problems.
H0 (Null Hypothesis): No significant difference in the number of problems solved between the 3 conditions.
H1 (Alternative Hypothesis): There will be at least two means that differ significantly.
Variability Between Subjects
In between-subjects designs, random sampling aims to cancel out individual characteristics across groups.
Repeated-Measures ANOVA: Rationale
Individual differences should matter less because the same people are tested in all conditions.
Repeated ANOVA: Sources of Variance
Variance created by the experimental manipulation.
Variance created by individual differences is 'cancelled out' by repeatedly testing the same participants.
Benefits of Repeated Measures Designs
Increased sensitivity: Error variance is reduced by 'cancelling out' individual differences.
Economy: Fewer participants are needed.
Counteracting Fatigue/Learning: Randomization and counterbalancing are used to mitigate fatigue or learning effects.
Repeated Measure ANOVA Example
Research question: Effect of weather on problem-solving.
IV (Independent Variable): Weather conditions
DV (Dependent Variable): Number of problems solved
Design: One-way repeated measures ANOVA
H0: No significant difference in number of problems solved in 3 conditions.
H1: At least one pair of conditions will differ.
Partitioning the Variance
Analogy: It's all about the cake!
Partitioning Variability for Between-Subjects ANOVA
Total variability
Between-group variability
Within-group (error) variability
Partitioning Variability for Repeated-Measures ANOVA
Mean differences reflect:
Experimental manipulation (weather)
Error
Error here is variability within each person, not between people.
Subject 1’s performance change between conditions is partly due to treatment and partly due to 'natural' variability.
The goal is to 'cancel out' this 'natural' variability.
Within-Subjects Variability (SS_{within-subjects})
Calculate variability of each subject around their own mean and sum them all up.
SS{within-subjects} = SS{treatment} + SS_{residual (error)}
Determining if there is more treatment variability or 'error' variability in SS_{within-subjects}.
Treatment (Model) Variability (SS_{treatment})
Variability of treatment means from the grand mean.
The Repeated-Measures ANOVA Ratio
SS{model} and SS{residual} are adjusted by df (degrees of freedom) to produce MS{model} and MS{residual}.
The df error for repeated measures is (nr. treatments – 1)*(nr. participants – 1).
Calculate the F-ratio as usual.
Comparing to Between-Subjects ANOVA
Unsystematic (residual or error) variability = individual differences + differences in how someone behaves at different times.
Individual differences are a factor in between-subjects ANOVA but not in repeated-measures ANOVA.
Residual variability 'should' be lower in repeated-measures ANOVA.
SS_{model} is the same (variation of the three treatment means around grand mean).
The error (or residual) variance is different (smaller for the repeated-measures).
Reducing the denominator of the F-ratio increases the F-ratio.
Assumptions in Repeated-Measures ANOVA
Sphericity assumption: "The variance of the difference across pairs of conditions (treatments) should be almost the same."
Assessed using Mauchly’s test.
p < .05: Sphericity is violated.
p > .05: Sphericity is not violated.
What is Sphericity?
Is the variance in the difference columns too different?
Mauchly’s test determines if the difference is too different.
At least 3 columns (i.e., conditions) are needed to compare the variance of these differences.
Mauchly’s Test
"Mauchly's Test of Sphericity indicated that the assumption of sphericity has not been violated, \chi^2(2) = 3.35, p = .19."
Mauchly’s test has a \chi^2 distribution.
Corrections for Violation of Sphericity Assumption
Greenhouse-Geisser
Huynh-Fieldt
Estimate
These corrections rely on adjusting the degrees of freedom.
Repeated Measures ANOVA: Main Output Table
Greenhouse-Geisser is a correction used when the sphericity assumption is not met (Mauchly’s test p < .05).
This correction 'corrects' the degrees of freedom.
Interpreting Results
If F-value is significant, at least one pair of means should be different.
Posthocs for Repeated-Measures ANOVA
Some of the familiar post-hoc tests are not valid with repeated-measures ANOVA.
Posthoc Tests in Repeated Measures ANOVA
Compare all pairs of means.
Some posthocs are more conservative than others.
Less 'powerful'
Less 'sensitive'
More likely to result in a 'false-negative' error.
The Bonferroni test is more conservative than the Holm test.
Many posthocs are based on a t-test but with the p-value adjusted to penalize the number of tests carried out.
Repeated Measures Design ANOVA with Two IVs
Example: Test participants on both easy and difficult problems under three weather conditions.
IV1: weather
IV2: problem difficulty
DV: number of problems solved
*Note that participants contribute scores to each experimental condition.
Repeated Measures ANOVA: Another Example
Each column represents a combination of weather condition and task’s difficulty.
Table: Tests of repeated-measures Effects
Factor names are tested
Test of Sphericity: Cannot perform sphericity tests because there are only two levels of the RM factor, or because the SSP matrix is singular.
Interaction Term
What is this “Weather * Problem Difficulty ” term and why is it significant?
An interaction term indicates that the effect of weather significantly depends on how difficult the problems were.
Readings
Corresponding book chapters/pages
Timetable Week | Lecture Topic | Tutorial Topic |
|---|---|---|
26 (13 Jan) | 1.Introduction & Revision of PS2103 stats | |
2. One-way between-subjects | One way between-subjects ANOVA | |
27 (20 Jan) | 1. One-way repeated measures | ANOVA (incl. revision) |
2. Variations on the 1-way ANOVA: | ANCOVA/Repeated-Measures ANOVA | |
28 (27 Jan) | 1. Factorial ANOVA | |
2. Individual Differences in | Experimental Research | |
29 (3 Feb) | 1. Mixed ANOVA, some ANOVA | Mixed designs |
extensions | ||
2. Non-parametric alternatives | ||
30 (10 Feb) | Results write-up exercise | |
31 (17 Feb) | Reading Week |