Repeated-Measures ANOVA Notes

Repeated-Measures ANOVA (Within-Subjects ANOVA)

Learning Objectives

Describe repeated-measures ANOVA (within-subjects ANOVA).
Understand the rationale of repeated-measures ANOVA, including its benefits and potential issues.
Conduct and interpret ANOVA.
Report ANOVA results.

Experimental Designs

Between-subjects design: Two or more independent groups of subjects are tested (e.g., males vs. females, depressed vs. controls, young vs. old).
Repeated-measures design (aka within-subjects design): The same subjects are tested under all experimental conditions.
- Example: One group of muppets receives treatment 1, then treatment 2, and then treatment 3.

Example of Between-Subjects Experiment

Research question: Does weather affect math problem solving?
27 undergrads assigned to 1 of 3 conditions (groups):
- Raining outside
- Snowing outside
- Sunny outside
Subjects took 30 minutes to solve problems.
H0 (Null Hypothesis): No significant difference in the number of problems solved between the 3 conditions.
H1 (Alternative Hypothesis): There will be at least two means that differ significantly.

Variability Between Subjects

In between-subjects designs, random sampling aims to cancel out individual characteristics across groups.

Repeated-Measures ANOVA: Rationale

Individual differences should matter less because the same people are tested in all conditions.

Repeated ANOVA: Sources of Variance

Variance created by the experimental manipulation.
Variance created by individual differences is 'cancelled out' by repeatedly testing the same participants.

Benefits of Repeated Measures Designs

Increased sensitivity: Error variance is reduced by 'cancelling out' individual differences.
Economy: Fewer participants are needed.
Counteracting Fatigue/Learning: Randomization and counterbalancing are used to mitigate fatigue or learning effects.

Repeated Measure ANOVA Example

Research question: Effect of weather on problem-solving.
IV (Independent Variable): Weather conditions
DV (Dependent Variable): Number of problems solved
Design: One-way repeated measures ANOVA
H0: No significant difference in number of problems solved in 3 conditions.
H1: At least one pair of conditions will differ.

Partitioning the Variance

Analogy: It's all about the cake!

Partitioning Variability for Between-Subjects ANOVA

Total variability
Between-group variability
Within-group (error) variability

Partitioning Variability for Repeated-Measures ANOVA

Mean differences reflect:
- Experimental manipulation (weather)
- Error
Error here is variability within each person, not between people.
Subject 1’s performance change between conditions is partly due to treatment and partly due to 'natural' variability.
The goal is to 'cancel out' this 'natural' variability.

Within-Subjects Variability (SS_{within-subjects})

Calculate variability of each subject around their own mean and sum them all up.
SS{within-subjects} = SS{treatment} + SS_{residual (error)}
Determining if there is more treatment variability or 'error' variability in SS_{within-subjects}.

Treatment (Model) Variability (SS_{treatment})

Variability of treatment means from the grand mean.

The Repeated-Measures ANOVA Ratio

SS{model} and SS{residual} are adjusted by df (degrees of freedom) to produce MS{model} and MS{residual}.
The df error for repeated measures is (nr. treatments – 1)*(nr. participants – 1).
Calculate the F-ratio as usual.

Comparing to Between-Subjects ANOVA

Unsystematic (residual or error) variability = individual differences + differences in how someone behaves at different times.
Individual differences are a factor in between-subjects ANOVA but not in repeated-measures ANOVA.
Residual variability 'should' be lower in repeated-measures ANOVA.
SS_{model} is the same (variation of the three treatment means around grand mean).
The error (or residual) variance is different (smaller for the repeated-measures).
Reducing the denominator of the F-ratio increases the F-ratio.

Assumptions in Repeated-Measures ANOVA

Sphericity assumption: "The variance of the difference across pairs of conditions (treatments) should be almost the same."
Assessed using Mauchly’s test.
- p < .05: Sphericity is violated.
- p > .05: Sphericity is not violated.

What is Sphericity?

Is the variance in the difference columns too different?
Mauchly’s test determines if the difference is too different.
At least 3 columns (i.e., conditions) are needed to compare the variance of these differences.

Mauchly’s Test

"Mauchly's Test of Sphericity indicated that the assumption of sphericity has not been violated, \chi^2(2) = 3.35, p = .19."
Mauchly’s test has a \chi^2 distribution.

Corrections for Violation of Sphericity Assumption

Greenhouse-Geisser
Huynh-Fieldt
Estimate
These corrections rely on adjusting the degrees of freedom.

Repeated Measures ANOVA: Main Output Table

Greenhouse-Geisser is a correction used when the sphericity assumption is not met (Mauchly’s test p < .05).
This correction 'corrects' the degrees of freedom.

Interpreting Results

If F-value is significant, at least one pair of means should be different.

Posthocs for Repeated-Measures ANOVA

Some of the familiar post-hoc tests are not valid with repeated-measures ANOVA.

Posthoc Tests in Repeated Measures ANOVA

Compare all pairs of means.
Some posthocs are more conservative than others.
- Less 'powerful'
- Less 'sensitive'
- More likely to result in a 'false-negative' error.
The Bonferroni test is more conservative than the Holm test.
Many posthocs are based on a t-test but with the p-value adjusted to penalize the number of tests carried out.

Repeated Measures Design ANOVA with Two IVs

Example: Test participants on both easy and difficult problems under three weather conditions.
IV1: weather
IV2: problem difficulty
DV: number of problems solved
*Note that participants contribute scores to each experimental condition.

Repeated Measures ANOVA: Another Example

Each column represents a combination of weather condition and task’s difficulty.

Table: Tests of repeated-measures Effects

Factor names are tested
Test of Sphericity: Cannot perform sphericity tests because there are only two levels of the RM factor, or because the SSP matrix is singular.

Interaction Term

What is this “Weather * Problem Difficulty ” term and why is it significant?
An interaction term indicates that the effect of weather significantly depends on how difficult the problems were.

Readings

Corresponding book chapters/pages

Timetable Week	Lecture Topic	Tutorial Topic
26 (13 Jan)	1.Introduction & Revision of PS2103 stats
	2. One-way between-subjects	One way between-subjects ANOVA
27 (20 Jan)	1. One-way repeated measures	ANOVA (incl. revision)
	2. Variations on the 1-way ANOVA:	ANCOVA/Repeated-Measures ANOVA
28 (27 Jan)	1. Factorial ANOVA
	2. Individual Differences in	Experimental Research
29 (3 Feb)	1. Mixed ANOVA, some ANOVA	Mixed designs
	extensions
	2. Non-parametric alternatives
30 (10 Feb)		Results write-up exercise
31 (17 Feb)	Reading Week