Post Hoc Pairwise Comparisons of Means

8.2 Post Hoc Pairwise Comparisons of Means

Introduction

  • This lecture focuses on post hoc pairwise comparisons, which are essential for understanding the results of a one-way between-subjects ANOVA when a significant result is obtained.
  • In the previous lecture (8.1), a significant ANOVA result was obtained and this lecture will address the next steps.

Pairwise vs. Setwise Comparisons

  • Pairwise Comparison: Comparing two means (e.g., imagery group vs. modeling group).
  • Setwise Comparison: Comparing a combination of means against another (e.g., average of imagery and modeling groups vs. average of all self-talk groups).
  • This lecture will focus on pairwise comparisons. Setwise comparisons are more advanced and will be covered later in the course.

The Problem of Multiple Comparisons

  • When conducting multiple pairwise comparisons, the alpha level is altered, which increases the Type I error rate (false alarms).
  • With more than two groups, it's more likely to incorrectly conclude there's a significant difference when there isn't one.
  • Example: With 4 groups, there are 6 possible pairwise comparisons. This increases the alpha level from 0.05 to 0.27, meaning a 27% chance of making a Type I error due to inflated false alarms.
  • With 5 means, there are 10 comparisons.
  • Making many comparisons increases the likelihood of finding a significant result by chance.
  • Statistical adjustments are needed to account for this inflated error rate.

Post Hoc Tests

  • ANOVA: A significant ANOVA result is required before moving on to pairwise comparisons or post hoc tests.
  • Post Hoc: Means "after the fact." These tests are conducted after a significant ANOVA finding.
  • Example: A significant ANOVA effect was found that the F value was significant (less than 1 in 1000 chance).

Null and Alternate Hypotheses

  • Null Hypothesis: All means are equal.
  • Alternate Hypothesis: At least one mean is different.
  • Descriptive statistics can provide insights into which groups might be different. (e.g., If the instructional self-talk group has the lowest score and the imagery group has the highest, differences may be between these two groups).
  • If no significant difference between two groups, it's unlikely find a significant difference between other comparisons.

Independent vs. Non-Independent Comparisons

  • If we had 4 groups, that would mean we'd have 6 comparisons.
  • The alpha rate could explode up to 26.5 \%, which is much higher than the 0.05 \%. That means it is more than 5 times higher than the 0.05 \%.
  • If comparisons are not independent (e.g., pre-test vs. post-test), the experiment-wise error rate increases further.
  • With 10 groups, the 0.05 comparison rate could increase to a 50% chance of rejecting the null hypothesis.
  • The experiment-wise error rate increases with many comparisons and increases even further if the comparisons are not independent.

Pairwise Comparison Methods

  • Student's t-test: Used only when comparing two means.
  • Student-Newman-Keuls (SNK) test / Tukey test: Needed when comparing more than two means to control for the inflated error rate.

Pairwise Comparison Tests

  • Student's t-test Used when 2 means.
  • SNK test (Student Newman Kuels) Use when there are 3 means (SNK3).
  • Tukey test Use when there are four means.