Follow-up Comparisons and ANOVA

A significant F test (assuming more than 2 levels to an IV) indicates statistically significant differences among groups, but doesn't specify which groups differ.
Follow-up comparisons are necessary to examine the differences between the means of the groups of interest.
The reason for using the omnibus test before multiple t-tests is to control for family-wise Type I error.

The alpha rate for significance tests in psychology is conventionally set at 0.05, meaning there's a 5% chance of making a Type I error for each test.
When conducting multiple tests on the same data, the risk of committing a Type I error increases.

If comparing three diet groups (Fruit, Veggie, Donut) with three t-tests (Fruit vs. Veggie, Fruit vs. Donut, Veggie vs. Donut) without correction, the Type I error rate would approximate to 0.15.
This elevated error rate increases the likelihood of incorrectly identifying an effect as real.

It's crucial to account for family-wise error (FWE) when performing multiple comparisons.
Two main approaches exist to address this issue:
- Post Hoc Procedures
- Planned Comparisons

Post Hoc procedures are appropriate when there is no a priori theoretical basis for expecting specific group differences.
- For example, one might expect that diet affects happiness without knowing how.
The atheoretical nature of post hoc tests is acceptable and can be advantageous.
However, post hoc tests tend to be more conservative than planned comparisons, because they are not guided by theory.

Various post hoc procedures are available, generally falling into two categories:
- Adjusting the Type I error rate to accommodate multiple comparisons.
- Calculating a new, more conservative test statistic.
Both approaches are more conservative compared to planned comparisons.

The Bonferroni correction adjusts the alpha value for multiple comparisons.
Formula: $α<em>B = {α</em>{FWE} \over c}$
- $α_B$ is the corrected alpha level.
- $α_{FWE}$ is the desired family-wise error rate (usually 0.05).
- $c$ is the number of comparisons.

Example:
- Fruit vs. Veggie: t = -0.658, p = 0.514
- Fruit vs. Donut: t = -2.840, p = 0.007
- Veggie vs. Donut: t = -2.386, p = 0.022
With the Bonferroni correction (αB = 0.016), only the Fruit vs. Donut comparison is significant (p = 0.007 < 0.016).

Tukey's HSD test calculates a new test statistic representing the minimum mean difference required for statistical significance.
Assumes all means are being compared.
SPSS can calculate this.

Dependent Variable: Happy

(I) Group	(J) Group	Mean Difference (I-J)	Std. Error	Sig.
Tukey HSD
Fruit	Veggie	-.20000	.33103	.818
	Donut	-1.00000	.33103	.010
Veggie	Fruit	.20000	.33103	.818
	Donut	-.80000	.33103	.049
Donut	Fruit	1.00000	.33103	.010
	Veggie	.80000	.33103	.049
Bonferroni
Fruit	Veggie	-.20000	.33103	1.000
	Donut	-1.00000	.33103	.011
Veggie	Fruit	.20000	.33103	1.000
	Donut	-.80000	.33103	.057
Donut	Fruit	1.00000	.33103	.011
	Veggie	.80000	.33103	.057

Post Hoc tests are conservative, making it more difficult to achieve statistical significance.
- It involves a trade-off between Type I and Type II error.
The level of conservativeness varies among different post hoc tests.
Post Hoc tests are best suited for situations with a significant omnibus F-test but no specific predicted differences.
Their conservative nature is appropriate in such cases. In other words, post hoc tests are perfect when you use them for what they were designed:
- Significant omnibus F, but no specific differences predicted.
- Appropriate to be conservative.