Analysis of Variance Lecture Notes Review
EFT Quiz Feedback
- The feedback is only available when answers are incorrect due to Canvas limitations.
- The quiz is worth 1% of the final grade.
- If students encounter issues or want to retake the test, they can request additional attempts.
- The questions are designed to assess understanding of simple concepts; struggling indicates a need to review notes.
- The instructor is willing to provide full marks or allow retakes, emphasizing that it's low stakes.
- The goal is to identify areas for review rather than stress over the grade.
Introduction to Analysis of Variance (ANOVA)
- ANOVA is a powerful statistical analysis that can incorporate many predictors and factors.
- The course will progress from one-way ANOVA to two-way ANOVA, eventually covering multifactorial ANOVA.
- While large models with many parameters (like large language models) can be built, they may sacrifice understanding for predictive power.
One-Way ANOVA: Comparing Multiple Groups
- One-way ANOVA will be approached from a t-test perspective initially.
- Example: Comparing weight gain in chicks fed four different diets.
- Experimental Design:
- 20 chicks are randomly assigned to four diet groups.
- Weight gain is measured after one week.
- The goal is to determine which diet is most effective.
The Problem with Multiple T-Tests
- To compare all diets, multiple t-tests would be needed, comparing all possible pairs.
- However, performing multiple t-tests on the same dataset increases the probability of finding a significant difference by chance.
- With six t-tests and a 95% confidence interval (5% significance level), the probability of detecting a false positive increases to 26%.
- This is the fallacy of multiple t-tests: repeated probability calculations inflate the likelihood of error.
- The goal is to maintain a 95% confidence interval across the entire analysis.
- Solution: Use ANOVA, which compares all groups at once.
How ANOVA Works
- ANOVA computes two types of differences:
- Between-group differences (between-effects).
- Within-group differences.
- Between-group differences are assessed by comparing the mean values of the groups.
- Within-group differences are assessed by computing the variance within each group.
Null and Alternative Hypotheses in ANOVA
- ANOVA determines whether the data can be better explained by an overall mean or by separate group means.
- Null Hypothesis: The overall mean is sufficient to explain the data.
- Alternative Hypothesis: Group means provide a better explanation of the data.
ANOVA Terminology
- Treatment/Factors: Categorical variables used to predict responses (e.g., diet types).
- Levels: The different categories within a treatment (e.g., four different diets).
- Samples: Replicates within each group (e.g., the number of chicks per diet).
- Observations: The measured values (e.g., weight gain of individual chicks).
- Replicates: Independent repetitions (e.g., each chick is a replicate).
Model Equation and Assumptions
- Model equation: Observations are defined by the means and error terms of each group.
- Model Assumptions:
- Normality: Samples within each group are normally distributed.
- Equal Variances: Each group has equal variance.
Assessing Normality
- Box plots can be used to visually assess normality.
- ANOVA is robust against violations of normality.
- Histograms are problematic for assessing normality in ANOVA unless separate histograms are plotted per group.
- Shapiro-Wilk tests can be used, but be cautious of their reliability, especially with complex designs.
- Residuals should be examined for proper normality testing (discussed later).
Assessing Equal Variances
- Calculate the ratio of the largest standard deviation to the smallest standard deviation; if less than two, the assumption is met.
- Bartlett's test can also be used, but it is unreliable if data is not normal.
- Residual Plots will eventually be used.
Hypothesis Testing in ANOVA
- Null Hypothesis: All group means are equal (overall mean represents the data).
- Alternative Hypothesis: At least two group means are different (not all are equal).
- ANOVA can only indicate that at least two groups differ; post-hoc tests are needed to determine which groups specifically differ.
Variability in ANOVA
- ANOVA analyzes variances to determine if differences are due to treatments or random factors.
- Mathematically, variances are partitioned:
- Treatment Sums of Squares (SST).
- Residual Sums of Squares (SSE).
- Total Sums of Squares (SSTO).
- Equation: SSTO = SST + SSE
- Degrees of Freedom are critical for understanding the ANOVA table.
The ANOVA Table
- The ANOVA table summarizes the experiment, including treatment, residual, total, and associated numbers.
- The goal is to determine if differences are due to treatment or random effects by examining the ratio of treatment effect to random effect.
- Differences are represented as Mean Squares (MS).
- The F-statistic (F stat) is the ratio of treatment mean squares to residual mean squares: F = \frac{MST}{MSE}
- The F-statistic is related to the t-test statistic.
- Total Sum of Squares (SSTO): Measures total variation.
- SSTO = \sum (xi - \bar{x})^2 where xi are data points and \bar{x} is the overall mean.
- Treatment Sum of Squares (SST): Measures variation between treatments.
- Residual Sum of Squares (SSE): Measures random variation within treatments.
Understanding the F-Statistic (F stat)
- The F-statistic is a ratio that explains the entire analysis.
- It indicates how much greater the variance attributed to treatment is compared to the residual variance.
- Example: An F-statistic of 6.65 means the treatment variation is 6.65 times higher than the residual variation.
- The F-statistic is used to calculate a p-value, which determines if the differences are statistically significant.
- This significant dictates a difference in at least two levels of a treatment.
Interpreting Variability
- Sums of squares indicate variability.
- Mean squares standardize the data to the number of samples, reflecting the accuracy of the measure.
Post-Hoc Tests
- If ANOVA shows a significant difference, post-hoc tests are used to determine where the differences lie.
- 95% confidence intervals are computed for each group. Overlapping confidence intervals indicate no significant difference.
- Estimated marginal means are used to calculate confidence levels.
- Visual techniques, such as plotting confidence intervals, can aid in comparison.
- Example Interpretation: If the confidence interval for diet 4 does not overlap with those of diets 1, 2, and 3, diet 4 is significantly different.
Review
- ANOVA is a method for determining if differences exist, while post-hoc testing drills into the specifics.
- Concepts will be repeated and expanded upon in future lectures on two-way and multifactorial ANOVA.
Key Takeaways
- ANOVA moves beyond t-tests to compare multiple groups simultaneously.
- Focus shifts towards understanding and communicating results rather than manual calculations.