Lecture Notes
Pairwise Comparisons & Interaction Plots
- Revisiting the interaction with witchweed and fertilizer.
- Use emmeans to produce interaction plots.
- Differences can be subtle but significant; observe changes in slopes.
- Plots guide recommendations for fertilizer or weed control.
Tukey's Tests
- Focus on the two-way interaction.
- P-values are adjusted (family-wise error rate).
- Error rates add up quickly with multiple combinations.
- Effect sizes can still be substantial.
Factorial Designs: Power & Efficiency
- Address multiple research questions at once.
- Reduces Type I errors.
- Better understanding of mechanisms via interaction analysis.
- More representative of real-world scenarios.
Key Concepts
- Main effect vs. interaction.
- Reading ANOVA tables (start from the bottom).
- If interaction is significant, focus on it.
- Avoid simply dumping significant results from the ANOVA table.
Model Assumptions for ANOVA
- Easier to check with residuals, especially in complex models.
- Graphical approaches: residuals vs. fitted, QQ plots.
- Use residuals from ANOVA model for plots.
- Testing assumptions with raw data is penalized; use residuals.
Workflow & Confidence Intervals
- Extra notes include confidence intervals on emmeans plots.
- Confidence intervals can reveal where differences lie.
- Example: Levels of nitrogen, varieties of rice.
- Overlapping confidence intervals indicate non-significance.
Assumptions of Maize Example
- Fitted vs. residual plots test equal variance.
- QQ plots assess normality.
- Real-world data can be messy; normality may be questionable.
- Transformations (e.g., log) might worsen other assumptions.
ANOVA Robustness
- ANOVA robust against departures from normality, especially with balanced designs (equal replication).
- Less robust against unequal variances.
- If transformation fails, cite literature (e.g., Tony Underwood) to justify proceeding with caution.
- Acknowledge increased risk of Type I errors and interpret results cautiously.
Transformations
- Transform with purpose.
- Log transformation for count data.
- Square root for harsher transformation.
- Arc-sine for proportion data (percentages).
- Avoid excessive transformations that make data unrecognizable.
- If transformations fail, acknowledge and proceed with caution.
Alternative Tests
- Kruskal-Wallis test for simple one-way ANOVA designs only.
Project introduction
Project 2 Details
- Focus: Real-world data analysis from a provided dataset related to a published paper.
- Data Pre-processing: Some pre-processing done, dataset on Canvas (not from the paper directly).
- Task: Choose a dataset (or derive your own from the provided sheets). Formulate scientific questions answerable with the data.
- Deliverable: A full scientific paper, including analysis pipeline.
- Timeline: Start early; this is a significant task.
Data and Question Generation
- Read the original paper thoroughly.
- Identify papers that cite the original paper for inspiration.
- Questions: Can be agricultural, ecological, or a combination. Follow your interests.
- Analysis Design: Determine experimental design, treatment design, and model equation.
- Analysis Techniques: Primarily ANOVAs and multiple regressions.
- No t-tests (focus on techniques learned in the unit).
- Always analyze residuals, not raw data.
- Scientific Paper Guide
- Provided on canvas.
- Use notes from first year biology.
Writing a Scientific Paper
- Structure: Follow a set formula.
- Key Components:
- Abstract: (\approx 250 words, overview of the whole paper).
- Introduction: Context, background, and justification. What's the question?
- Methods: Experimental design, treatments, and analysis.
- Results: Present data, not interpret it.
- Discussion: Interpret your results and relate them back to the question and other studies.
- References: Cite appropriately.
Data Set & Template
- Metadata Sheet: Included in the Excel file, explains the columns (a 'read me' for the data).
- Data papers - publish the data set for other to use.
- Templates: Provided in Quattro or R Markdown (use R Markdown if Quattro rendering issues persist). Word document is also an option.
- Embed resources: Important to set as TRUE to see renders when using Quattro.
AI Use
- AI tools can be used in coding and plots (improving). AVOID using it to do interpretations, do reports for you or doing analysis for you.
- Acknowledge if used. Include in referencing section.
- State prompts and how verified.
References & Appendices
- References: Follow journal style guide; be consistent.
Primary literature (journal articles, textbooks, reputable web pages. - Appendix: Supplementary information (optional).
Example: Plots about testing the ANOVA model assumptions.