Christoph: Experimental Design

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/144

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

145 Terms

1
New cards

What are the 5 components of an experiment?

  • Question

  • Design

  • Execute

  • Statistical Analysis

  • Interpret Results

2
New cards

Can you fix a bad design with good data analysis?

No, poor experimental design limits your ability to draw valid conclusions.

3
New cards

What is a mensurative experiment?

An observational study with no treatment or manipulation; only measurements are taken.

4
New cards

Example of a simple mensurative experiment?

Measuring leaf decomposition by weighing leaf bags before and after submersion.

5
New cards

What is a manipulative experiment?

An experiment that includes a treatment and a control to assess effect.

6
New cards

What is pseudoreplication?

Using non-independent samples as if they are replicates, inflating statistical power.

7
New cards

Example of spatial pseudoreplication?

Measuring multiple spots in the same pool and treating them as independent samples.

8
New cards

Example of temporal pseudoreplication?

Measuring the same pool every week and treating each measurement as independent.

9
New cards

What is sacrificial pseudoreplication?

When true replicates exist, but internal measurements are treated as replicates instead of averaging.

10
New cards

Key principles of manipulative experimental design?

  • Randomization

  • Replication

  • Interspersion of treatments

  • Independence of treatments

11
New cards

What is interspersion?

Spreading out treatments to avoid confounding spatial patterns.

12
New cards

What does Hurlbert (1984) say about pseudoreplication?

Half of ecological papers had some form of pseudoreplication.

13
New cards

What was wrong with the skink/gecko predator study?

Only one replicate per treatment and confounded with agricultural land use.

14
New cards

What was the main issue in the stream invertebrate restoration study?

Lack of replication – only one restored and one natural reach.

15
New cards

Why was the mayfly drift experiment limited?

Unrealistic conditions (no algae) and not enough mayflies used.

16
New cards

What is a repeated-measures design?

Same subjects measured multiple times (e.g. before and after treatment).

17
New cards

What is a nested design?

Hierarchical sampling (e.g., sites → riffles → stone groups → stones).

18
New cards

What characterizes a split-plot design?

Whole plot and subplot factors with randomization at different levels; complex ANOVA required.

19
New cards

What is a BACIP design?

Before-After-Control-Impact design with multiple time points after the impact.

20
New cards

What is MBACI(P)?

Multiple BACI sites, analyzed using repeated and nested ANOVA.

21
New cards

When is a gradient design used?

When treatments are continuous (e.g., farming intensity from 0–100%).

22
New cards

How are gradient designs analyzed?

Using regression or generalized linear models with model selection.

23
New cards

How do you avoid a poor experimental design?

  • Diagram the layout

  • Use true replication

  • Randomize treatments

  • Describe stats analysis

  • Discuss limitations if replication is impossible

24
New cards

What is a sample in statistics?

A collection of randomly selected, independent observations from a defined statistical population.

25
New cards

What is a statistical population?

The full set of possible observations of interest.

26
New cards

Why must samples be randomly selected?

To avoid sampling bias and ensure validity.

27
New cards

Why must samples be independent?

So that each observation gives unique, unbiased information about the population.

28
New cards

Name 6 common sample summary statistics.

  • Mean

  • Variance

  • Standard Deviation

  • Standard Error

  • Confidence Limits (Intervals)

  • Median (plus quartiles)

29
New cards

What is the formula for variance?

(∑(yi​−yˉ​)2)/(n−1)

30
New cards

What is standard deviation?

The square root of variance; shows spread of values around the mean.

31
New cards

What is standard error?

SE = SD / √n
It measures how well the sample mean estimates the population mean.

32
New cards

What do 95% confidence intervals tell you?

The range where the true population mean lies with 95% probability.

33
New cards

What assumptions do mean, SD, SE, and CI rely on?

That data are normally distributed.

34
New cards

What does the Central Limit Theorem state?

That with large samples, sample means are normally distributed even if the population is not.

35
New cards

What if ecological data aren’t normally distributed?

Use median and quartiles instead of mean and SD.

36
New cards

Are non-overlapping SDs equivalent to statistical tests?

No; even 95% CIs are only equivalent in very simple designs (like 1-factor ANOVA).

37
New cards

What is statistical robustness?

The ability of a test to yield valid results even when assumptions are slightly violated.

38
New cards

Is robustness a good thing for ecologists?

Yes, but assumptions still need to be checked.

39
New cards

What is exploratory data analysis?

Preliminary checks done before formal statistical analysis.

40
New cards

Why do exploratory data analysis?

  • Check data quality

  • Detect errors

  • Ensure assumptions are met

  • Identify outliers

41
New cards

What tools are used in exploratory data analysis?

  • Histograms (for normality)

  • Boxplots (for variance, outliers, normality)

42
New cards

Name 4 key parametric assumptions.

  • Normality

  • Equal variances

  • No outliers

  • Linearity (in regression/ANOVA)

43
New cards

What transformations help normalize data?

Log, arcsin(√x), 4th root.

44
New cards

When should you decide to transform data?

After EDA but before running formal tests.

45
New cards

Why can transformation be controversial?

It changes the scale and the null hypothesis.

46
New cards

How do you test for equal variances?

With Levene’s test (but be cautious – it also reacts to non-normality).

47
New cards

Why is equal variance more important than normality?

Tests are more robust to non-normality than unequal variances, especially when sample sizes differ.

48
New cards

What to do if variances increase with the mean?

Try log or 4th root transformations.

49
New cards

How to deal with outliers ethically?

  • Double-check for errors

  • Use a priori criteria

  • Compare analysis with and without them

  • Avoid removing them just to get significance

50
New cards

Parametric tests are based on...?

Measured values and strong assumptions (normality, equal variance, etc.)

51
New cards

Non-parametric tests are based on...?

Ranks of values; fewer assumptions.

52
New cards

Pros and cons of parametric tests?

Pros: More powerful, more test options
Cons: More assumptions

53
New cards

Pros and cons of non-parametric tests?

Pros: Fewer assumptions, more robust
Cons: Less powerful, fewer tests, lose detail

54
New cards

What are GLMs good for?

Handling non-normal data using specific link functions (e.g. Poisson, binomial).

55
New cards

Are GLMs a cure-all?

No — ecological data can be too irregular, still may need transformations.

56
New cards

Are zeros missing values?

No! Zeros are valid observations.

57
New cards

Why are missing values problematic?

They cause unequal sample sizes, reducing test robustness.

58
New cards

What are (bad) ways people deal with missing data?

  • Deleting other observations to balance

  • Substituting values (not recommended)

59
New cards

What is the null hypothesis (H₀)?

Usually that there is no difference between population variables (e.g., means).

60
New cards

What is the alternative hypothesis (Hₐ)?

Must be true if H₀ is rejected; not formally tested but assumed true by default.

61
New cards

What are the 6 steps of hypothesis testing?

  1. Specify H₀, Hₐ, and test statistic

  2. Choose significance level (usually α = 0.05)

  3. Collect data and calculate test statistic

  4. Compare test stat to its null distribution

  5. If p < α → reject H₀ (significant)

  6. If p ≥ α → fail to reject H₀ (non-significant)

62
New cards

What are degrees of freedom (df)?

Number of values that are free to vary; df = n - 1

63
New cards

What is a Type I error?

False positive – rejecting H₀ when it's actually true.

64
New cards

What is a Type II error?

False negative – failing to detect an effect that actually exists.

65
New cards

What is β (beta)?

The probability of making a Type II error.

66
New cards

What is statistical power?

Probability of detecting a true effect (Power = 1 - β); should be > 0.80.

67
New cards

Name 5 factors that influence statistical power

  1. Effect size

  2. Sample size

  3. Variance

  4. Significance level (α)

  5. Statistical test used

68
New cards

What is effect size?

The magnitude of the difference or effect you're trying to detect.

69
New cards

Why is power analysis done a priori?

To determine the sample size needed to detect a meaningful effect before data collection.

70
New cards

Why is post hoc power analysis done?

To explain why non-significant results occurred, based on known sample size and variance.

71
New cards

What does traditional hypothesis testing focus on?

Testing H₀ using p-values; Hₐ supported by default if p is small.

72
New cards

What do information-theoretic (AIC) or Bayesian approaches focus on?

Comparing multiple models (hypotheses) and estimating strength of evidence for each.

73
New cards

When is correlation used?

To test relationships between two continuous variables (no predictor/response distinction).

74
New cards

What is the Pearson correlation coefficient (rₚ)?

Measures strength/direction of a linear relationship (parametric).

75
New cards

What are some possible values for rₚ?

  • +1: perfect positive

  • 0: no relationship

  • -1: perfect negative

76
New cards

What is the non-parametric equivalent of Pearson correlation?

Spearman rank correlation (rₛ)

77
New cards

When is simple linear regression used?

When one variable predicts another; defines a line using the least squares method.

78
New cards

What are residuals in regression?

The differences between observed and predicted y-values.

79
New cards

What are 3 aims of regression?

  1. Test linear relationship

  2. Quantify variation in y explained by x

  3. Predict new y-values (less common in ecology)

80
New cards

What is R² and how is it interpreted?

Proportion of variation in y explained by x.

  • <0.10: trivial

  • 0.10: weak

  • 0.30: moderate

  • 0.50: strong

81
New cards

What assumptions does regression make?

  • Normality

  • Equal variances

  • No outliers

  • Independence

82
New cards

What is Cook’s Distance?

Measures influence of a point based on leverage and residuals.

83
New cards

When is multiple linear regression used?

When there are two or more continuous predictor variables.

84
New cards

What is Partial Eta Squared?

% of variation in y uniquely explained by a predictor, controlling for other predictors.

85
New cards

What is collinearity and why is it a problem?

When predictor variables are highly correlated — inflates variance, affects estimates.

86
New cards

How to test for collinearity?

  • Tolerance > 0.1

  • VIF < 10

87
New cards

How to fix collinearity?

  • Drop one predictor

  • Center variables (subtract their means)

88
New cards

What are interaction terms in regression?

Combine predictors to see if their joint effect differs from individual effects (e.g., X₁ * X₂).

89
New cards

When is polynomial regression used?

When the relationship between x and y is non-linear.

90
New cards

What does polynomial regression often involve?

Adding terms like X² or X³ to the model.

91
New cards

What is a Chi-Square test used for?

Testing frequencies of categorical variables against expected distributions.

92
New cards

What are contingency tables?

Tables showing frequencies for combinations of two or more categorical variables.

93
New cards

Assumptions of Chi-Square tests?

  • No more than 20% of expected frequencies < 5

  • Observations are independent

94
New cards

Why are contingency tables limited?

Can’t test interactions — better to use a GLM (e.g., Poisson regression) or ANOVA on percentages.

95
New cards

When is One-Way ANOVA used?

When testing the effect of 1 categorical predictor (2+ groups) on 1 continuous response variable.

96
New cards

What are the two goals of One-Way ANOVA?

  1. Assess how much variation is explained by group differences.

  2. Test whether group means are equal (H₀: all means equal).

97
New cards

What does the ANOVA table partition?

  • Total sum of squares (SStotal) = explained + residual variation

  • Between-group variation: df = number of groups - 1

  • Within-group (residual) variation: unexplained

98
New cards

What affects statistical power in One-Way ANOVA?

  • Increases with total sample size

  • Decreases as the number of groups increases (if sample size is fixed)

  • More replicates per group = more power

99
New cards

What are Post Hoc Tests (e.g., Tukey HSD) used for?

To determine which group means differ after a significant overall ANOVA.

100
New cards

What are the assumptions of One-Way ANOVA?

  • Normality

  • Homogeneity of variances (very important!)

  • No outliers

  • Independence