biostats - unit 3

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/39

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

40 Terms

1
New cards

problem multivariate analysis solves

in biology, many factors simultaneously affect response variable

2
New cards

factorial experimental designs

multiple variables are tested simultaneously and every possible combination of the factors is included

3
New cards

assessing parametric assumptions

use residuals because raw data contains too many groups

4
New cards

poison distribution

data are discrete (no fractions), counting rare (random) events

5
New cards

tweedie distribution

use for data with a lot of zeroes

6
New cards

interaction

effect of one explanatory variable depends on effect of another explanatory variable

7
New cards

two-way ANOVA

have two independent categorical predictor variables and one dependent continuous response variable

8
New cards

two-way ANOVA vs normal ANOVA

two-way: effects of two independent variables on a dependent variable, and also tests for any interaction between the two

normal ANOVA: compares means of three or more groups based on the independent variable (only one independent variable)

9
New cards

hypotheses for interactions

knowt flashcard image
10
New cards

what does an interaction imply

effect of 1 variable on the response depends on the value of the 2nd variable

11
New cards

why we don’t just conduct two t-tests or two regressions separately?

leads to multiple comparisons fallacy

12
New cards

multiple comparisons fallacy

when you run too many statistical tests and forget that each test has it’s own chance of a false positive —> start finding “significant” results by random chance, not cuz they’re real

  • essentially, it inflates probability of running into a false positive

13
New cards

why we can’t just include every variable

will lead to overfitting and collinearity

14
New cards

overfitting

model is too complex and memorizes noise from training data instead of learning the true signal

  • model will fit current data well but not new data

15
New cards

collinearity

two or more predictors in model are highly correlated, making it hard to estimate the true effect of each separately

  • like asking if the sun or the temperature causes ice cream to melt more quickly…

16
New cards

what does emmeans do

reports means for us to compare in a table format

17
New cards

what does Tukey-Kramer test do

helps determine which groups were significantly different from the others (absolutely need to use if we have more than 2 levels of any explanatory variables)

18
New cards

principles of experimental design

replication, randomization, reduction of variance, manipulation vs observation, types of variables, hypotheses

19
New cards

replication

does mean trait value of our sample represent the true mean of our population?

20
New cards

randomization

does every part of our population have equal chances of being in our sample?

does every part of our sample have equal chances of being assigned to any treatment group?

21
New cards

reduction of variance

reduce noise - how can we reduce variation that is not relevant to our research question in our samples?

how can we distribute naturally occurring variation across our treatments?

22
New cards

matched pairs design

helps reduce noise —> match each participant with closest partner —> randomly assign one partner and automatically assign the other to the other group

23
New cards

manipulation vs observation

are we creating the groups or are we making comparisons between naturally occurring groups?

24
New cards

drawback of experimental design

can add structure we don’t want to our data

25
New cards

structure we want to our data

  • experimental treatment groups

  • naturally-occurring variation we are interested in studying

26
New cards

structure we don’t want in our data

  • familial, temporal, or spatial similarity

    • individual-level effects (repeated measurements on same individual)

27
New cards

non-independence

examples = structure we don’t want

means we have committed pseudoreplication

28
New cards

pseudoreplication

researchers mistakenly treat non-independent observations as if they were independent replicates in statistical analysis

  • inflates degrees of freedom and sample size

29
New cards

pairing and random effects

accounts for structure we don’t want mathematically

30
New cards

paired t-tests

pair experimental units based on shared information to distribute natural variation across treatments

31
New cards

two-sample design vs paired design

knowt flashcard image
32
New cards

random effects

simplest way to control for unwanted structure in our data in all other tests

33
New cards

fixed effects

predefined, repeatable groups with measurable magnitudes

  • treatments chosen by researcher

    • interested in magnitude of differences between groups

34
New cards

random effects

occur when groups are randomly sampled from all possible groups in the population

  • treatments are a random sample from all possible treatments

    • interested in amount of variation that can be attributed to this effect: variance components

35
New cards

variance components

tells you how much variation is due to random groups, such as families, subjects, etc.

36
New cards

fixed effects model ANOVA

focuses on which groups are different

37
New cards

random effect model ANOVA

focuses on how much does grouping adding variation?

38
New cards

paired t-test assumption check

normality of differences between the two groups

differences ← (blackbirdData$logAfterImplant - blackbirdData$logBeforeImplant)

shapiro.test(differences)

39
New cards

variation in a mixed model

more generalizable (individual identity >1 data point from particular individual or location >1 data point from a particular location)

40
New cards

how glmer controls random effects

allows each group to have it’s own intercept because of the random effect, but still allows us to check for the overall effect of our variable of interest