Experimental Design Exam 1

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/88

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

89 Terms

1
New cards

Statistics

are two samples drawn from the same population.

gives info to help explain variation.

2
New cards

Biological differences

how large is the difference and is it important to biological function

3
New cards

Laplace and Gauss (1700s)

Created Normal Distribution and Central limit theorem

4
New cards

Normal Distribution

continuous probability distribution (bell curve)

Mean and median will be the same number

5
New cards

Central limit theorem

independent and identically distributed samples, the mean tends towards the normal distribution even if original variables are not normally distributed created by Laplace.

6
New cards

Galton and Pearson

Created correlation testing and chi-squared test. the second guy was the first to do formal statistical testing in the 1800s

7
New cards

Correlation

linear correlation between two sets of data, covariance of two variables and the product of their standard deviations (-1 to 1 perfect)

8
New cards

Chi-squared test

applied sets of categorical data to evaluate how likely that any observed difference between sets arose by chance (p value test)

9
New cards

Gosset and Fisher

T-test, ANOVA, Experimental design

T test started with the one who worked at a Guinness factory and published his work under the pseudonym “student”

was known as the “King of Genetics” he published 100s of articles on genetics.

10
New cards

Experimental Units (EU)

Able to retrieve treats i.e (A tank of water that has mercury added: the tank is the EU)

Some type of container (Box, tank, classroom, etc)

11
New cards

Measurement Unit (MU)

What is being measured. Units inside the container (Fish in a tank)

12
New cards

Problem with confusing MU for EU

Over optimistic analysis, rejection of H0 , C.I. will be too short

13
New cards

Treatment Effect

What is being changed in an experiment

14
New cards

Degrees of freedom

maximum number of logically independent variables which may vary in a sample

15
New cards

Randomization

  • A guard against situational errors

  • requires identification of all the objects that might be selected

  • Used in selection in the field and setting up the experiment

  • Can be used when their is a limited sample

16
New cards

Skew

direction a normal distribution (bell curve) graph leans

17
New cards

Left skew

√b1 < 0

18
New cards

No skew

√b1 = 0

19
New cards

Right skew

√b1 > 0

20
New cards

Kurtosis

The peakedness of the normal distribution graph (bell curve)

Broad v Narrow

21
New cards

Platykurtic - broad

b2 < 3

22
New cards

mesokurtic

b2 = 3

23
New cards

leptokurtic

b2 > 3

24
New cards

Random Selection with Replacement

The Best form of Randomization

  1. tag entire population of interest, so every individual has a unique identifying number

  2. random selection of individuals (cards, dice, random number table)

  3. measure the selected individual

  4. replace the individual back in the population before makin the next selection

  5. the probability of selection remains constant

25
New cards

Fisher has said

“Random assignment of treatments is required”

26
New cards

In a t-test; as N increases

t decreases, and SE (standard error) decreases

27
New cards

Replication

Most important part of experimental design

Most messed up

28
New cards

Experimental Design

Randomization + Replication

  1. Plan Experiment ← Chose stats analysis then

  2. Carry out experiment - data collection (where experimenters typically go wrong)

  3. Run stats on data

29
New cards

Factor

Treatment - Independent variable - x-axis

30
New cards

Variable

Measured - dependent variable - y-axis

31
New cards

Variation

is everywhere in the real world.

Includes Measurement error (ME), Natural inherent variation, variation in individuals, variation in mean values

Means by themselves explains nothing.

32
New cards

ME

due to limitations of people, instruments, and environment

33
New cards

Natural Inherent variation

Genetics, biological processes, environmental history

34
New cards

Variation in individuals

Sample standard deviation

35
New cards

Variation in mean values

Sample of means standard deviation, standard error of the mean (SEM)

36
New cards

Simple Pseudoreplication

There are single replicates of EU per treatment with nested MU

Shouldn’t run stats - Observation tests only (due to lack of significant difference)

Happens with large scale experiments

<p>There are single replicates of EU per treatment with nested MU</p><p>Shouldn’t run stats - Observation tests only (due to lack of significant difference)</p><p>Happens with large scale experiments</p>
37
New cards

Sacrificial Pseudoreplication

true replication of treatments but data for replicates are pooled for statistical analysis (sacrifices independence by using individuals as EU not MU) confuses MU for EU which gives a larger n value leading to a wrong df → wrong df error term → wrong MS Error term  → wrong F term → incorrect P value which would create an incorrect interpretation and assumptions.

<p>true replication of treatments but data for replicates are pooled for statistical analysis (sacrifices independence by using individuals as EU not MU) confuses MU for EU which gives a larger n value leading to a wrong df → wrong df error term → wrong MS Error term&nbsp; → wrong F term → incorrect P value which would create an incorrect interpretation and assumptions.</p>
38
New cards

Temporal Pseudoreplication

multiple samples from each EU (one per treatment) are not taken simultaneously but instead over several dates

Treats each measurement as a separate EU.

Don’t treat successive dates as independent replicates of a treatment

  • no longer independent - same sample multiple times

  • need multiple EU over time

<p>multiple samples from each EU (one per treatment) are not taken simultaneously but instead over several dates</p><p>Treats each measurement as a separate EU.</p><p>Don’t treat successive dates as independent replicates of a treatment</p><ul><li><p>no longer independent - same sample multiple times</p></li><li><p>need multiple EU over time</p></li></ul><p></p>
39
New cards

Implicit Pseudoreplication

When Standard Error and Confidence Interval is reported along with their means and discuss the effects of the treatments, but there is no application of any direct tests of significance. It is psuedoreplication when if the experimenter does not give a specific disclaimer acknowledging that their data is inadequate for assessing treatment effects. 

If only standard deviation is presented than it isn’t psuedoreplication.

40
New cards

SE of a sample

is an estimate of the variation of means

41
New cards

Difference between Treatment

EU - variation regardless of treatments

MU - variation among MU regardless of treatment

TE - variation caused by treatment effect

42
New cards

Difference withing treatment

EU - variation among EU

MU - variation among MU

43
New cards

Independent Error Terms

Is all based on design

44
New cards

HOV (draw table)

Variances of treatments are similar for test of means to be accurate

MS (mean square) - variance within treatments

  • one number to tell if treatments are different

Uses pooled variance in calculation

is an assumption because all variances have to all be the same for it to be accurate if one number is off  the resulting calculations could all be off i.e. if the F-value is off the P-value is off.

45
New cards

ANOVA table

Top row - between treatments

Bottom row - within treatments (error)

Represents the variation of means among treatments.

Change in the variance is an indication in the change the biologist made

Change in the mean might not happen

Variance carries biological information from the outcome.

46
New cards

F

between treatments/within = TE + all other variation/all other variation = treatment effect + EU + MU / EU + MU

  • Everything cancels out to just leave TE

    • If others don’t cancel, wrong analysis of treatment effect and generates wrong F term (typically caused by improper replication)

47
New cards

P values

Value less than alpha reject H0 if larger don’t reject H0

Represents the probability of getting a more extreme result than what the data set says.

Area under the curve outside of the data set.

Used when you design an experiment on purpose the value can be used for interpretation.

Change when there is a change in sample size

48
New cards

Type 1 error

alpha: reject H is wrong (usually happens if p value is very large

to lower in crease n

49
New cards

Type 2 error

Beta - fail to reject H0 is wrong

50
New cards

Random Selection without Replacement

  1. tag entire population of interest, so every individual has a unique identifying number

  2. random selection of individuals (cards, dice, random number table)

  3. measure the selected individual

  4. do not replace the individual back in the population before making the next selection

  5. the probability of selection changes, but in a predictable way

51
New cards

Haphazard Selection

  1. do not tag entire population, or any of the population of interest

  2. no random selection, pick up whatever you find, or buy things from supply houses

  3. measure individual

  4. replacement is irrelevant, but usually does not happen

  5. there is no probability of selection

52
New cards

J

bias correction factor

used to adjust for sample size

53
New cards

One-Sample t-Test

Tests a single factor at only one treatment level/sample against a null test mean μ: mean of Column A1 = null test mean μ

Ex: A factory is measuring if the average box of cereal contains 24oz cereal compared to the hypothesized value of 24 oz.

54
New cards

√s2/ n

level of variation

55
New cards

Effect size

The replacement for p for non-strict science journals

Larger number more effect there is

opposite direction of p-value

< 0.2 : small, around 0.5 in the middle, > 0.8 : large, and >2.0 : very large

Does not change with sample size

56
New cards

S2

Variation within

Further variance gets less accurate

57
New cards

One-factor Completely Randomized ANOVA

One Factor of multiple treatment levels

How you randomize defines the design

How design defines how you analyze

Step 1: Randomly assign treatments to EU matching one list of treatment numbers with one list of EU numbers in random order → design

Step 2: Randomly assign MUs to EUs matching one list of MU numbers with one list of EU numbers in random order → helps remove bias

H0: mean of each column is the same as the other.

58
New cards

One-factor Completely Randomized ANOVA Assumptions

Assumptions: The data points withing each column (treatment group) are from randomly drawn individuals, and are normally distributed

The data points are independent of one another within and between columns (treatment groups) (the within relates to independent replication of EU within each treatment; the between relates to the completely randomized design across all treatments).

The variances of the columns (treatment groups) are similar

59
New cards

One-Sample t-test Assumption

Assumptions: The data points are from randomly drawn individuals, normally distributed, and independent of one another

60
New cards

SS

sum of squares of a column

start of calculations for variation

61
New cards

ω2 (omega)

the effect size given is a biased-corrected estimate of effect size compared to η2

62
New cards

η2 (eta)

effect size estimate for effect or factor of interest

gives the percent amount of how much a Factor is apart of the variation in SS in the ANOVA

The more of the variation in the ANOVA table that a factor explains the more important

63
New cards

Two-Sample (Independent) t-Test

Used when there is only two treatment groups for one factor

The design and set up is identical to the One-Factor Completely Randomized ANOVA.

Effect size is calculated differently.

H0: mean of Column A1 = mean of Column A2

64
New cards

Welch t-test

Used when there are only two treatment groups for one factor.

t-value uses separate variance instead of the pooled variance.

H0: mean of Column A1 = mean of Column A2

65
New cards

Welch t-test assumption

Same as One-Factor Completely Randomized ANOVA except homogeneity of variance is not required.

66
New cards

Two-Sample (Independent) t-Test

Same as One-Factor Completely Randomized ANOVA

67
New cards

Cohen’s d*s

effect size estimate based on the average variance

68
New cards

Hedges g*

Cohen’s d*s bias correction factor

69
New cards

Normal probability plots

If all dots line up in a straight line you have normal data

<p>If all dots line up in a straight line you have normal data</p>
70
New cards
71
New cards
72
New cards
73
New cards
74
New cards
75
New cards
76
New cards
77
New cards
78
New cards
79
New cards
80
New cards
81
New cards
82
New cards
83
New cards
84
New cards
85
New cards
86
New cards
87
New cards
88
New cards
89
New cards