Experimental Design Exam 1

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/86

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:53 PM on 10/6/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

87 Terms

1
New cards

Statistics

are two samples drawn from the same population?

gives info to help explain variation.

2
New cards

Biological differences

how large is the difference and is it important to biological function

3
New cards

Laplace and Gauss (1700s)

Created Normal Distribution and Central limit theorem

4
New cards

Normal Distribution

continuous probability distribution (bell curve)

Mean and median will be the same number

5
New cards

Central limit theorem

in independent and identically distributed samples, the mean tends towards the normal distribution even if original variables are not normally distributed created by Laplace.

6
New cards

Galton and Pearson

Created correlation testing and chi-squared test. the second guy was the first to do formal statistical testing in the 1800s

7
New cards

Correlation

linear correlation between two sets of data, covariance of two variables and the product of their standard deviations (-1 to 1 perfect)

8
New cards

Chi-squared test

applied sets of categorical data to evaluate how likely that any observed difference between sets arose by chance (p value test)

9
New cards

Gosset and Fisher

T-test, ANOVA, Experimental design

T test started with the one who worked at a Guinness factory and published his work under the pseudonym “student”

was known as the “King of Genetics” he published 100s of articles on genetics.

10
New cards

Experimental Units (EU)

Able to retrieve treatments i.e (A tank of water that has mercury added: the mercury is the treatment)

Some type of container (Box, tank, classroom, etc)

Differences in the unit isn’t caused by the treatment.

11
New cards

Measurement Unit (MU)

What is being measured. Units inside the container (Fish in a tank)

12
New cards

Problem with confusing MU for EU

Over optimistic analysis, rejection of H0 , C.I. will be too short

13
New cards

Treatment Effect

What is being changed in an experiment

14
New cards

Degrees of freedom

maximum number of logically independent variables which may vary in a sample

15
New cards

Randomization

  • A guard against situational errors

  • requires identification of all the objects that might be selected

  • Used in selection in the field and setting up the experiment

  • Can be used when their is a limited sample

  • produces correct independent error terms

  • reduces bias in selection

16
New cards

Skew

direction a normal distribution (bell curve) graph leans

17
New cards

Left skew

√b1 < 0

18
New cards

No skew

√b1 = 0

19
New cards

Right skew

√b1 > 0

20
New cards

Kurtosis

The peakedness of the normal distribution graph (bell curve)

Broad v Narrow

21
New cards

Platykurtic - broad

b2 < 3

22
New cards

mesokurtic

b2 = 3

23
New cards

leptokurtic

b2 > 3

24
New cards

Random Selection with Replacement

The Best form of Randomization

  1. tag entire population of interest, so every individual has a unique identifying number

  2. random selection of individuals (cards, dice, random number table)

  3. measure the selected individual

  4. replace the individual back in the population before makin the next selection

  5. the probability of selection remains constant

25
New cards

Fisher has said

“Random assignment of treatments is required”

26
New cards

In a t-test; as N increases

t decreases, and SE (standard error) decreases

27
New cards

Replication

Most important part of experimental design

Most messed up

28
New cards

Experimental Design

Randomization + Replication

  1. Plan Experiment ← Chose stats analysis then

  2. Carry out experiment - data collection (where experimenters typically go wrong)

  3. Run stats on data

29
New cards

Factor

Treatment - Independent variable - x-axis

30
New cards

Variable

Measured - dependent variable - y-axis

31
New cards

Variation

is everywhere in the real world.

Includes Measurement error (ME), Natural inherent variation, variation in individuals, variation in mean values

Means by themselves explains nothing.

32
New cards

Natural Inherent variation

Genetics, biological processes, environmental history

Reduced through Narrow selection, acclimate individuals (set individuals in a lab for a couple of weeks)

How to deal with what’s left: increase sample size, use blocking in analysis

33
New cards

Variation in individuals

Sample standard deviation

34
New cards

Variation in mean values

Sample of means standard deviation, standard error of the mean (SEM)

35
New cards

Simple Pseudoreplication

There are single replicates of EU per treatment with nested MU

Shouldn’t run stats - Observation tests only (due to lack of significant difference)

Happens with large scale experiments

<p>There are single replicates of EU per treatment with nested MU</p><p>Shouldn’t run stats - Observation tests only (due to lack of significant difference)</p><p>Happens with large scale experiments</p>
36
New cards

Sacrificial Pseudoreplication

true replication of treatments but data for replicates are pooled for statistical analysis (sacrifices independence by using individuals as EU not MU) confuses MU for EU which gives a larger n value leading to a wrong df → wrong df error term → wrong MS Error term  → wrong F term → incorrect P value which would create an incorrect interpretation and assumptions.

<p>true replication of treatments but data for replicates are pooled for statistical analysis (sacrifices independence by using individuals as EU not MU) confuses MU for EU which gives a larger n value leading to a wrong df → wrong df error term → wrong MS Error term&nbsp; → wrong F term → incorrect P value which would create an incorrect interpretation and assumptions.</p>
37
New cards

Temporal Pseudoreplication

multiple samples from each EU (one per treatment) are not taken simultaneously but instead over several dates

Treats each measurement as a separate EU.

Don’t treat successive dates as independent replicates of a treatment

  • no longer independent - same sample multiple times

  • need multiple EU over time

<p>multiple samples from each EU (one per treatment) are not taken simultaneously but instead over several dates</p><p>Treats each measurement as a separate EU.</p><p>Don’t treat successive dates as independent replicates of a treatment</p><ul><li><p>no longer independent - same sample multiple times</p></li><li><p>need multiple EU over time</p></li></ul><p></p>
38
New cards

Implicit Pseudoreplication

When Standard Error and Confidence Interval is reported along with their means and discuss the effects of the treatments, but there is no application of any direct tests of significance. It is psuedoreplication when the experimenter does not give a specific disclaimer acknowledging that their data is inadequate for assessing treatment effects. 

If only standard deviation is presented than it isn’t psuedoreplication.

39
New cards

SE of a sample

is an estimate of the variation of means

40
New cards

Difference between Treatment

EU - variation regardless of treatments

MU - variation among MU regardless of treatment

TE - variation caused by treatment effect

41
New cards

Difference within treatment

EU - variation among EU

MU - variation among MU

42
New cards

Independent Error Terms

Is all based on design

43
New cards

HOV (draw table)

Variances of treatments are similar for test of means to be accurate

MS (mean square) - variance within treatments

  • one number to tell if treatments are different

Uses pooled variance in calculation

is an assumption because all variances have to all be the same for it to be accurate if one number is off  the resulting calculations could all be off i.e. if the F-value is off the P-value is off.

44
New cards

ANOVA table

Top row - between treatments

Bottom row - within treatments (error)

Represents the variation of means among treatments.

Change in the variance is an indication in the change the biologist made

Change in the mean might not happen

Variance carries biological information from the outcome.

45
New cards

P values

Value less than alpha reject H0 if larger don’t reject H0

Represents the probability of getting a more extreme result than what the data set says.

Area under the curve outside of the data set.

Used when you design an experiment on purpose the value can be used for interpretation.

Change when there is a change in sample size

46
New cards

Type 1 error

alpha: reject H is wrong (usually happens if p value is very large

to lower increase n

47
New cards

Type 2 error

Beta - fail to reject H0 is wrong

48
New cards

Random Selection without Replacement

  1. tag entire population of interest, so every individual has a unique identifying number

  2. random selection of individuals (cards, dice, random number table)

  3. measure the selected individual

  4. do not replace the individual back in the population before making the next selection

  5. the probability of selection changes, but in a predictable way

49
New cards

Haphazard Selection

  1. do not tag entire population, or any of the population of interest

  2. no random selection, pick up whatever you find, or buy things from supply houses

  3. measure individual

  4. replacement is irrelevant, but usually does not happen

  5. there is no probability of selection

50
New cards

J

bias correction factor

used to adjust for sample size

51
New cards

One-Sample t-Test

Tests a single factor at only one treatment level/sample against a null test mean μ: mean of Column A1 = null test mean μ

Ex: A factory is measuring if the average box of cereal contains 24oz cereal compared to the hypothesized value of 24 oz.

52
New cards

√s2/ n

level of variation

53
New cards

Effect size

The replacement for p for non-strict science journals

Larger number more effect there is

opposite direction of p-value

< 0.2 : small, around 0.5 in the middle, > 0.8 : large, and >2.0 : very large

Does not change with sample size

54
New cards

S2

Variation within

Further variance gets less accurate

55
New cards

One-factor Completely Randomized ANOVA

One Factor with multiple treatment levels

How you randomize defines the design

How you design defines how you analyze

Step 1: Randomly assign treatments to EU matching one list of treatment numbers with one list of EU numbers in random order → design

Step 2: Randomly assign MUs to EUs matching one list of MU numbers with one list of EU numbers in random order → helps remove bias

H0: mean of each column is the same as the other.

Normality and HOV are dependent of the data and can be tested for and adjusted after completion of the experiment

56
New cards

One-factor Completely Randomized ANOVA Assumptions

Assumptions: The data points within each column (treatment group) are from randomly drawn individuals, and are normally distributed

The data points are independent of one another within and between columns (treatment groups) (the within relates to independent replication of EU within each treatment; the between relates to the completely randomized design across all treatments).

The variances of the columns (treatment groups) are similar

57
New cards

One-Sample t-test Assumption

Assumptions: The data points are from randomly drawn individuals, normally distributed, and independent of one another

58
New cards

SS

sum of squares of a column

start of calculations for variation

59
New cards

ω2 (omega)

the effect size given is a biased-corrected estimate of effect size compared to η2

60
New cards

η2 (eta)

effect size estimate for effect or factor of interest

gives the percent amount of how much a Factor is a part of the variation in SS in the ANOVA

The more of the variation in the ANOVA table that a factor explains the more important

61
New cards

Two-Sample (Independent) t-Test

Used when there is only two treatment groups for one factor

The design and set up is identical to the One-Factor Completely Randomized ANOVA.

Effect size is calculated differently.

H0: mean of Column A1 = mean of Column A2

62
New cards

Welch t-test

Used when there are only two treatment groups for one factor.

t-value uses separate variance instead of the pooled variance.

H0: mean of Column A1 = mean of Column A2

63
New cards

Welch t-test assumption

Same as One-Factor Completely Randomized ANOVA except homogeneity of variance is not required.

64
New cards

Two-Sample (Independent) t-Test

Same as One-Factor Completely Randomized ANOVA

65
New cards

Cohen’s d*s

effect size estimate based on the average variance

66
New cards

Hedges g*

Cohen’s d*s bias correction factor

67
New cards

Normal probability plots

If all dots line up in a straight line you have normal data

<p>If all dots line up in a straight line you have normal data</p>
68
New cards

Population

The total number individuals or items in which a sample is drawn from.

69
New cards

Sample

A subset of the population of which you can gain mean values and other data from.

70
New cards

C.I.

As n increases the range becomes more narrow

The percentage is determined by the chances of being wrong and the consequences

EX: 95% means 95 will hit out of 100

Is determined from the standard error and the t-value

71
New cards

Standard Error

Generated from the SD

Is how variable mean values can be if you sample multiple times

72
New cards

df group

related to the number of columns (treatment groups) in an experiment.

k-1

Used to find MS group

73
New cards

Continuous 

There is no gap in measurement

Ex: lengths and widths

74
New cards

Discrete

There are gaps

Whole numbers no decimals

Ex: spines on an insect

75
New cards

Accuracy

How close to the truth (mean)

76
New cards

Precision

How close multiple measurements are to each other (variance)

77
New cards

Resolution

How small of a difference can be recorded (decilmals)

78
New cards

Measurement Error

Reduced by practice, instrument calibration, stable environment

Variation due to limitation of people, instruments, conditions; i.e. what are the conditions of the time you are taking measurements

How to deal with what’s left: Ignore it, take multiple measurements → average, multiple measures - nesting analysis

79
New cards

Standard Deviation

Represents the variation in the sample (measure of how variable numbers are in a sample)

80
New cards

Sample of means

When multiple samples are drawn from the same population and means are given for each sample.

The mean of these values is the most accurate representation of the population.

81
New cards

df error

relates to the the number of independently replicated EUs

N-k

Helps give MS error

82
New cards

MS error

SS error/ df error

83
New cards

MS group

SS group / df group

84
New cards

df total

N -1

Total number of EUs

85
New cards

F-statistic

MS column/ MS error

TE+ EU + MU/ EU + MU

  • Everything cancels out to just leave TE

    • If others don’t cancel, wrong analysis of treatment effect and generates wrong F term (typically caused by improper replication)

Follows an approximately normal distribution.

86
New cards

MS (mean square)

is the variation within and between treatments.

Serves as common variance term in an ANOVA test.

87
New cards