Biol 315

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/117

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:03 PM on 4/19/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

118 Terms

1
New cards

population

the set of all “subjects” relevant to the scientific hypothesis under examination

2
New cards

variables

characteristics that differ among individuals

3
New cards

parameters

quantities describing a population (denoted by Greek letters)

4
New cards

census

a collection of data where the population is examined

5
New cards

random sample

each and every member of the population has an equal chance of being selected and each member is selected independently of others

6
New cards

mean

denoted by a bar

7
New cards

standard deviation

denoted by an “s”

8
New cards

sample

the subset of cases selected from a statistical population that are actually examined during a particular study

9
New cards

sample statistics

calculated from the collected sample and used to estimate the population parameters (denoted by roman letters)

10
New cards

how to get a good sample

take a random sample, be unbiased an precise

11
New cards

to get a good sample

carefully define your statistical population and select a sample that is as representative of the population a possible, where each subject is selected randomly and measurements are precise

12
New cards

bad samples

volunteer sample or a sample of convenience

13
New cards

experimental study

assigning treatment randomly, creating groups, imposing change

14
New cards

observational study

relying on comparisons of already existing conditions

15
New cards

2 types of variables

numerical (quantitative) and categorical (qualitative)

16
New cards

2 types of numerical variable

interval (arbitrarty zero) and ratio (true zero)

17
New cards

2 types of categorical variables

nominal (no order) and ordinal (ordered)

18
New cards

frequency distribution

describes the number of times each value of a variable occurs

19
New cards

histograms

used for numerical data - x axis has a continuous scale, data are “binned” into continuous categories, the bins are touching

20
New cards

histograms y-axis can be

frequency (count of observations in each bin), proportion(of the total observations in each bin) and density(the proportion of the total observations per unit of the bin width)

21
New cards

location or central tendency

distributions with a different central measurement using the mean

22
New cards

spread or scale

distributions with a different spread measured using the standard deviation

23
New cards

shape or skew

distributions with a long tail on one side or the other

24
New cards

mean

arithmetic average

25
New cards

median

middle of the data

26
New cards

mode

most commonly occurring observations

27
New cards

scale

most basic (max - min), not very informative

28
New cards

variance

“expected” squared difference between an observation and the mean

29
New cards

standard deviation is

positive square root of the variance

30
New cards

what is meant by “estimation”

it’s using the sample data to learn about the popualtion

31
New cards

estimation

the process of inferring a population parameter from sample data

32
New cards

uncertainty

a situation in which something is not known; in statistics it is the error of an estimate

33
New cards

Sampling distribution

the distribution of all the values for an estimate that we might have obtained when we sampled a population

34
New cards

a 95% confidence interval is a

range of values, calculated from sample data, that would contain the true population parameter in 95 out of 100 samples if the sampling process were repeated

35
New cards

uncertainty

decreases an precision increases with sample size

36
New cards

hypothesis testing

to determine whether an estimate can be simply explained by chance or is it special

37
New cards

Null hypothesis

is a specific statement made about population for the sake of argument, forces us to take a skeptical view

38
New cards

null hypothesis is used to

create a null model, compare test statistic calculated from the sample to the model

39
New cards

H0 is rejected if

we are surprised by the test statistic

40
New cards

P means

probability of observing a test statistic as extreme as, or more more extreme than, the one observed, assuming H0 is true

41
New cards

significance level α

a probability used as a criterion for rejecting the null hypothesis

42
New cards

P-value > α

fail to reject H0

43
New cards

P-value < α

reject H0

44
New cards

two-tailed tests

deviation is either direction would reject null hypothesis

45
New cards

type I error (α)

rejecting a true null hypothesis (false posititve)

46
New cards

type II error (β)

Failing to reject a false null hypothesis (false negative)

47
New cards

Power

the probability of correctly rejecting a false H0

48
New cards

Power depends on

how different the truth is from the null hypothesis, type I error rate, and sample size

49
New cards

things to consider when designing an experiment

reduce bias and decrease sampling error

50
New cards

reduce bias

have a control group, use randomization, use blinding

51
New cards

decrease sampling error

use replication, ensure balance, use blocking, implement extreme treatments

52
New cards

Control group

units that are similar to the treatment units except that they do not receive the treatment

53
New cards

random assignment

units that are otherwise “identical” are assigned to be controls or treatments

54
New cards

blinding

concealing information about whether a participant is in the control or treatment group (single blind) and sometimes researchers (double blind)

55
New cards

replication

application of treatment ti multiple, independent experimental subjects or units

56
New cards

balance

an equal number of units in the control and treatment minimizes the sampling error in both

57
New cards

blocking

divide experimental units into groups with known confounding variables

58
New cards

extreme treatments

a treatment you may add to an experiment to see if by doing more (or less) of a treatment will elicit more (or less) of an effect

59
New cards

probability distributions

the probability with which each possible observation of a variable occurs

60
New cards

normal distribution

continuous probability distribution, bell shaped curve, symmetrical around the mean

61
New cards

binomial distribution

discrete probability distribution, outcome of a number of bernouli trials

62
New cards

Bernoulli trials

only 2 possible outcomes, outcomes are indenpedent of each other

63
New cards

Poisson distribution

frequency distribution of events that occur rarely and randomly

64
New cards

Poisson conditions

the probability of 2 or more occurrences in a single sample subdivision is negligibly small, probability of one occurrence is proportional to the size of the subdivision, outcomes are independent, probability of an occurence is identical for all sample subdivisions

65
New cards

analysis of frequencies

compares the observed frequency of observations in different categories with the expected frequency under a null hypothesis

66
New cards

goodness-of-fit assumptions

no more than 20% of categories have expected counts <5, no category with expected count <1

67
New cards

how to calculate a G-test

calculate expected frequencies, calculate g based on the dissimilarities between observed and expected, compare G to the tabulated X² distribution

68
New cards

G test df

= k - p - 1, k= categories, p = estimated parameters

69
New cards

contigency analysis

asks whether 2 categorical variables are associated with one another

70
New cards

extrinsic hypothesis

derived from information other than the data you are analyzing

71
New cards

intrinsic hypothesis

expected frequencies are derived from the data you are analyzing

72
New cards

contingency df

k-p-1 or (r-1)(c-1), r = rows, c = columns

73
New cards

normal distribution defined by two parameters

mean and standard deviation

74
New cards

properties of a normal distribution

continuous distribution, probability measuted as the area under the curve, symmetrical, 2/3 of the are under the curve lies within on sd of the mean

75
New cards

central limit theorem

the mean of a large number of measurements randomly sampled from a non-normal (or normal) population is approximately normally distributed

76
New cards

Student’s t distribution

estimate the standard error or the mean use z-scores to estimate the probability of obtaining a particular sample mean given the population of means from which we are sampling

77
New cards

Z vs t

t distribution is more spread out because of uncertainty about the true population standard deviation

78
New cards

student’s t df

n - 1, n = sample size

79
New cards

single sample t - test

comapres the mean of a random sample to a population mean proposed in a null hypothesis

80
New cards

single-sample t-test assumptions

variable is normally distributed in the population, data is a random sample of the population

81
New cards

paired samples

individual observations in 2 samples are connected, i.e. an individual before and after treatment

82
New cards

paired t-test

difference between two paired observations, testing whether the mean difference between paired measurements equals some specified value (usually zero)

83
New cards

paired t-test assumptions

that the differences are normally distributed and the each pair is independent from the others

84
New cards

when can we ignore violations of assumptions

small deviations, large sample size, non-normality if deviations are small and size is large (central limit theory)

85
New cards

what to do when assumptions are violated

ignore violations, transformation, permutation/randomization/resampling methods

86
New cards

transformation

changes data to another scale of measurement to fit assumptions

87
New cards

data transformation by

applying the same mathematical formula to each observation, needs to be applied to each individual data point, must be backwards convertible to the original data

88
New cards

Anova

analysis of variance, compares and determines if there is significant difference between means of two or more unpaired groups

89
New cards

ANOVA assumptions

independence, normality, homogeneity of variance (homoscedasticity), random sampling

90
New cards

total sum of squares (SST)

measures the total variation in the dataset

91
New cards

between-group sum of squares (SSB or SSG)

this measures the variation between the group means and the grand mean

92
New cards

error sum of squares (SSE) or within-group SS (SSW)

this measures the variation within each group

93
New cards

summarizes the contribution of the group difference to the total variation

94
New cards

Anova is a general linear model GLM

a mathematical representation of the relationship between a response variable and one or more explanatory variables

95
New cards

GLM fixed

categories of the explanatory variable are pre-determined (drug trial dosage)

96
New cards

GLM random

randomly sampled from a larger pool of groups, groups are not of particular interest

97
New cards

Tukey’s test

tests which treatments/groups differ from each other, compares all possible pairs of means to infer which one(s) differ(s) while controlling type I error, only do if anova rejected H0

98
New cards

balanced design

all treatment groups have the same n, tukey’s test

99
New cards

unbalanced design

not all treatments have the same n, tukey-kramer test

100
New cards

tukey’s df

= N - k, N = total sample size (sum for all groups), k = total number of groups in the anova