parameter
quantity describing entire population
estimate
inferring unknown parameter from sample data
3 major goals of stats
estimate characteristics of pops
objectively answer scientific questions
describe degrees of uncertainty in scientific findings
variable
characteristic measured on an individual drawn from a population
properties of good sample
independent
random
sufficiently large
random sample
each member of the population has an equal and independent chance of being selected
the sample is representative of the population
effects of sample size increasing
SD estimate becomes more accurate but does not systematically change in either direction
SE will become smaller/narrower
1 numerical graph
histogram
2 numerical graph
scatter plot
1 categorical graph
bar graph
2 categorical graph
grouped bar graph
1 numerical 1 categorical graph
multiple histograms, dot/box plot
when is mode useful
in voting/surveys
when to use mean or median as measure of average
usually use mean but if outliers exist, use median
variance
average of the squared differences from the mean
SD
square root of variance
measure of inherent variability among individuals
coefficient of variation
SD/mean x 100
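A minimal sketch of these spread measures in Python (NumPy assumed; the data values are made up for illustration):

```python
import numpy as np

x = np.array([4.2, 5.1, 3.8, 6.0, 4.9])   # hypothetical measurements

variance = np.var(x, ddof=1)   # average squared difference from the mean (sample version, n - 1)
sd = np.sqrt(variance)         # SD = square root of variance
cv = sd / np.mean(x) * 100     # coefficient of variation = SD/mean x 100

print(variance, sd, cv)
```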
SE
variability between samples
most dangerous eq
SE
because SE = SD/√n, small samples produce extreme sample means that can be misinterpreted as meaningful differences
confidence interval rule of thumb
mean ± 2SE provides a rough estimate of the 95% CI
what does 95% CI mean
we are 95% confident that the true population mean lies within the 95% confidence interval
what happens to confidence interval as sample size inc
gets narrower
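A sketch of the rule of thumb versus an exact t-based interval, using SciPy on hypothetical data:

```python
import numpy as np
from scipy import stats

x = np.array([4.2, 5.1, 3.8, 6.0, 4.9, 5.5, 4.4, 5.0])  # hypothetical sample

mean = np.mean(x)
se = stats.sem(x)   # standard error = SD / sqrt(n); variability between sample means

# rule of thumb: mean +/- 2*SE approximates a 95% CI
rough_ci = (mean - 2 * se, mean + 2 * se)

# exact 95% CI uses the t distribution with n - 1 degrees of freedom
exact_ci = stats.t.interval(0.95, df=len(x) - 1, loc=mean, scale=se)
```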
pseudoreplication
error that occurs when samples that are not independent are treated as though they are
characteristics of normal dist
symmetric around mean
about 2/3 of observations fall within 1 SD of the mean
about 95% of observations fall within 2 SD of the mean
mean=median=mode
bell shaped
standard normal distribution characteristics
mean=0
SD=1
values are standard normal deviates (z scores): z = (x - mean) / SD
CLT
with a sufficiently large sample size, the distribution of sample means approaches a normal distribution regardless of whether the population's distribution is normal
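A quick simulation sketch of the CLT (all numbers hypothetical): sample means drawn from a clearly skewed population still pile up in a roughly normal shape.

```python
import numpy as np

rng = np.random.default_rng(0)

# population that is clearly NOT normal (exponential, right-skewed)
population = rng.exponential(scale=2.0, size=100_000)

# draw many samples and record each sample's mean
sample_means = [rng.choice(population, size=50).mean() for _ in range(2_000)]

# the distribution of sample_means is approximately normal (check with a
# histogram or Q-Q plot), even though the population itself is skewed
print(np.mean(sample_means), np.std(sample_means, ddof=1))
```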
how is t distribution different than z distribution
probabilities are based on a sample, so greater uncertainty must be accounted for
confidence intervals are wider and there is more probability in the tails because we only have estimates of the mean and SD
DF
number of observations - number of estimated parameters
1 sample t test
compares mean of random sample w population mean
1 sample t test assumptions
random sample
independent measurements
variable is normally distributed
how is 1 sample t test robust
CLT
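A minimal 1-sample t test sketch with SciPy; the temperatures and the hypothesized mean of 37.0 are made up for illustration:

```python
import numpy as np
from scipy import stats

body_temps = np.array([36.8, 37.1, 36.5, 36.9, 37.3, 36.7, 37.0])  # hypothetical data

# H0: the population mean equals 37.0
t_stat, p_value = stats.ttest_1samp(body_temps, popmean=37.0)
print(t_stat, p_value)  # df = n - 1 = 6
```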
paired t test
1 sample t test on differences between pairs
why is paired t test good
allows you to account for extraneous variation; greater statistical power
paired t test assumptions
random sample
each pair of data is independent
the diff between the pairs is normally distributed
paired t test robust
CLT
paired t test DF
#pairs - 1
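A paired t test sketch on hypothetical before/after values, showing it is equivalent to a 1-sample t test on the differences:

```python
import numpy as np
from scipy import stats

before = np.array([140, 152, 138, 145, 160, 148])  # hypothetical paired measurements
after  = np.array([135, 147, 136, 140, 151, 145])

# paired t test
t_stat, p_value = stats.ttest_rel(before, after)

# equivalent: 1-sample t test on the differences against 0, df = #pairs - 1 = 5
t_stat2, p_value2 = stats.ttest_1samp(before - after, popmean=0)
```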
2 sample t test
compares means of 2 samples
2 sample t test assumptions
random
independent
normal distribution
equal variance
equal sample size
2 sample t test robust
performs adequately if the difference in SDs is 3-fold or less, the sample sizes are moderately large, and the sample sizes are similar in both groups
unequal variance/welch test
adjusts for very unequal variances
assumptions for welch
same as 2 sample but no equal variance
why not just always use welch instead of 2 sample t test
less statistically powerful
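A sketch contrasting the two tests in SciPy; only the equal_var flag changes (data hypothetical):

```python
import numpy as np
from scipy import stats

group_a = np.array([5.1, 4.8, 5.6, 5.0, 4.9, 5.3])  # hypothetical groups
group_b = np.array([4.2, 4.6, 4.1, 4.4, 4.0, 4.5])

# classic 2-sample t test (assumes equal variances)
t_eq, p_eq = stats.ttest_ind(group_a, group_b, equal_var=True)

# Welch's test (drops the equal-variance assumption)
t_w, p_w = stats.ttest_ind(group_a, group_b, equal_var=False)
```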
robust definition
test performs adequately even if assumptions aren’t met exactly
Q-Q plot
points falling along a straight line indicate the data are normally distributed
informal normality checks
histogram and Q-Q
leptokurtic
too pointed
platykurtic
too flat
formal tests for normality
Shapiro-Wilk (n < 50)
Kolmogorov-Smirnov (n > 50)
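A sketch of the informal and formal checks with SciPy (plus matplotlib for the Q-Q plot); data are simulated for illustration:

```python
import numpy as np
from scipy import stats
import matplotlib.pyplot as plt

x = np.random.default_rng(0).normal(loc=10, scale=2, size=40)  # hypothetical data

# informal: Q-Q plot; points near the line suggest normality
stats.probplot(x, dist="norm", plot=plt)
plt.show()

# formal: Shapiro-Wilk for small samples (n < 50)
w, p_shapiro = stats.shapiro(x)

# formal: Kolmogorov-Smirnov for larger samples (n > 50),
# comparing against a normal with the sample's own mean/SD
d, p_ks = stats.kstest(x, "norm", args=(x.mean(), x.std(ddof=1)))
```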
steps if parametric assumptions are violated
evaluate outliers
transform data
non-parametric test
positive skew transformation
slight: square root
moderate: ln or log
extreme: inverse
transform negative skew
first try squaring the data; if that fails,
reflect the data so it is positively skewed and then apply the positive-skew transformations
backtransforming
backtransform important parameters like the mean and SE / 95% CI
DON'T BACKTRANSFORM SD
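A transformation/backtransformation sketch on simulated positively skewed data, assuming an ln transform is appropriate:

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(1).lognormal(mean=1.0, sigma=0.6, size=30)  # positively skewed

log_x = np.log(x)   # moderate positive skew -> ln transform

mean_log = log_x.mean()
ci_log = stats.t.interval(0.95, df=len(x) - 1, loc=mean_log, scale=stats.sem(log_x))

# backtransform the mean and the CI limits (exp undoes ln);
# do NOT backtransform the SD
mean_back = np.exp(mean_log)
ci_back = (np.exp(ci_log[0]), np.exp(ci_log[1]))
```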
when transformations don’t work
use non-parametric tests
non-parametric tests have…
fewer assumptions about the shape and spread of the data but are less statistically powerful
non-parametric version of 2 sample t test
mann-whitney u test
mann-whitney u test
converts data into ranks and tests for difference between medians
mann-whitney u test assumptions
similar shape and variance
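A Mann-Whitney U sketch with SciPy on hypothetical data:

```python
import numpy as np
from scipy import stats

group_a = np.array([3.1, 4.5, 2.8, 5.2, 3.9])  # hypothetical non-normal data
group_b = np.array([6.0, 5.5, 7.1, 6.4, 5.9])

# rank-based alternative to the 2-sample t test
u_stat, p_value = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")
```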
non parametric test for paired and 1 sample
wilcoxon signed rank test and sign test
wilcoxon signed rank test
tests the difference between the sample median and a hypothesized median
converts the differences into ranks
wilcoxon signed rank test assumptions
the data (or paired differences) are symmetrically distributed
sign test
tests diff between sample median and hypothesized median
converts differences into +1 and -1
sign test assumptions
none
low statistical power
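A sketch of both nonparametric paired tests; SciPy has wilcoxon built in, and the sign test can be run as a binomial test on the signs of the differences (data hypothetical):

```python
import numpy as np
from scipy import stats

before = np.array([12, 15, 11, 14, 13, 16, 10, 12])  # hypothetical paired data
after  = np.array([10, 14, 12, 11, 11, 13, 9, 10])

# Wilcoxon signed-rank: ranks the differences (assumes symmetry)
w_stat, p_wilcoxon = stats.wilcoxon(before, after)

# sign test: keep only the signs of the nonzero differences
diffs = before - after
n_pos = int(np.sum(diffs > 0))
n_nonzero = int(np.sum(diffs != 0))
p_sign = stats.binomtest(n_pos, n_nonzero, p=0.5).pvalue  # H0: + and - equally likely
```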
type 1 error
incorrectly rejecting the null hypothesis
populations are not different but you conclude they are
p value
estimate of likelihood of committing type 1 error
type 2 error
failure to reject the null hypothesis even though it is false
populations are different but you conclude they are not
ANOVA
1 categorical variable (2+ groups) and 1 numerical variable
example of ANOVA
effect of 3 drugs and a placebo on blood pressure
what does anova compare
means of 2+ groups
null hypothesis of ANOVA
all means are the same; there is no difference
alternate hypothesis for anova
there is at least one difference
anova assumptions
same as 2 sample t test
anova robust
same as 2 sample t test
CLT and 3 fold
anova f statistic
F = s² between groups / s² within groups
when F=1
groups come from same population
when F>1
groups likely come from different populations
degrees of freedom for ANOVA
1: k - 1 = # groups - 1
2: n - k = # observations - # groups
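An ANOVA sketch with SciPy, reusing the drugs-vs-placebo example with made-up blood-pressure values:

```python
import numpy as np
from scipy import stats

placebo = np.array([120, 125, 130, 128, 122])  # hypothetical blood-pressure data
drug_a  = np.array([115, 118, 120, 117, 116])
drug_b  = np.array([110, 112, 108, 111, 109])

f_stat, p_value = stats.f_oneway(placebo, drug_a, drug_b)
# df1 = k - 1 = 2, df2 = n - k = 12
```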
why not just do multiple t tests instead of ANOVA
p value no longer representative of the type 1 error rate
multiple comparisons would lead to too many type 1 errors and a large probability of erroneous results
what to do after anova shows there is a diff
tukey test
why do we need tukey test
need to see WHICH groups are different from each other
tukey test
compares all groups and determines which pairs are different
HSD
honestly significant difference
why is tukey test important
protects us from making false conclusions due to many comparisons
tukey test assumptions
same as 2 sample t test
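A Tukey HSD sketch; scipy.stats.tukey_hsd is available in recent SciPy versions (statsmodels' pairwise_tukeyhsd is an alternative). Data hypothetical:

```python
from scipy import stats

placebo = [120, 125, 130, 128, 122]  # hypothetical blood-pressure data
drug_a  = [115, 118, 120, 117, 116]
drug_b  = [110, 112, 108, 111, 109]

res = stats.tukey_hsd(placebo, drug_a, drug_b)
print(res)  # pairwise p-values and CIs for every pair of groups
```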
fixed effects
constant across treatments
random effects
not constant; size of effect varies within groups
E.g. testing 4 drugs and their speeds of recovery
fixed: care about specific drugs and the dosage that each person in each drug treatment group gets
e.g. testing dosage of drug A on speed of recovery
Fixed; care about particular dosage
e.g. compare GPA of students from the wealthiest 10% and poorest 10% of families from 45 random schools
random effect; top and bottom 10% vary by school district
e.g. survey of patients about drug use vs. recovery time
random; low/med/high dosage groups but still variation in dosage within each group
random effects assumptions
same as 2 sample t test AND
groups are from random sample
group means are normally distributed
welch’s ANOVA
used instead of ANOVA if variances are VERY different
welch’s anova assumptions
random
independent
normally distributed
similar sample sizes
post hoc test used for welch’s anova
games-howell post hoc test
games- howell post hoc test
like tukey test but handles unequal variances
games-howell assumptions
same as welch’s anova
random, independent, normal dist, similar sample size
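SciPy has no built-in Welch's ANOVA or Games-Howell test; one option, assuming the third-party pingouin library fits your workflow, is a sketch like this (data hypothetical):

```python
import pandas as pd
import pingouin as pg  # third-party library: pip install pingouin

df = pd.DataFrame({
    "bp":    [120, 125, 130, 115, 118, 120, 110, 112, 108],  # hypothetical values
    "group": ["placebo"] * 3 + ["drug_a"] * 3 + ["drug_b"] * 3,
})

welch = pg.welch_anova(data=df, dv="bp", between="group")        # Welch's ANOVA
gh = pg.pairwise_gameshowell(data=df, dv="bp", between="group")  # post hoc pairs
```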
non-parametric ANOVA
kruskal-wallis
kruskal wallis test assumption
similar shape and variance (like mann whitney U)
null hypothesis of kruskal wallis
all medians and distributions are equal
alternate hypothesis of kruskal wallis
not all medians are the same; at least one group is different
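A Kruskal-Wallis sketch with SciPy on hypothetical groups:

```python
from scipy import stats

group_a = [2.9, 3.0, 2.5, 2.6, 3.2]  # hypothetical data, similar shapes/spreads
group_b = [3.8, 2.7, 4.0, 2.4, 3.9]
group_c = [2.8, 3.4, 3.7, 2.2, 2.0]

# rank-based alternative to one-way ANOVA
h_stat, p_value = stats.kruskal(group_a, group_b, group_c)
```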