biostats final

0.0(0)
studied byStudied by 10 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/156

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

157 Terms

1
New cards

parameter

quantity describing entire population

2
New cards

estimate

inferring unknown parameter from sample data

3
New cards

3 major goals of stats

estimate characteristics of pops

objectively answer scientific questions

describe degrees of uncertainty in scientific findings

4
New cards

variable

charcateristic measured on individual drawn from population

5
New cards

properties of good sample

ind

random

sufficiently large

6
New cards

random sample

each member of pop has equal and ind chance of being selected

sample is representative of the population

7
New cards

effects of sample size increasing

SD will become more accurate but will not systematically directionally change

SE will become smaller/narrower

8
New cards

1 numerical graph

histogram

9
New cards

2 numerical graph

scatter plot

10
New cards

1 categorical graph

bar graph

11
New cards

2 categorical graph

grouped bar graph

12
New cards

1 numerical 1 categorical graph

multiple histograms, dot/box plot

13
New cards

when is mode useful

in voting/surveys

14
New cards

when to use mean or median as measure of average

usually use mean but if outliers exist, use median

15
New cards

variance

average of the squared differences from the mean

16
New cards

SD

square root of variance

measure of inherent variability among individuals

17
New cards

coefficient of variation

SD/mean x 100

18
New cards

SE

variability between samples

19
New cards

most dangerous eq

SE

if sample size is small, SE appears more significant than it actually is. Small sample sizes can produce extreme values that can be misinterpreted

20
New cards

confidence interval rule of thumb

mean ± 2SE prpvides rough estimate of 95% CI

21
New cards

what does 95% CI mean

we are 95% confident that the true population mean lies within the 95% confidence interval

22
New cards

what happens to confidence interval as sample size inc

gets narrower

23
New cards

pseudoreplication

error that occurs when samples that are not independent are treated as though they are

24
New cards

characteristics of normal dist

  • symmetric around mean

  • about 2/3 of random samples are within 1 SD of the mean

  • about 95% of random samples are within 2 SD of the mean

  • mean=median=mode

    • bell shaped

25
New cards

standard normal distribution characteristics

mean=0

SD=1

26
New cards

standard normal deviate

knowt flashcard image
27
New cards

CLT

in a large sample, the mean of samples approaches a normal distribution regardless of if the population’s distribution is normal or not

28
New cards

how is t distribution different than z distribution

probailities based on a sample so need to account for greater uncertainty

  • confidence intervals wider and more probaility in the tails b/c only have estimates of mean and SD

29
New cards

DF

number of observations = # of parameters

30
New cards

1 sample t text

compares mean of random sample w population mean

31
New cards

1 sample t test assumptions

random sample

independent measurements

varibale is normally distributed

32
New cards

how is 1 sample t test robust

CLT

33
New cards

paired t test

1 sample t test on differences between pairs

34
New cards

why is paired t test good

allows you to account for extraneous variation; greater statistical power

35
New cards

paired t test assumptions

random sample

each pair of data is independent

the diff between the pairs is normally distributed

36
New cards

paired t test robust

CLT

37
New cards

paired t test DF

#pairs - 1

38
New cards

2 sample t test

compares means of 2 samples

39
New cards

2 sample t test assumptions

random

independent

normal distribution

equal variance

equal sample size

40
New cards

2 sample t test robust

performs adequately if diff in SD is 3 fold or less and sample size is moderately large and sample size is similar in both groups

41
New cards

unequal variance/welch test

adjusts for very unequal variances

42
New cards

assumptions for welch

same as 2 sample but no equal variance

43
New cards

why not just always use welch instead of 2 sample t test

less statistically powerful

44
New cards

robust definition

test performs adequately even if assumptions aren’t met exactly

45
New cards

Q-Q plot

straight line means perfectly normal

46
New cards

informal normality checks

histogram and Q-Q

47
New cards

leptokurtic

too pointed

48
New cards

platykurtic

too flat

49
New cards

formal tests for normality

shapiro-wilks (n<50)

kolmogorov smirnow (n>50)

50
New cards

steps if parametric assumptions are violated

evaluate outliers

transform data

non-parametric test

51
New cards

positive skew transformation

slight: square root

moderate: ln or log

extreme: inverse

52
New cards

transform negative skew

first try square and if that fails

reflect data so it is now pos skewed and then use the pos transformations

53
New cards

backtransforming

backtransform important parameters like mean and SE/ 95% CI

DONT BACKTRANSFORM SD

54
New cards

when transformations don’t work

use non-parametric tests

55
New cards

non-parametric tests have…

fewer assumptions about shape and spread of data but are less statistically pwoerful

56
New cards

non-parametric version of 2 sample t test

mann-whitney u test

57
New cards

mann-whitney u test

converts data into ranks and tests for difference between medians

58
New cards

mann0whitney u test assumptions

similar shape and variance

59
New cards

non parametric test for paired and 1 sample

wilcoxon signed rank test and sign test

60
New cards

wilcoxon signed rank test

test difference between sample median and hypothesized median

turned diff data into ranks

61
New cards

wilcoxon signed rank test assumptions

data are symmetric around the mean

62
New cards

sign test

tests diff between sample median and hypotheiszed median

turned differences into +1 and -1

63
New cards

sign test assumptions

none

low statistical power

64
New cards

type 1 error

incorrectly rejecting the null hypothesis

populations not diff but saying they are

65
New cards

p value

estimate of likelihood of committing type 1 error

66
New cards

type 2 error

failure to reject the null hypothesis even though it is false

populations are different but saying they arent

67
New cards

ANOVA

1 categorical variable (2+ groups_) and 1 numerical value

68
New cards

example of ANOVA`

effect of 3 drugs and a placebo on blood pressure

69
New cards

what does anova compare

mean of 2+ groups

70
New cards

null hypothesis of ANOVa

all means are the same; there is no difference

71
New cards

alternate hypothesis for anova

there is at least one difference

72
New cards

anova assumptions

same as 2 sample tt est

73
New cards

anova robust

same as 2 sample t test

CLT and 3 fold

74
New cards

anova f statistic

F= s2 between groups / s2 within groups

75
New cards

when F=1

groups come from same population

76
New cards

when F>1

groups come from diff populations

77
New cards

degrees of freedom for ANOVA

1: k-1 = # groups -1

2: n-k= # observations = # groups

78
New cards

why not just do multiple t tests instead of ANOVA

-p value no longer representative of type 1 error

large probability of erroneous results

multiple comparisons would lead to too many type 1 errors

79
New cards

what to do after anova shows there is a diff

tukey test

80
New cards

why do we need tukey test

need to see WHICH groups are diff form each other

81
New cards

tukey test

compares all groups and determines which pairs are different

82
New cards

HSD

honestly significantly difference

83
New cards

why is tukey test important

protects us from making false conclusions due to many comparisons

84
New cards

tukey test assumptions

same as 2 sample t test

85
New cards

fixed effects

  • constant across treatments

86
New cards

random effects

  • not constant; size of effect varies within groups

87
New cards

E.g. testing 4 drugs and their speeds of recovery

fixed: care about specific drugs and the dosage that each person in each drug treatment group gets

88
New cards

e.g. testing dosage of drug A on speed of recovery

Fixed; care about particular dosage

89
New cards

e.g. compare GPA of students from wealthiest 10% and poorest 10% if families from 45 random schools

random effect; top and bottom 10% vary by school district

90
New cards

e.g. survey of patients about drug use vs. recovery time

random; low med high dosage groups but still variation in dosage in each group

91
New cards

random effects assumptions

same as 2 sample t test AND

  • groups are from random sample

    • group means are normally distributed

92
New cards

welch’s ANOVA

used instead of ANOVA if variances are VERY different

93
New cards

welch’s anova assumptions

  • random

  • independent

    • normally distributed

  • similar sample sizes

94
New cards

post hoc test used for welch’s anova

games-howell post hoc test

95
New cards

games- howell post hoc test

like tukey test but handles unequal variances

96
New cards

games-howell assumptions

same as welch’s anova

  • random, independent, normal dist, similar sample size

97
New cards

non-parametric ANOVA

kruskal-wallis

98
New cards

kruskal wallis test assumption

similar shape and variance (like mann whitney U)

99
New cards

null hypothesis of kruskal wallis

all medians and distributions are equal

100
New cards

alterante hypothesis kruskal wallis

all medians are not the same; at least one group is different