Biostats

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/133

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

134 Terms

1
New cards

What are the 4 steps of hypothesis testing?

1. Make hypothesis

2. Set criteria for decision

3. Gather data and conduct statistics

4. Make decisions (accept or reject the null)

2
New cards

What is a null hypothesis?

No difference; all samples from the same population and that the observed difference is due to chance (random sampling variation). Our goal is to REJECT the null

3
New cards

What is an alternate hypothesis?

At least one sample from different population; difference not due to chance

4
New cards

Do we prove Ha?

No (it's difficult); our goal is to disprove H0

5
New cards

What is our alpha?

a = .05

6
New cards

If alpha =.05, how sure are we that were correct?

95% sure; 5% chance of error

7
New cards

What is probability?

Chance; the likelihood that an event will occur

8
New cards

What does it mean that something is mutually exclusive?

Two events, X and Y, are mutually exclusive if occurrence of one precludes occurrence of other. Either/or but not both

9
New cards

What does it mean that something has independent probabilities?

Two events, X and Y, are independent if occurrence of one has no effect on occurrence of other

10
New cards

What is the probability of rolling either a 2 or a 5?

1/3

11
New cards

What it the additive rule?

OR - If X and Y are mutually exclusive, what is the probability of getting X or Y? ADD the probability of X + probability of Y

12
New cards

What is the probability of rolling 6 on first roll and 5 on second roll?

1/36

13
New cards

What is the multiplicative rule?

AND If X and Y are independent, what is the probability of getting X and Y? Probability of X times probability of Y

14
New cards

What happens to probability when we use the or rule?

Probability would increase, because we only want 1 of the 2 events to occur, we're increasing our odds

<p>Probability would increase, because we only want 1 of the 2 events to occur, we're increasing our odds</p>
15
New cards

What happens to probability when we use the and rule?

Probability decreases, because it is harder to get both events

<p>Probability decreases, because it is harder to get both events</p>
16
New cards

one test normal: 95% chance. 9 test normal? abnormal? error one test?

What is probability one test is normal (95%). Independent we multiply so 0.95*0.95 = 0.90.

Probability 9 tests are normal (0.95)^9 = 63.0% normal. More tests = more error. 5% error on one test and 9 tests is 45%.

37% chance on test is abnormal: complement rule states: 𝑃 ( At least one abnormal ) = 1 − 𝑃 ( All normal ) P(At least one abnormal)=1−P(All normal)

17
New cards

What is a sample?

Selection of study subjects who will be measured on some parameter to provide information about population

18
New cards

Why do we sample?

Because it costs a lot of time and money to poll everyone in a population

19
New cards

20
New cards

What is selection bias? is it unethical?

Asking the people that you want to because it's convenient to you, you can't do this - you need to give all eligible participants a chance to be selected; not unethical

<p>Asking the people that you want to because it's convenient to you, you can't do this - you need to give all eligible participants a chance to be selected; not unethical</p>
21
New cards

What is simple random sampling?

Each individual has an equal chance of being selected (ex. drawing names out of a hat)

<p>Each individual has an equal chance of being selected (ex. drawing names out of a hat)</p>
22
New cards

What is systematic sampling?

Select at regular intervals through ordered list. Pick random number (k), select every kth person

<p>Select at regular intervals through ordered list. Pick random number (k), select every kth person</p>
23
New cards

What is stratified sampling? (KNOW THIS ONE)

Separate population into strata based on a characteristic. Randomly select proportion of participants from each stratum

<p>Separate population into strata based on a characteristic. Randomly select proportion of participants from each stratum</p>
24
New cards

What sampling technique are we talking about if we say a representative proportion is used?

Stratified sampling

25
New cards

What is cluster sampling?

Population is already in strata, and we randomly select groups of strata. People are broken into already established groups (fresh, soph, junior, senior) and you pick a group (or 2) and sample everyone in that group

<p>Population is already in strata, and we randomly select groups of strata. People are broken into already established groups (fresh, soph, junior, senior) and you pick a group (or 2) and sample everyone in that group</p>
26
New cards

What is convenience sampling?

This is not science. It's cold calling and doesn't typically represent population.

<p>This is not science. It's cold calling and doesn't typically represent population.</p>
27
New cards

What is multistage sampling?

Combines more than 1 sampling. Surveys are sent out to randomly selected participants (Hey are you a dr.?); then from that answer you ask a second more detailed sample (Here's a survey about healthcare)

<p>Combines more than 1 sampling. Surveys are sent out to randomly selected participants (Hey are you a dr.?); then from that answer you ask a second more detailed sample (Here's a survey about healthcare)</p>
28
New cards

What variables are based on how data are measured?

Qualitative and quantitative

29
New cards

What variables are based on study design?

Dependent and independent

30
New cards

What are qualitative (categorical) variables?

groups, cohorts; Categorical data has a frequency. 50 ppl and 10 failed (20%). 20% is a frequency which is a categorical variable.

31
New cards

What are nominal categorical variables?

Multiple categories with no difference in quality. Ex. ethnicity, hair color, eye color. Different answers, but blue eyes is no better than green eyes

32
New cards

What are dichotomous (binary) categorical variables?

Only two labels for variable. Ex. gender, disease status, smoker/nonsmoker

33
New cards

What are ordinal categorical variables?

Ranked variables. Ex. age group, stage of cancer, pain rating

34
New cards

What are quantitative (continuous) variables?

Measured quantities

35
New cards

What are interval quantitative variables?

Any quantitative value. Ex. height, weight, temp

36
New cards

What are ratio quantitative variables?

Has meaning (ex. blood pressure) or meaningful zero (temperature)

37
New cards

What are independent variables?

Input, manipulated

<p>Input, manipulated</p>
38
New cards

What are dependent variables?

Outcome, response, predicted

39
New cards

What is sample size?

number of samples in each group

40
New cards

What kind of sample size has more statistical power?

Larger sample sizes (but this doesn't mean that small n is powerless)

41
New cards

What type of experiments are forced to have a small n?

Rare diseases, expensive experimental designs

42
New cards

What does it mean if mean and median are equal or approximately equal?

Normal distribution

43
New cards

What does it mean if mean and median aren't equal?

Skewed data set

44
New cards

What is variance?

Spread of data; if there is 0 variance, the values are identical and you can't run statistics

45
New cards

What are the measures of central tendency?

mean, median, mode

46
New cards

What is range?

Highest minus lowest value

47
New cards

What is interquartile range?

75th percentile - 25th percentile

48
New cards

What is standard deviation?

the square root of the variance; standard deviation is NOT dependent on sample size

49
New cards

What is standard error of the mean?

the standard deviation of a sampling distribution; IS dependent on sample size

50
New cards

How do you calculate standard error of the mean?

standard deviation / square root of sample size

51
New cards

What is a unimodal distribution?

when 1 score occurs in greatest frequency (has 1 peak)

<p>when 1 score occurs in greatest frequency (has 1 peak)</p>
52
New cards

What is a bimodal distribution?

when 2 scores both occur with the greatest frequency

<p>when 2 scores both occur with the greatest frequency</p>
53
New cards

What is a left-skewed distribution?

aka negative skew, the tail points to the left so most values on right

<p>aka negative skew, the tail points to the left so most values on right</p>
54
New cards

What is a right-skewed distribution?

aka positive skew, the tail points to the right so most values on left

<p>aka positive skew, the tail points to the right so most values on left</p>
55
New cards

If a statistic includes 95% of the data, how many standard deviations are included? What does it mean?

4 - +2 above the mean, -2 below the mean; NORMAL POP DISTRIBUTION

56
New cards

Know how to find standard deviation by looking at a bell curve

knowt flashcard image
57
New cards

What does it mean if a curve has a positive skew?

Tail is right, more values on left. The mean is being pulled in a positive direction

58
New cards

What does it mean if a curve has a negative skew?

Tail is left, more values on right. The mean is being pulled in a negative direction

59
New cards

What happens to the mean, median, mode in a positive skew? (KNOW THIS)

Mode < Median < Mean

<p>Mode &lt; Median &lt; Mean</p>
60
New cards

What happens to the mean, median, mode in a negative skew? (KNOW THIS)

Mean < Median < Mode

<p>Mean &lt; Median &lt; Mode</p>
61
New cards

What is a Type I error?

Probability of rejecting H0 when it is true

62
New cards

How do we calculate Type I error?

Same as significance level (your chosen alpha); telling someone they have HIV when they really don't

63
New cards

If alpha is .2, what is the chance of Type I error?

20% chance

64
New cards

What is worse, Type I or Type II error?

Type II - it's like telling someone they don't have HIV when they do, someone could die

65
New cards

What is Type II error?

Probability of accepting H0 when it is false. Telling someone they don't have HIV when they really do

66
New cards

Be able to draw the statistical decision tree

knowt flashcard image
67
New cards

When do we use a contingency table?

To analyze frequency distributions (proportions!)

68
New cards

When both variables are categorical what test do you use?

Chi Squared

69
New cards

When both variables are categorical, but you have a cell less than 5, what test do you use?

Fisher's Exact

70
New cards

What does C2 test for?

Goodness of fit and independence

71
New cards

72
New cards

What does it mean if your categorical IV is dependent?

You're your own control. Usually associated with before/after studies. You're comparing a change in yourself over time

73
New cards

What test would you use if your dependent variable is continuous, your independent variable is independent?

T-Test

For 2 groups, assumes data is normally distributed, variances equal

74
New cards

What test would you use if your dependent variable is continuous, your independent variable is dependent?

Paired T-Test

For matched samples, comparing change in individuals over time

75
New cards

What test would you use if your dependent variable is continuous, but you have more than 2 groups?

ANOVA (Analysis of Variance)

Same assumptions as T-test, but it's for more than 2 groups

76
New cards

What test would you use if your dependent variable is continuous, you have more than 2 groups, but your data isn't normally distributed?

Kruskal-Wallis test

Compares medians for more than 2 groups; nonparametric tests compare median of groups

77
New cards

What test would you use if your dependent variable is continuous, your independent variable is dependent, and your data isn't normally distributed?

Wilcoxon Rank Sum test

No assumptions about underlying distribution of data. Compares paired data; nonparametric tests compare median of groups

78
New cards

What test would you use if your dependent variable is continuous, your independent variable is independent, and your data isn't normally distributed?

Mann-Whitney U Test

Compares medians of 2 groups; nonparametric tests compare median of groups

79
New cards

When would you use a correlation?

When both the dependent and independent variable are continuous

80
New cards

What do correlations establish?

predictions

81
New cards

Does correlation imply causation?

NO

82
New cards

What test would you use if you have a continuous dependent variable and continuous independent variable?

Pearson Correlation

83
New cards

What test would you use if you have a continuous dependent variable and continuous independent variable, and skewed data?

Spearman's Rank

84
New cards

How do you measure degrees of correlation?

measured by r

Pearson Correlation Coefficient measures linear relationship between two variables (+1/-1/0)

85
New cards

How is Spearman's rank measured?

Rho

Measures extent to which, as one variable increases, other variable tends to increase. Does not require linear relationship

86
New cards

What is a regression analysis?

Measures impact of one variable on other. R2 = amount of variation in dependent variable explained by independent variable

87
New cards

What are the different values for correlation coefficient?

-1 to +1, 0 is no correlation

<p>-1 to +1, 0 is no correlation</p>
88
New cards

How do you draw a contingency table?

knowt flashcard image
89
New cards

What is sensitivity?

Proportion of patients with disease where screening test is positive

90
New cards

How do you calculate sensitivity?

= TP/ (TP+FN)

91
New cards

What does a high sensitivity mean?

Higher sensitivity, better test in finding diseased patients

92
New cards

Where on a contingency table do you look to calculate sensitivity and specificity?

COLUMNS

Sensitivity - LEFT COLUMN

Specificity - RIGHT COLUMN

93
New cards

What is specificity?

Proportion of patients without disease where screening test is negative

94
New cards

How do you calculate specificity?

= TN/ (FP + TN)

95
New cards

What does it mean if you have high specificity?

Healthy patients are not diagnosed as diseased

96
New cards

What is positive predictive value?

Proportion of positive test results where patients are diseased

97
New cards

How do you calculate PPV?

PPV = TP / (TP + FP)

98
New cards

What is negative predictive value?

Proportion of negative test results where patients are not diseased

99
New cards

How do you calculate NPV?

NPV = TN / (FN + TN)

100
New cards

What do you use on a contingency table to calculate PPV and NPV?

ROWS

PPV - top row

NPV - bottom row