Biostats (Cheng)

43 Terms

1

Descriptive Statistics

  • small population

  • can collect all data

  • cannot be used to draw conclusions beyond the data collected

2

Inferential Statistics

  • population is large (cannot collect all population data)

  • can only collect sample of population

  • can use sample data to make inferences about a population

  • focus on making predictions/generalizations about a larger dataset based on a sample

3

Estimation

  • inference about 1 group

  • can be point estimate or interval estimate

  • can estimate a proportion or mean

4

Comparison

  • inference about 2 or more groups

5

Correlation

  • relationship between 2 variables

6

Point estimate

  • single value

  • ex. the sample mean used as the estimate of the population mean

7

Interval Estimate

  • defined by two numbers between which a population parameter is said to lie

8

Confidence interval

  • interval estimate reported with a confidence level that states how sure one can be

  • confidence level is expressed as a percentage (most commonly 95%)

  • as confidence level (percentage) increases, the confidence interval widens

  • as sample size decreases, the confidence interval widens

  • represents confidence that the population parameter is within the confidence interval
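
The two widening rules above can be checked numerically. The sketch below is only an illustration: it assumes a normal-approximation interval for a mean (a t-based interval would be more appropriate for small samples), and the function name and data are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(sample, confidence=0.95):
    """Normal-approximation confidence interval for a population mean."""
    n = len(sample)
    centre = mean(sample)
    # Two-sided critical value, about 1.96 for a 95% confidence level.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * stdev(sample) / sqrt(n)
    return centre - margin, centre + margin


def width(interval):
    low, high = interval
    return high - low


# Hypothetical measurements, for illustration only.
sample = [4.8, 5.1, 5.0, 4.9, 5.3, 5.2, 4.7, 5.0, 5.1, 4.9]

ci_95 = mean_confidence_interval(sample, 0.95)
ci_99 = mean_confidence_interval(sample, 0.99)       # higher confidence level
ci_small = mean_confidence_interval(sample[:5], 0.95)  # smaller sample

print(width(ci_95), width(ci_99), width(ci_small))
```

With this data, both the 99% interval and the smaller-sample interval come out wider than the 95% interval on the full sample.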

9

Prevalence

  • aka prevalence proportion

  • proportion of a population found to have a condition

    • includes ALL cases (new and pre-existing)

  • (number of subjects with disease)/(total population)

  • usually expressed as fraction, percentage, or number of cases per 10,000 or 100,000 people
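
As a small worked example of the formula above, the sketch below expresses the same prevalence as a fraction, a percentage, and a count per 100,000. The function name and the counts are made up for illustration.

```python
def prevalence(cases, population, per=1):
    """Proportion of the population with the condition, scaled by `per`."""
    if population <= 0:
        raise ValueError("population must be positive")
    # `cases` counts ALL existing cases, new and pre-existing.
    return cases * per / population


# Hypothetical counts: 150 cases in a population of 60,000.
as_fraction = prevalence(150, 60_000)
as_percent = prevalence(150, 60_000, per=100)
per_100k = prevalence(150, 60_000, per=100_000)

print(as_fraction, as_percent, per_100k)
```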

10

Incidence rate

  • rate of new cases of a disease occurring in a specific population over a particular period of time

    • limited to NEW cases only

  • (number of NEW cases during a specified time period)/(person years at risk during the same time period)
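
The incidence-rate formula can be sketched the same way. The helper name, the `per` scaling argument, and the cohort numbers are assumptions for illustration; only new cases go in the numerator.

```python
def incidence_rate(new_cases, person_years, per=1):
    """New cases per `per` person-years at risk."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return new_cases * per / person_years


# Hypothetical cohort: 12 new cases over 4,000 person-years at risk.
rate_per_1000 = incidence_rate(12, 4_000, per=1_000)

print(rate_per_1000)  # new cases per 1,000 person-years
```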

11

Nominal data

  • categorical, unranked data

  • ex. gender, eye color, surgical outcome, blood type

  • when only 2 possible categories: dichotomous, binary, binomial

12

Ordinal Data

  • variables with an inherent order to the relationship among the different categories

  • implied ordering of the categories with unknown quantitative distance

  • distances between the levels may not be the same

  • meaning of different levels may not be the same for different individuals

  • utilizes numbers to indicate rank/order, but numerical values do not hold mathematical significance

  • ex. stages of cancer, education level, pain level, satisfaction level, agreement level

13

Unpaired samples

  • two groups from different populations

  • sample size may be different

14

Paired samples

  • same subjects undergoing each treatment (measurements come in pairs)

  • same sample size in both groups

  • can be the same people measured at different times or asked about the same products

15

Steps for group comparison

  1. Check data type

  2. Check dependence (paired or unpaired)

16

Unpaired Nominal Data

  • Chi-squared test

    • all expected cell counts are 5 or more (large sample size)

  • Fisher’s exact test

    • any expected cell count is below 5 (small sample size)

17

Paired Nominal Data

  • McNemar’s Test

  • Kappa statistics

    • measure of agreement

18

Unpaired ordinal data

  • Mann-Whitney U test

  • aka Wilcoxon rank-sum test (Wilcoxon two-sample test)

19

Paired ordinal data

  • Wilcoxon signed-rank test (for paired data)

20

Unpaired continuous data

  • unpaired t-test

21

Paired continuous data

  • paired t-test
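
The two-step selection (data type, then paired or unpaired) from the preceding cards can be summarized as a lookup. This is only a study aid: the dictionary, function name, and labels are hypothetical, and it ignores further checks such as normality for the t-tests.

```python
# (data type, paired?) -> test named on the cards above
TESTS = {
    ("nominal", False): "chi-squared test (Fisher's exact test for small samples)",
    ("nominal", True): "McNemar's test",
    ("ordinal", False): "Mann-Whitney U test",
    ("ordinal", True): "Wilcoxon signed-rank test",
    ("continuous", False): "unpaired t-test",
    ("continuous", True): "paired t-test",
}


def choose_test(data_type, paired):
    """Step 1: check the data type. Step 2: check dependence."""
    key = (data_type.lower(), bool(paired))
    if key not in TESTS:
        raise ValueError(f"unknown data type: {data_type!r}")
    return TESTS[key]


print(choose_test("ordinal", paired=False))
print(choose_test("Continuous", paired=True))
```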

22

Contingency table

  • table of observed data for categorical data

23

Expected table

  • table of expected data for categorical data (if no difference between groups)

  • for first row, first column = (first row margin)(first column margin)/(total)

  • has same margin and grand total values as contingency table
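
The margin rule generalizes to every cell: expected count = (row total)(column total)/(grand total). A minimal sketch follows, with hypothetical function names and counts. The Pearson chi-squared statistic (without continuity correction) is included as an assumption about how the expected table is then used.

```python
def expected_table(observed):
    """Expected counts under 'no difference between groups'."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    return [[r * c / total for c in col_totals] for r in row_totals]


def chi_squared_statistic(observed):
    """Pearson statistic: sum of (observed - expected)^2 / expected."""
    expected = expected_table(observed)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )


def all_expected_at_least_5(observed):
    """Rule of thumb for preferring chi-squared over Fisher's exact test."""
    return all(e >= 5 for row in expected_table(observed) for e in row)


# Hypothetical 2x2 contingency table (rows: groups, columns: outcome).
observed = [[30, 20], [10, 40]]
expected = expected_table(observed)

print(expected)
print(chi_squared_statistic(observed))
print(all_expected_at_least_5(observed))
```

The expected table keeps the same row totals, column totals, and grand total as the observed table.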

24

Odds ratio

  • odds: P/(1-P)

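  • odds ratio: (odds of disease in the exposed group)/(odds of disease in the unexposed group)

  • cross-product method: ad/bc, with cells labelled as follows

                       Yes (disease)   No (disease)
  Yes (risk factor)          a               b
  No (risk factor)           c               d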

  • OR = 1 means no association between outcome and exposure

  • OR >1 means exposure associated with increased risk for outcome

    • harmful effect

  • OR <1 means exposure is associated with reduced risk for outcome

    • protective effect

  • consider confidence interval (if it contains 1, not statistically significant)
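
To make the cross-product and the confidence-interval check concrete, here is a sketch. It assumes the common log-based (Woolf) interval, which the card does not specify, and uses hypothetical counts: a and b are exposed subjects with and without the disease, c and d are unexposed subjects with and without it.

```python
from math import exp, log, sqrt
from statistics import NormalDist


def odds_ratio(a, b, c, d, confidence=0.95):
    """Odds ratio ad/bc with a log-based (Woolf) confidence interval."""
    estimate = (a * d) / (b * c)
    # Standard error of log(OR); requires all four cells to be non-zero.
    se_log = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    low = exp(log(estimate) - z * se_log)
    high = exp(log(estimate) + z * se_log)
    return estimate, low, high


# Hypothetical study: 20 of 100 exposed and 10 of 100 unexposed have the disease.
estimate, low, high = odds_ratio(20, 80, 10, 90)
significant = not (low <= 1 <= high)

print(estimate, low, high, significant)
```

In this example the point estimate is above 1, but the interval still contains 1, so the association would not be called statistically significant.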

25

Accuracy

  • number of correct diagnoses (true positives + true negatives) divided by the entire tested population

26

Sensitivity

  • used for paired nominal data

  • measure of the performance of a binary classification test

  • true positive rate

  • measures proportion of actual positives which are correctly identified

  • how good a test is at finding actual positives

    • complementary to false negative rate

    • used for diagnosis

  • (actual positives identified)/(actual positives)

27

Specificity

  • true negative rate

  • measures performance of binary classification test

  • proportion of negatives which are correctly identified

    • complementary to false positive rate

    • used for diagnosis

  • (actual negatives identified)/(actual negatives)

28

Positive predictive value (PPV)

  • (number of true positives)/(number of positive calls)

    • number of positive calls = number of true positives + number of false positives

  • the chance that a person with a positive test truly has the disease

    • used for patient knowledge

29

Negative predictive value (NPV)

  • probability that a subject with a negative screening test really does not have the disease

    • used for patient knowledge

  • (number of true negatives)/(number of negative calls)

    • number of negative calls = number of true negatives + number of false negatives
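
Accuracy, sensitivity, specificity, PPV, and NPV all come from the same four counts, so one sketch covers them. The function name and the counts are hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Performance measures of a binary test from its 2x2 counts."""
    return {
        "sensitivity": tp / (tp + fn),  # actual positives correctly identified
        "specificity": tn / (tn + fp),  # actual negatives correctly identified
        "ppv": tp / (tp + fp),          # positive calls that are true positives
        "npv": tn / (tn + fn),          # negative calls that are true negatives
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Hypothetical screening results: 100 diseased and 200 healthy subjects.
metrics = diagnostic_metrics(tp=90, fp=30, fn=10, tn=170)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity are divided by the actual disease status, while PPV and NPV are divided by the test result, which is why the latter two answer the patient's question.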

30

Kappa statistics

  • statistical measure of inter-rater agreement

    • agreement: both raters have same outcome

  • for paired nominal data

31

Kappa statistic strengths of agreement

  • Poor <0.20

  • Fair 0.21-0.40

  • Moderate 0.41-0.60

  • Good 0.61-0.80

  • Very Good 0.81-1.00
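
A sketch of how a kappa value might be computed and mapped onto these bands. It assumes Cohen's kappa for two raters with a yes/no outcome, which the cards do not name explicitly, and it treats each band's upper limit as inclusive. Function names and counts are hypothetical.

```python
def cohen_kappa(both_yes, a_yes_b_no, a_no_b_yes, both_no):
    """Cohen's kappa for two raters: (observed - chance) / (1 - chance)."""
    total = both_yes + a_yes_b_no + a_no_b_yes + both_no
    observed = (both_yes + both_no) / total
    a_yes = (both_yes + a_yes_b_no) / total
    b_yes = (both_yes + a_no_b_yes) / total
    # Agreement expected by chance if the raters were independent.
    chance = a_yes * b_yes + (1 - a_yes) * (1 - b_yes)
    return (observed - chance) / (1 - chance)


def strength_of_agreement(kappa):
    """Map kappa to the bands on the card (upper limits taken as inclusive)."""
    if kappa <= 0.20:
        return "Poor"
    if kappa <= 0.40:
        return "Fair"
    if kappa <= 0.60:
        return "Moderate"
    if kappa <= 0.80:
        return "Good"
    return "Very Good"


# Hypothetical ratings of 100 subjects by two raters.
kappa = cohen_kappa(both_yes=40, a_yes_b_no=10, a_no_b_yes=5, both_no=45)

print(round(kappa, 3), strength_of_agreement(kappa))
```

Here the raters agree on 85% of subjects, but half of that agreement is expected by chance, so kappa is lower than the raw agreement.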

32

T-tests

  • assess whether the means of two groups are statistically different from each other

33

Non-parametric tests

  • distribution free test

  • does not assume anything about the underlying distribution

  • ex. Chi-squared test, Fisher’s exact test, McNemar’s test, Mann-Whitney U test, and Wilcoxon signed-rank test

34

Parametric test

  • makes assumptions about a population’s parameters

  • usually means tests like t-test or ANOVA

    • assume the population data has a normal distribution

35

Tests that check normal distribution

  • QQ plot

  • Shapiro Wilk Test

36

QQ plot

  • quantile-quantile plot

  • shows distribution of the data against the expected normal distribution

  • for normally distributed data, observations should lie approximately on a straight line

  • possible outliers are points at the ends of the line
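
The points of a normal QQ plot can be computed without a plotting library. The sketch below pairs each sorted observation with the quantile expected under a normal distribution fitted to the sample. The plotting positions (i - 0.5)/n are one common convention assumed here, and the function name and data are hypothetical.

```python
from statistics import NormalDist, mean, stdev


def qq_points(sample):
    """Return (expected normal quantile, observed value) pairs."""
    ordered = sorted(sample)
    n = len(ordered)
    fitted = NormalDist(mean(ordered), stdev(ordered))
    return [
        (fitted.inv_cdf((i - 0.5) / n), value)
        for i, value in enumerate(ordered, start=1)
    ]


# Hypothetical measurements, for illustration only.
points = qq_points([5.1, 4.9, 5.0, 5.3, 4.7])

for expected_q, observed in points:
    print(f"{expected_q:.3f}  {observed:.1f}")
```

Plotting observed against expected values gives the QQ plot; points far from the line y = x, especially at either end, are candidate outliers.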

37

Shapiro Wilk Test

  • test of normality in frequentist statistics

  • null hypothesis: population is normally distributed

  • if P < 0.05, reject the null hypothesis: data are not normally distributed

    • nonparametric test should be used

  • if P > 0.05, no evidence against a normal distribution (normality is assumed, not proven)

    • t test can be used

  • best power for a given significance

38

Unpaired t-test

  • two sample t test

  • applied to 2 independent groups (different people in 2 different groups)

  • sample size may be unequal in each group

39

Paired t-test

  • equivalent to a one sample t test on the within-pair differences

  • measures whether means from a within subjects test group vary over 2 test conditions (same people in same group)

  • equal sample size

  • takes into account the fact that pairs of subjects go together
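
The way a paired t-test "takes pairs into account" is by working on the within-pair differences. The sketch below computes only the t statistic and degrees of freedom, since a p-value needs the t distribution, which is not in the Python standard library. The function name and data are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(first, second):
    """t statistic of the mean within-pair difference (second - first)."""
    if len(first) != len(second):
        raise ValueError("paired samples must have equal size")
    differences = [s - f for f, s in zip(first, second)]
    n = len(differences)
    standard_error = stdev(differences) / sqrt(n)
    return mean(differences) / standard_error, n - 1


# Hypothetical before/after measurements on the same five people.
before = [120, 132, 118, 140, 126]
after = [115, 128, 117, 133, 121]

t, degrees_of_freedom = paired_t_statistic(before, after)
print(round(t, 3), degrees_of_freedom)
```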

40

One-tailed t-test

  • first mean expected to be larger than the second or first mean expected to be smaller than the second

  • expect the effect to be in a certain direction

41

Two-tailed t-test

  • first mean expected to be different from the second in EITHER direction

  • used when looking for any difference between samples
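
The practical difference between one-tailed and two-tailed tests is how the p-value is read off the same statistic. The sketch below uses a standard normal approximation as a stand-in for the t distribution (an assumption that is reasonable only for large samples). The function name and the statistic are hypothetical.

```python
from statistics import NormalDist


def tail_p_values(statistic):
    """P-values for one-tailed and two-tailed alternatives (normal approx.)."""
    lower = NormalDist().cdf(statistic)  # first mean smaller than second
    upper = 1 - lower                    # first mean larger than second
    return {
        "less": lower,
        "greater": upper,
        "two_tailed": 2 * min(lower, upper),  # different in EITHER direction
    }


p = tail_p_values(1.8)
print({name: round(value, 4) for name, value in p.items()})
```

For this statistic the one-tailed "greater" p-value falls below 0.05 while the two-tailed p-value does not, which is why the direction must be chosen before looking at the data.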

42

Test of equal variance (F-Test)

  • used to test if the variances of 2 populations are equal

    • ratio of the sample variances of each group

  • if variances are equal, F is close to 1

    • P>0.05

    • use unpaired t test

  • the more ratio deviates from 1, the stronger the evidence for unequal population variances

    • P<0.05

    • use Welch’s unpaired t-test

  • used for unpaired data

  • Excel: FTEST (array1,array2)

    • returns 2 tailed probability that the variances in array1 and array2 are not significantly different

  • should check normality before using
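
A sketch of the quantities involved: the F statistic as a ratio of sample variances, plus the pooled (equal-variance) and Welch (unequal-variance) forms of the unpaired t statistic. P-values are omitted because the F and t distributions are not in the Python standard library. Function names and data are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def f_ratio(group1, group2):
    """Ratio of sample variances; close to 1 when variances are similar."""
    return variance(group1) / variance(group2)


def pooled_t(group1, group2):
    """Unpaired t statistic assuming equal population variances."""
    n1, n2 = len(group1), len(group2)
    pooled = ((n1 - 1) * variance(group1) + (n2 - 1) * variance(group2)) / (n1 + n2 - 2)
    return (mean(group1) - mean(group2)) / sqrt(pooled * (1 / n1 + 1 / n2))


def welch_t(group1, group2):
    """Welch's t statistic, which does not assume equal variances."""
    n1, n2 = len(group1), len(group2)
    return (mean(group1) - mean(group2)) / sqrt(variance(group1) / n1 + variance(group2) / n2)


# Hypothetical independent groups of unequal size and spread.
group1 = [10, 12, 11, 13, 14]
group2 = [8, 15, 5, 18, 9, 11]

print(f_ratio(group1, group2), pooled_t(group1, group2), welch_t(group1, group2))
```

The ratio here is far from 1, which is the situation where the card points to Welch's version.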

43