Biostats/Epi II Final

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/113

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 9:25 PM on 4/8/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

114 Terms

1
New cards

Linear Regression — outcome type

Continuous (Gaussian); uses identity link

2
New cards

Linear Risk — outcome type

Binomial; uses identity link

3
New cards

Log Risk — outcome type

Binomial; uses log link

4
New cards

Logistic Regression — outcome type

Binomial; uses logit link

5
New cards

Poisson Regression — outcome type

Count or rate; uses log link

6
New cards

β₀ — Linear Regression

Expected value of Y for the reference group (e.g., males, age 0, no high school education)

7
New cards

β₀ — Linear Risk

Risk (probability) of the outcome in the reference group

8
New cards

β₀ — Log Risk

Log risk of outcome in reference group; exponentiated = risk for reference group

9
New cards

β₀ — Logistic Regression

Log odds of outcome in reference group; exponentiated = odds for reference group

10
New cards

β₀ — Poisson Regression

Log incidence rate in reference group; exponentiated = incidence rate for reference group

11
New cards

β₀ — Cox Proportional Hazard

Log hazard rate for reference group; exponentiated = hazard rate for reference group

12
New cards

β₁ — Linear Regression

For every 1-unit increase in age, E(Y) changes by β₁, conditioning on other covariates

13
New cards

β₁ — Linear Risk

Risk difference: estimated change in risk for every 1-year increase in age, adjusting for other covariates

14
New cards

β₁ — Log Risk

For every 1-unit increase in age, the log risk increases by β₁, conditioning on other covariates

15
New cards

β₁ — Logistic Regression

For every 1-unit increase in age, the log odds ratio increases by β₁, conditioning on other covariates

16
New cards

β₁ — Poisson Regression

For every 1-unit increase in age, the incidence rate increases by a factor of β₁, conditioning on other covariates

17
New cards

β₁ — Cox Proportional Hazard

For every 1-unit increase in age, the hazard increases by a factor of β₁, conditioning on other covariates

18
New cards

β₂ — Linear Regression

Change in E(Y) when sex changes from referent (male) to non-referent (female), conditioning on age and education

19
New cards

β₂ — Linear Risk

Risk difference comparing female to male, adjusting for age and education

20
New cards

β₂ — Log Risk

Log risk increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

21
New cards

β₂ — Logistic Regression

Log odds increases/decreases by β₂ comparing female to male (referent), conditioning on age and education

22
New cards

β₂ — Poisson Regression

Incidence rate increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

23
New cards

β₂ — Cox Proportional Hazard

Hazard increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

24
New cards

β₃ — Linear Regression

Change in E(Y) from no HS (referent) to HS or college grad, conditioning on age and sex

25
New cards

β₃ — Linear Risk

Risk difference comparing HS or college grad to no HS (referent), conditioning on sex and age

26
New cards

β₃ — Log Risk

Log risk increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

27
New cards

β₃ — Logistic Regression

Log odds increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

28
New cards

β₃ — Poisson Regression

Incidence rate increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

29
New cards

β₄ — Linear Regression (interaction)

Incremental change in the relationship between age and outcome associated with being female

30
New cards

β₄ — Linear Risk (interaction)

Incremental change in risk difference in the relationship between age and outcome associated with being female

31
New cards

β₄ — Log Risk (interaction)

Change in log risk in the relationship between age and outcome associated with being female

32
New cards

β₄ — Logistic Regression (interaction)

Change in log odds in the relationship between age and outcome associated with being female

33
New cards

β₄ — Poisson Regression (interaction)

Change in log rate in relationship between age and outcome associated with being female; exp(β₄) = incremental change in IRR for age associated with being female

34
New cards

β₁ with interaction term (age × sex)

Change in outcome per 1-unit increase in age when sex is in the referent group (male = 0)

35
New cards

Cohort + binary outcome → model

Log risk, linear risk, or logistic regression; can estimate risk, RR, and RD

36
New cards

Case-control + binary outcome → model

Logistic regression only; can estimate odds only

37
New cards

Cohort + count or rate outcome → model

Poisson regression; can estimate incidence rates and IRR

38
New cards

Cohort + binary rare events → model

Poisson regression; can estimate risk ratio (RR)

39
New cards

Case report

Detailed descriptive report on a single individual; focuses on new or unusual symptoms; used for hypothesis generation

40
New cards

Case series

Detailed descriptive report on a single group of individuals defined by a specific disease or outcome

41
New cards

Ecological study — unit of analysis

The GROUP (e.g., country, state); both exposure and outcome are measured at the group level, not the individual

42
New cards

Ecological study — metric

Measures prevalence and incidence; useful for rare diseases and hypothesis generation

43
New cards

Ecologic fallacy

Primary bias of ecological studies; associations observed at the group level do not necessarily hold true for individuals

44
New cards

Ecological study — strengths

Inexpensive; uses routinely collected data; excellent for hypothesis generation; useful for inherently group-level questions

45
New cards

Ecological study — limitations

Ecologic fallacy; limited ability to adjust for confounders; can mask individual-level relationships

46
New cards

Cross-sectional study

"Snapshot" study; individuals defined by exposure and disease status at a single point in time; measures prevalence

47
New cards

Cross-sectional study — metric

Exposure prevalence in relation to disease prevalence; can estimate risk via prevalence ratios

48
New cards

Cross-sectional study — strengths

Quick and inexpensive; high generalizability; temporal issues less concerning for long-term inalterable exposures (e.g., genetics)

49
New cards

Cross-sectional study — temporal sequence limitation

Cannot determine if exposure preceded the outcome (e.g., does inactivity cause CHD, or does CHD cause inactivity?)

50
New cards

Survivorship bias (cross-sectional)

Only captures those who survived long enough to be in the study; ignores those who died or left due to the outcome

51
New cards

Cohort study — definition

Individuals defined by exposure status and followed forward in time to see if they develop the outcome; must NOT have outcome at enrollment

52
New cards

Cohort study — metrics

Estimates incidence, risk ratios (RR), and risk differences (RD)

53
New cards

Cohort study — strengths

Excellent for establishing temporal sequence; can calculate true risk; best observational design

54
New cards

Cohort study — limitations

Expensive; requires large samples and long follow-up; loss to follow-up can undermine validity; inefficient for rare diseases

55
New cards

Case-control study — definition

Individuals defined by outcome status (cases have disease, controls do not); past exposures are then compared between groups

56
New cards

Case-control study — metric

Can ONLY calculate odds ratios (OR); cannot calculate absolute risk or incidence

57
New cards

Case-control study — strengths

Efficient for rare diseases or long latency; useful when exposure data is expensive or difficult to obtain

58
New cards

Case-control study — limitations

Highly susceptible to recall bias and selection bias; limited to one outcome; inefficient for rare exposures

59
New cards

Experimental study (clinical trial) — definition

Investigators actively assign individuals to groups (e.g., treatment vs. placebo) and follow them to measure outcome incidence

60
New cards

Experimental study — strengths

Gold standard for evidence; randomization ensures group similarity at baseline and balances measured and unmeasured confounders

61
New cards

Experimental study — limitations

Expensive and resource-heavy; requires long follow-up; ethical concerns if risks/benefits not yet well understood

62
New cards

Causal (directed) path — DAG

All arrows point away from exposure toward outcome (e.g., E→M→D); represents the effect you are trying to estimate

63
New cards

Non-causal (backdoor) path — DAG

Contains at least one arrow pointing "the wrong way" (e.g., E←C→D); represents potential confounding/bias

64
New cards

Open path — DAG

Association can flow between variables; open non-causal paths represent bias that must be controlled

65
New cards

Closed (blocked) path — DAG

Association cannot flow through the path; naturally blocked by a collider

66
New cards

Collider — definition

A variable that is a common "child" of two variables on the same path; arrows from two different variables collide at this node (e.g., E→C←Z)

67
New cards

Collider — rule

Do NOT adjust for colliders; adjusting opens a previously closed path and creates a spurious association between its parent variables

68
New cards

Blocking an open path — DAG

Adjust for (condition on) a non-collider variable along that path

69
New cards

Minimally sufficient adjustment set

The smallest set of variables you must condition on to block all open non-causal paths while keeping causal paths open

70
New cards

Mediator — DAG adjustment rule

Generally do NOT adjust for mediators (E→M→D) unless estimating the direct effect rather than the total effect

71
New cards

Why use multivariable regression?

To control for confounding, identify independent associations, or evaluate interaction (effect measure modification)

72
New cards

Interaction term — when to use

When the effect of the main exposure differs across levels of another variable (e.g., effect of smoking on death differs by sex)

73
New cards

Interpreting main effect when interaction is present

Do not say "adjusting for"; interpret as the effect of the exposure "when the other variable = 0" (the referent group)

74
New cards

Poisson regression — when to use

Counts (e.g., number of clinic visits) or rates (e.g., mortality rates)

75
New cards

Poisson regression — offset term

log(person-time); adjusts for unequal follow-up time across individuals, standardizing results into a rate

76
New cards

Poisson regression — coefficient interpretation

Incidence rate ratio; "for every 1-unit increase in [predictor], the incidence rate changes by a factor of X"

77
New cards

Poisson regression — assumptions

Mean equals variance; independence

78
New cards

Survival analysis — key distinction

Considers WHEN an event happens, not just IF it happens

79
New cards

Kaplan-Meier

Non-parametric method that re-estimates survival probability at every event time

80
New cards

Median survival time

The time point on a KM curve where survival probability = 0.50 (50%)

81
New cards

Cox proportional hazards model

Estimates the hazard — instantaneous risk of the event occurring at time t given survival up to that point

82
New cards

Hazard ratio (HR)

Represents the relative risk of the event occurring at any given moment between two groups

83
New cards

Proportional hazard assumption

The hazard ratio between groups must remain constant over the entire follow-up period; underlying hazards can vary but their ratio stays the same

84
New cards

Right censoring

Most common type; participant exits before the outcome occurs (lost to follow-up or administrative censoring at study end)

85
New cards

Left censoring

The event occurred before the observation period began

86
New cards

Informative censoring

Reason for dropping out is related to the outcome (e.g., too sick to continue); introduces bias; ideally want non-informative censoring

87
New cards

Why use KM/Cox over logistic regression?

Survival methods account for timing of events and use data from censored individuals rather than discarding it

88
New cards

Loss to follow-up bias

Type of selection bias; participants who leave the study differ systematically from those who remain, distorting results

89
New cards

Non-response bias

Type of selection bias; those who do not respond to a survey/study differ from those who do

90
New cards

Healthy worker effect

Type of selection bias; workers are healthier than the general non-working population, making occupational exposures appear less harmful

91
New cards

Berkson's bias

Hospital patient bias; individuals in the hospital differ from the general population, distorting case-control studies using hospital controls

92
New cards

Recall bias

Type of information bias; cases remember past exposures differently than controls, distorting exposure estimates

93
New cards

Interviewer bias

Type of information bias; interviewer probes cases and controls unequally, influencing reported exposures

94
New cards

Confounding

A third variable distorts the exposure-disease relationship; it is associated with both the exposure and the outcome and is not on the causal pathway

95
New cards

Which of the following are common purposes of multivariate analyses?

Control confounding; Estimate associations adjusted for multiple covariates/predictors; Identify associations that are independent of other variables

96
New cards

In practice, how can the status of a path be changed from open to closed?

Restriction, stratification, multivariate regression, matching

97
New cards

Hair loss predicts disease A. Hair loss is a marker for high hormone levels, which are causally related to disease A. If you look at a sample all with the same hormone level, what would you expect to see?

Hair loss is not a predictor of disease A in the sample

98
New cards

A cohort study investigates smoking (4 categories: non-smokers; <1 pack/week; 2 packs/week; >2 packs/week) and colon cancer. Which statement about estimating strength of association is TRUE?

The most logical approach would be to calculate the relative risk of each of the smoking groups using non-smokers as a reference group

99
New cards

In order to estimate the excess risk caused by a risk factor, which measure of association should be calculated?

Risk difference

100
New cards

Case control studies are most useful in the following scenarios EXCEPT:

When the disease is rare

When the exposure is rare

When the disease has a long latency period

When little is known about the disease

When it is difficult or expensive to obtain exposure data

None of the above

When the exposure is rare