Biostats/Epi II Final

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/113

There's no tags or description

Looks like no tags are added yet.

Last updated 9:25 PM on 4/8/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

114 Terms

New cards

Linear Regression — outcome type

Continuous (Gaussian); uses identity link

New cards

Linear Risk — outcome type

Binomial; uses identity link

New cards

Log Risk — outcome type

Binomial; uses log link

New cards

Logistic Regression — outcome type

Binomial; uses logit link

New cards

Poisson Regression — outcome type

Count or rate; uses log link

New cards

β₀ — Linear Regression

Expected value of Y for the reference group (e.g., males, age 0, no high school education)

New cards

β₀ — Linear Risk

Risk (probability) of the outcome in the reference group

New cards

β₀ — Log Risk

Log risk of outcome in reference group; exponentiated = risk for reference group

New cards

β₀ — Logistic Regression

Log odds of outcome in reference group; exponentiated = odds for reference group

New cards

β₀ — Poisson Regression

Log incidence rate in reference group; exponentiated = incidence rate for reference group

New cards

β₀ — Cox Proportional Hazard

Log hazard rate for reference group; exponentiated = hazard rate for reference group

New cards

β₁ — Linear Regression

For every 1-unit increase in age, E(Y) changes by β₁, conditioning on other covariates

New cards

β₁ — Linear Risk

Risk difference: estimated change in risk for every 1-year increase in age, adjusting for other covariates

New cards

β₁ — Log Risk

For every 1-unit increase in age, the log risk increases by β₁, conditioning on other covariates

New cards

β₁ — Logistic Regression

For every 1-unit increase in age, the log odds ratio increases by β₁, conditioning on other covariates

New cards

β₁ — Poisson Regression

For every 1-unit increase in age, the incidence rate increases by a factor of β₁, conditioning on other covariates

New cards

β₁ — Cox Proportional Hazard

For every 1-unit increase in age, the hazard increases by a factor of β₁, conditioning on other covariates

New cards

β₂ — Linear Regression

Change in E(Y) when sex changes from referent (male) to non-referent (female), conditioning on age and education

New cards

β₂ — Linear Risk

Risk difference comparing female to male, adjusting for age and education

New cards

β₂ — Log Risk

Log risk increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

New cards

β₂ — Logistic Regression

Log odds increases/decreases by β₂ comparing female to male (referent), conditioning on age and education

New cards

β₂ — Poisson Regression

Incidence rate increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

New cards

β₂ — Cox Proportional Hazard

Hazard increases/decreases by β₂ when sex changes from male (referent) to female, conditioning on age and education

New cards

β₃ — Linear Regression

Change in E(Y) from no HS (referent) to HS or college grad, conditioning on age and sex

New cards

β₃ — Linear Risk

Risk difference comparing HS or college grad to no HS (referent), conditioning on sex and age

New cards

β₃ — Log Risk

Log risk increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

New cards

β₃ — Logistic Regression

Log odds increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

New cards

β₃ — Poisson Regression

Incidence rate increases/decreases by β₃ from no HS (referent) to HS or college grad, conditioning on age and sex

New cards

β₄ — Linear Regression (interaction)

Incremental change in the relationship between age and outcome associated with being female

New cards

β₄ — Linear Risk (interaction)

Incremental change in risk difference in the relationship between age and outcome associated with being female

New cards

β₄ — Log Risk (interaction)

Change in log risk in the relationship between age and outcome associated with being female

New cards

β₄ — Logistic Regression (interaction)

Change in log odds in the relationship between age and outcome associated with being female

New cards

β₄ — Poisson Regression (interaction)

Change in log rate in relationship between age and outcome associated with being female; exp(β₄) = incremental change in IRR for age associated with being female

New cards

β₁ with interaction term (age × sex)

Change in outcome per 1-unit increase in age when sex is in the referent group (male = 0)

New cards

Cohort + binary outcome → model

Log risk, linear risk, or logistic regression; can estimate risk, RR, and RD

New cards

Case-control + binary outcome → model

Logistic regression only; can estimate odds only

New cards

Cohort + count or rate outcome → model

Poisson regression; can estimate incidence rates and IRR

New cards

Cohort + binary rare events → model

Poisson regression; can estimate risk ratio (RR)

New cards

Case report

Detailed descriptive report on a single individual; focuses on new or unusual symptoms; used for hypothesis generation

New cards

Case series

Detailed descriptive report on a single group of individuals defined by a specific disease or outcome

New cards

Ecological study — unit of analysis

The GROUP (e.g., country, state); both exposure and outcome are measured at the group level, not the individual

New cards

Ecological study — metric

Measures prevalence and incidence; useful for rare diseases and hypothesis generation

New cards

Ecologic fallacy

Primary bias of ecological studies; associations observed at the group level do not necessarily hold true for individuals

New cards

Ecological study — strengths

Inexpensive; uses routinely collected data; excellent for hypothesis generation; useful for inherently group-level questions

New cards

Ecological study — limitations

Ecologic fallacy; limited ability to adjust for confounders; can mask individual-level relationships

New cards

Cross-sectional study

"Snapshot" study; individuals defined by exposure and disease status at a single point in time; measures prevalence

New cards

Cross-sectional study — metric

Exposure prevalence in relation to disease prevalence; can estimate risk via prevalence ratios

New cards

Cross-sectional study — strengths

Quick and inexpensive; high generalizability; temporal issues less concerning for long-term inalterable exposures (e.g., genetics)

New cards

Cross-sectional study — temporal sequence limitation

Cannot determine if exposure preceded the outcome (e.g., does inactivity cause CHD, or does CHD cause inactivity?)

New cards

Survivorship bias (cross-sectional)

Only captures those who survived long enough to be in the study; ignores those who died or left due to the outcome

New cards

Cohort study — definition

Individuals defined by exposure status and followed forward in time to see if they develop the outcome; must NOT have outcome at enrollment

New cards

Cohort study — metrics

Estimates incidence, risk ratios (RR), and risk differences (RD)

New cards

Cohort study — strengths

Excellent for establishing temporal sequence; can calculate true risk; best observational design

New cards

Cohort study — limitations

Expensive; requires large samples and long follow-up; loss to follow-up can undermine validity; inefficient for rare diseases

New cards

Case-control study — definition

Individuals defined by outcome status (cases have disease, controls do not); past exposures are then compared between groups

New cards

Case-control study — metric

Can ONLY calculate odds ratios (OR); cannot calculate absolute risk or incidence

New cards

Case-control study — strengths

Efficient for rare diseases or long latency; useful when exposure data is expensive or difficult to obtain

New cards

Case-control study — limitations

Highly susceptible to recall bias and selection bias; limited to one outcome; inefficient for rare exposures

New cards

Experimental study (clinical trial) — definition

Investigators actively assign individuals to groups (e.g., treatment vs. placebo) and follow them to measure outcome incidence

New cards

Experimental study — strengths

Gold standard for evidence; randomization ensures group similarity at baseline and balances measured and unmeasured confounders

New cards

Experimental study — limitations

Expensive and resource-heavy; requires long follow-up; ethical concerns if risks/benefits not yet well understood

New cards

Causal (directed) path — DAG

All arrows point away from exposure toward outcome (e.g., E→M→D); represents the effect you are trying to estimate

New cards

Non-causal (backdoor) path — DAG

Contains at least one arrow pointing "the wrong way" (e.g., E←C→D); represents potential confounding/bias

New cards

Open path — DAG

Association can flow between variables; open non-causal paths represent bias that must be controlled

New cards

Closed (blocked) path — DAG

Association cannot flow through the path; naturally blocked by a collider

New cards

Collider — definition

A variable that is a common "child" of two variables on the same path; arrows from two different variables collide at this node (e.g., E→C←Z)

New cards

Collider — rule

Do NOT adjust for colliders; adjusting opens a previously closed path and creates a spurious association between its parent variables

New cards

Blocking an open path — DAG

Adjust for (condition on) a non-collider variable along that path

New cards

Minimally sufficient adjustment set

The smallest set of variables you must condition on to block all open non-causal paths while keeping causal paths open

New cards

Mediator — DAG adjustment rule

Generally do NOT adjust for mediators (E→M→D) unless estimating the direct effect rather than the total effect

New cards

Why use multivariable regression?

To control for confounding, identify independent associations, or evaluate interaction (effect measure modification)

New cards

Interaction term — when to use

When the effect of the main exposure differs across levels of another variable (e.g., effect of smoking on death differs by sex)

New cards

Interpreting main effect when interaction is present

Do not say "adjusting for"; interpret as the effect of the exposure "when the other variable = 0" (the referent group)

New cards

Poisson regression — when to use

Counts (e.g., number of clinic visits) or rates (e.g., mortality rates)

New cards

Poisson regression — offset term

log(person-time); adjusts for unequal follow-up time across individuals, standardizing results into a rate

New cards

Poisson regression — coefficient interpretation

Incidence rate ratio; "for every 1-unit increase in [predictor], the incidence rate changes by a factor of X"

New cards

Poisson regression — assumptions

Mean equals variance; independence

New cards

Survival analysis — key distinction

Considers WHEN an event happens, not just IF it happens

New cards

Kaplan-Meier

Non-parametric method that re-estimates survival probability at every event time

New cards

Median survival time

The time point on a KM curve where survival probability = 0.50 (50%)

New cards

Cox proportional hazards model

Estimates the hazard — instantaneous risk of the event occurring at time t given survival up to that point

New cards

Hazard ratio (HR)

Represents the relative risk of the event occurring at any given moment between two groups

New cards

Proportional hazard assumption

The hazard ratio between groups must remain constant over the entire follow-up period; underlying hazards can vary but their ratio stays the same

New cards

Right censoring

Most common type; participant exits before the outcome occurs (lost to follow-up or administrative censoring at study end)

New cards

Left censoring

The event occurred before the observation period began

New cards

Informative censoring

Reason for dropping out is related to the outcome (e.g., too sick to continue); introduces bias; ideally want non-informative censoring

New cards

Why use KM/Cox over logistic regression?

Survival methods account for timing of events and use data from censored individuals rather than discarding it

New cards

Loss to follow-up bias

Type of selection bias; participants who leave the study differ systematically from those who remain, distorting results

New cards

Non-response bias

Type of selection bias; those who do not respond to a survey/study differ from those who do

New cards

Healthy worker effect

Type of selection bias; workers are healthier than the general non-working population, making occupational exposures appear less harmful

New cards

Berkson's bias

Hospital patient bias; individuals in the hospital differ from the general population, distorting case-control studies using hospital controls

New cards

Recall bias

Type of information bias; cases remember past exposures differently than controls, distorting exposure estimates

New cards

Interviewer bias

Type of information bias; interviewer probes cases and controls unequally, influencing reported exposures

New cards

Confounding

A third variable distorts the exposure-disease relationship; it is associated with both the exposure and the outcome and is not on the causal pathway

New cards

Which of the following are common purposes of multivariate analyses?

Control confounding; Estimate associations adjusted for multiple covariates/predictors; Identify associations that are independent of other variables

New cards

In practice, how can the status of a path be changed from open to closed?

Restriction, stratification, multivariate regression, matching

New cards

Hair loss predicts disease A. Hair loss is a marker for high hormone levels, which are causally related to disease A. If you look at a sample all with the same hormone level, what would you expect to see?

Hair loss is not a predictor of disease A in the sample

New cards

A cohort study investigates smoking (4 categories: non-smokers; <1 pack/week; 2 packs/week; >2 packs/week) and colon cancer. Which statement about estimating strength of association is TRUE?

The most logical approach would be to calculate the relative risk of each of the smoking groups using non-smokers as a reference group

New cards

In order to estimate the excess risk caused by a risk factor, which measure of association should be calculated?

Risk difference

100

New cards

Case control studies are most useful in the following scenarios EXCEPT:

When the disease is rare

When the exposure is rare

When the disease has a long latency period

When little is known about the disease

When it is difficult or expensive to obtain exposure data

None of the above

When the exposure is rare