GPH 380E Foundations of Biostatistics and Epidemiology - Notes

Absolute Risk

  • Absolute Risk (AR): The incidence of disease in a population. It is the probability that a person in a given population will develop the disease over a specified period.
  • Use: Useful in clinical decision-making or policy-making to understand burden, planning, and baseline levels.
  • Limitation: AR does not imply causation and provides no direct comparison between exposed and unexposed groups.
  • Example: 83% of people who ate egg salad got sick vs. 30% who didn’t eat it.
  • Absolute risk for exposed: AR_{exposed} = 0.83 = 83\%.

Comparing Risks

  • To determine if exposure is associated with disease, compare:
    • Risk Ratio (Relative Risk, RR)
    • Risk Difference (Excess Risk, i.e., Absolute Risk Increase)
  • Attack Rate (AR): Quantifies risk in the exposed or at-risk population:
    • Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100

Difference vs. Ratio: Example data (Table 12.3)

  • Community A:
    • Exposed risk = 40%
    • Unexposed risk = 10%
    • Relative Risk (RR) = 4.0
    • Excess risk (Risk Difference) = 30%
  • Community B:
    • Exposed risk = 90%
    • Unexposed risk = 60%
    • Relative Risk (RR) = 1.5
    • Excess risk (Risk Difference) = 30%

Difference in interpretation of Difference vs. Ratio

  • Risk Difference (Absolute Risk Increase):
    • Tells how many more people per 100 got the disease because of exposure.
    • In both communities, 30 extra people per 100 exposed individuals got sick compared to the unexposed.
  • Relative Risk (Strength of Association):
    • Tells how much more likely exposed individuals are to develop the disease relative to the unexposed baseline.
    • Community A: Exposure increases risk 4-fold (RR = 4.0).
    • Community B: Risk increases 1.5-fold (RR = 1.5).

What story do these numbers tell?

  • Community A:
    • Background risk among the unexposed is low (10%).
    • Jump to 40% among exposed is a dramatic relative increase; RR = 4.0 indicates a strong association.
  • Community B:
    • Baseline risk among the unexposed is already high (60%).
    • Rise to 90% among exposed is a smaller relative jump; RR = 1.5 suggests a weaker association relative to the baseline.

Key Takeaway

  • Risk Difference vs Relative Risk:
    • Risk Difference: absolute burden (how many more cases per population unit).
    • Relative Risk: strength of association (how much more likely the exposed group is to be diseased).
  • Context matters:
    • Public health planning often emphasizes Risk Difference to estimate burden.
    • Causal inference or scientific understanding often emphasizes Relative Risk to judge strength of association.

Interpreting Relative Risk (RR) (Table 12.4)

  • If RR = 1:
    • Risk in exposed = risk in unexposed; no association.
  • If RR > 1:
    • Risk in exposed greater than risk in unexposed; positive association; possibly causal.
  • If RR < 1:
    • Risk in exposed less than risk in unexposed; negative association; possibly protective.

Calculating Relative Risk in Cohort Studies (Table 12.5)

  • Study design: Cohort
  • Setup (2x2 table):
    • Exposed vs Not exposed
    • Disease yes vs Disease no
  • Entries: a, b, c, d where:
    • a = disease in exposed
    • b = no disease in exposed
    • c = disease in nonexposed
    • d = no disease in nonexposed
  • Incidence in exposed: Incidence_{exposed} = \frac{a}{a+b}
  • Incidence in nonexposed: Incidence_{unexposed} = \frac{c}{c+d}
  • Relative Risk: RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}}
  • Alternative representation: RR = \frac{\text{Incidence in exposed}}{\text{Incidence in unexposed}}

Another view of the 2x2 in words

  • In a cohort table: Exposed column shows incidence in exposed (a/(a+b)); Not exposed column shows incidence in unexposed (c/(c+d)); RR is the ratio of these two incidences.

Odds Ratio (OR)

  • What is the Odds Ratio (OR)?
    • OR is a measure of association between exposure and disease.
    • It compares the odds of exposure among cases to the odds of exposure among controls.
    • Can be used in both cohort and case-control studies.

Why use Odds Ratio (OR) instead of Relative Risk (RR)?

  • RR requires incidence data in exposed and unexposed groups, which is straightforward in cohort studies.
  • In case-control studies, we start with diseased (cases) and non-diseased (controls), so incidence rates are unknown.
  • Therefore, the odds ratio is the best available measure of association in case-control designs.

Calculating Odds Ratios (OR)

  • Standard 2×2 table (Exposed vs Not Exposed; Disease Yes vs No):
    • Entries: a (exposed with disease), b (exposed without disease), c (unexposed with disease), d (unexposed without disease)
  • OR formula: OR = \frac{a \times d}{b \times c}

Interpreting OR

  • OR = 1 (\rightarrow) No association
  • OR > 1 (\rightarrow) Positive association (potential risk factor)
  • OR < 1 (\rightarrow) Negative association (potential protective factor)

When is OR a good estimate of RR?

  • OR approximates RR well only when the disease is rare (low incidence).
  • This is known as the rarity assumption.
  • Conditions for the rarity assumption:
    • Cases represent all people with the disease in the population.
    • Controls represent all people without the disease in the population.
    • The disease is infrequent in the population.

Summary of key equations

  • Absolute Risk (AR) in a population: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
  • Risk Difference (Absolute Risk Increase): ARD = AR{\text{exposed}} - AR{\text{unexposed}}
  • Relative Risk (RR): RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}}
  • Attack Rate (AR) (definition): as above in context of incidence in a population at risk
  • Odds Ratio (OR): OR = \frac{a \times d}{b \times c}
  • Interpretation of RR: RR = 1 \Rightarrow \text{no association}; RR > 1 \Rightarrow \text{positive association}; RR < 1 \Rightarrow \text{negative association}
  • Interpretation of OR: OR = 1 \Rightarrow \text{no association}; OR > 1 \Rightarrow \text{positive association}; OR < 1 \Rightarrow \text{negative association}

Practical notes for exams

  • Distinguish between Absolute Risk (AR) and Relative Risk (RR): AR gives burden; RR gives strength of association.
  • Use Risk Difference when planning public health burden or evaluating the number of additional cases due to exposure.
  • Use RR to discuss how much more likely exposure leads to disease, especially when baseline risk is informative.
  • Use OR in case-control studies or whenever full incidence data are not available; remember OR approximates RR only when disease is rare.
  • Always consider the study design when choosing measures of association (cohort vs case-control).

Key Terms and Definitions

  • Absolute Risk (AR): The probability that an individual in a given population will develop a disease over a specified period. It indicates the incidence of disease.
  • Risk Ratio (Relative Risk, RR): A measure of the strength of association between an exposure and a disease, indicating how many times more likely exposed individuals are to develop the disease compared to unexposed individuals.
  • Risk Difference (Absolute Risk Increase, ARD): The absolute difference in risk between exposed and unexposed groups, indicating the number of additional cases of disease attributable to the exposure per unit of population.
  • Attack Rate (AR): A measure of the proportion of a population that develops a disease during an outbreak, often used to quantify risk in an exposed or at-risk group.
  • Odds Ratio (OR): A measure of association that compares the odds of exposure among cases to the odds of exposure among controls. It is often used in case-control studies.
  • Rarity Assumption: The condition under which the Odds Ratio (OR) provides a good approximation of the Relative Risk (RR), specifically when the disease incidence is low.
  • Cohort Study: An observational study design where groups of exposed and unexposed individuals are followed over time to compare disease incidence.
  • Case-Control Study: An observational study design where individuals with a disease (cases) are compared to individuals without the disease (controls) to examine past exposures.

Equations and Their Usage

  • Absolute Risk (AR):
    • Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
    • Usage: To understand the overall burden of a disease in a population, for clinical decision-making, and public health planning.
  • Attack Rate (AR):
    • Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
    • Usage: Quantifies risk in a specific exposed or at-risk population, particularly useful during outbreaks to determine the proportion of people who became ill.
  • Risk Difference (Absolute Risk Increase, ARD):
    • Formula: ARD = AR{\text{exposed}} - AR{\text{unexposed}}
    • Usage: To determine the public health burden, indicating how many extra cases per population unit are directly due to the exposure. Essential for prioritizing interventions.
  • Relative Risk (RR):
    • Formula: RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}} = \frac{\text{Incidence in exposed}}{\text{Incidence in unexposed}}
    • Usage: To assess the strength of association between an exposure and an outcome, indicating how much more likely the exposed group is to develop the disease. Crucial for causal inference.
  • Odds Ratio (OR):
    • Formula: OR = \frac{a \times d}{b \times c}
      -