GPH 380E Foundations of Biostatistics and Epidemiology - Notes
Absolute Risk
- Absolute Risk (AR): The incidence of disease in a population. It is the probability that a person in a given population will develop the disease over a specified period.
- Use: Useful in clinical decision-making or policy-making to understand burden, planning, and baseline levels.
- Limitation: AR does not imply causation and provides no direct comparison between exposed and unexposed groups.
- Example: 83% of people who ate egg salad got sick vs. 30% who didn’t eat it.
- Absolute risk for exposed: AR_{exposed} = 0.83 = 83\%.
Comparing Risks
- To determine if exposure is associated with disease, compare:
- Risk Ratio (Relative Risk, RR)
- Risk Difference (Excess Risk, i.e., Absolute Risk Increase)
- Attack Rate (AR): Quantifies risk in the exposed or at-risk population:
- Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
Difference vs. Ratio: Example data (Table 12.3)
- Community A:
- Exposed risk = 40%
- Unexposed risk = 10%
- Relative Risk (RR) = 4.0
- Excess risk (Risk Difference) = 30%
- Community B:
- Exposed risk = 90%
- Unexposed risk = 60%
- Relative Risk (RR) = 1.5
- Excess risk (Risk Difference) = 30%
Difference in interpretation of Difference vs. Ratio
- Risk Difference (Absolute Risk Increase):
- Tells how many more people per 100 got the disease because of exposure.
- In both communities, 30 extra people per 100 exposed individuals got sick compared to the unexposed.
- Relative Risk (Strength of Association):
- Tells how much more likely exposed individuals are to develop the disease relative to the unexposed baseline.
- Community A: Exposure increases risk 4-fold (RR = 4.0).
- Community B: Risk increases 1.5-fold (RR = 1.5).
What story do these numbers tell?
- Community A:
- Background risk among the unexposed is low (10%).
- Jump to 40% among exposed is a dramatic relative increase; RR = 4.0 indicates a strong association.
- Community B:
- Baseline risk among the unexposed is already high (60%).
- Rise to 90% among exposed is a smaller relative jump; RR = 1.5 suggests a weaker association relative to the baseline.
Key Takeaway
- Risk Difference vs Relative Risk:
- Risk Difference: absolute burden (how many more cases per population unit).
- Relative Risk: strength of association (how much more likely the exposed group is to be diseased).
- Context matters:
- Public health planning often emphasizes Risk Difference to estimate burden.
- Causal inference or scientific understanding often emphasizes Relative Risk to judge strength of association.
Interpreting Relative Risk (RR) (Table 12.4)
- If RR = 1:
- Risk in exposed = risk in unexposed; no association.
- If RR > 1:
- Risk in exposed greater than risk in unexposed; positive association; possibly causal.
- If RR < 1:
- Risk in exposed less than risk in unexposed; negative association; possibly protective.
Calculating Relative Risk in Cohort Studies (Table 12.5)
- Study design: Cohort
- Setup (2x2 table):
- Exposed vs Not exposed
- Disease yes vs Disease no
- Entries: a, b, c, d where:
- a = disease in exposed
- b = no disease in exposed
- c = disease in nonexposed
- d = no disease in nonexposed
- Incidence in exposed: Incidence_{exposed} = \frac{a}{a+b}
- Incidence in nonexposed: Incidence_{unexposed} = \frac{c}{c+d}
- Relative Risk: RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}}
- Alternative representation: RR = \frac{\text{Incidence in exposed}}{\text{Incidence in unexposed}}
Another view of the 2x2 in words
- In a cohort table: Exposed column shows incidence in exposed (a/(a+b)); Not exposed column shows incidence in unexposed (c/(c+d)); RR is the ratio of these two incidences.
Odds Ratio (OR)
- What is the Odds Ratio (OR)?
- OR is a measure of association between exposure and disease.
- It compares the odds of exposure among cases to the odds of exposure among controls.
- Can be used in both cohort and case-control studies.
Why use Odds Ratio (OR) instead of Relative Risk (RR)?
- RR requires incidence data in exposed and unexposed groups, which is straightforward in cohort studies.
- In case-control studies, we start with diseased (cases) and non-diseased (controls), so incidence rates are unknown.
- Therefore, the odds ratio is the best available measure of association in case-control designs.
Calculating Odds Ratios (OR)
- Standard 2×2 table (Exposed vs Not Exposed; Disease Yes vs No):
- Entries: a (exposed with disease), b (exposed without disease), c (unexposed with disease), d (unexposed without disease)
- OR formula: OR = \frac{a \times d}{b \times c}
Interpreting OR
- OR = 1 (\rightarrow) No association
- OR > 1 (\rightarrow) Positive association (potential risk factor)
- OR < 1 (\rightarrow) Negative association (potential protective factor)
When is OR a good estimate of RR?
- OR approximates RR well only when the disease is rare (low incidence).
- This is known as the rarity assumption.
- Conditions for the rarity assumption:
- Cases represent all people with the disease in the population.
- Controls represent all people without the disease in the population.
- The disease is infrequent in the population.
Summary of key equations
- Absolute Risk (AR) in a population: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
- Risk Difference (Absolute Risk Increase): ARD = AR{\text{exposed}} - AR{\text{unexposed}}
- Relative Risk (RR): RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}}
- Attack Rate (AR) (definition): as above in context of incidence in a population at risk
- Odds Ratio (OR): OR = \frac{a \times d}{b \times c}
- Interpretation of RR: RR = 1 \Rightarrow \text{no association}; RR > 1 \Rightarrow \text{positive association}; RR < 1 \Rightarrow \text{negative association}
- Interpretation of OR: OR = 1 \Rightarrow \text{no association}; OR > 1 \Rightarrow \text{positive association}; OR < 1 \Rightarrow \text{negative association}
Practical notes for exams
- Distinguish between Absolute Risk (AR) and Relative Risk (RR): AR gives burden; RR gives strength of association.
- Use Risk Difference when planning public health burden or evaluating the number of additional cases due to exposure.
- Use RR to discuss how much more likely exposure leads to disease, especially when baseline risk is informative.
- Use OR in case-control studies or whenever full incidence data are not available; remember OR approximates RR only when disease is rare.
- Always consider the study design when choosing measures of association (cohort vs case-control).
Key Terms and Definitions
- Absolute Risk (AR): The probability that an individual in a given population will develop a disease over a specified period. It indicates the incidence of disease.
- Risk Ratio (Relative Risk, RR): A measure of the strength of association between an exposure and a disease, indicating how many times more likely exposed individuals are to develop the disease compared to unexposed individuals.
- Risk Difference (Absolute Risk Increase, ARD): The absolute difference in risk between exposed and unexposed groups, indicating the number of additional cases of disease attributable to the exposure per unit of population.
- Attack Rate (AR): A measure of the proportion of a population that develops a disease during an outbreak, often used to quantify risk in an exposed or at-risk group.
- Odds Ratio (OR): A measure of association that compares the odds of exposure among cases to the odds of exposure among controls. It is often used in case-control studies.
- Rarity Assumption: The condition under which the Odds Ratio (OR) provides a good approximation of the Relative Risk (RR), specifically when the disease incidence is low.
- Cohort Study: An observational study design where groups of exposed and unexposed individuals are followed over time to compare disease incidence.
- Case-Control Study: An observational study design where individuals with a disease (cases) are compared to individuals without the disease (controls) to examine past exposures.
Equations and Their Usage
- Absolute Risk (AR):
- Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
- Usage: To understand the overall burden of a disease in a population, for clinical decision-making, and public health planning.
- Attack Rate (AR):
- Formula: AR = \frac{\text{Number of new cases}}{\text{Population at risk}} \times 100
- Usage: Quantifies risk in a specific exposed or at-risk population, particularly useful during outbreaks to determine the proportion of people who became ill.
- Risk Difference (Absolute Risk Increase, ARD):
- Formula: ARD = AR{\text{exposed}} - AR{\text{unexposed}}
- Usage: To determine the public health burden, indicating how many extra cases per population unit are directly due to the exposure. Essential for prioritizing interventions.
- Relative Risk (RR):
- Formula: RR = \frac{\dfrac{a}{a+b}}{\dfrac{c}{c+d}} = \frac{\text{Incidence in exposed}}{\text{Incidence in unexposed}}
- Usage: To assess the strength of association between an exposure and an outcome, indicating how much more likely the exposed group is to develop the disease. Crucial for causal inference.
- Odds Ratio (OR):
- Formula: OR = \frac{a \times d}{b \times c}
-