Notes on Causality, Evidence Hierarchy, and EBM
Causality, Evidence, and the Evidence-Based Medicine (EBM) Hierarchy
- Topic context: examining causality and evidence in medicine, and how EBM attempts to define the best way to establish causal claims.
- Key premise: evidence-based medicine promotes a specific methodological relationship with evidence, raising questions about its universality across disciplines.
- Barrowman (2014) point: some causal relationships may produce no measurable correlation at all, so correlation is not necessary for causation.
Causality and Non-Correlation: Illustrative Examples
- Classic view: causality is often linked to correlation (covariation).
- Barrowman’s provocative claim: there are causal relationships without measurable correlation in some contexts.
- Example proposed in class (causation without correlation): burning fuel (oil) keeps a house warm, yet indoor-temperature changes may show little correlation with the amount of oil burned if a regulating mechanism (e.g., a thermostat) holds the system in dynamic equilibrium (see the simulation sketch after this list).
- This highlights how causal relationships can operate via mechanisms that keep a system at a stable level despite changes in a driver.
- Buffer solution example (Ilona’s suggestion): adding acid to a buffer triggers chemical reactions (causal interactions) but may leave the overall pH essentially unchanged because of dynamic equilibrium (Le Châtelier’s principle).
- Emphasizes that causal chains can exist even if an observable outcome (pH) remains constant.
- Application to biology: blood pH must be tightly regulated; large pH swings are harmful.
- Additional buffer example (Expii/ScienceRevisions UK): adding acid to a buffer sets off a chain of reactions with no net pH change, illustrating causal interactions held in check by equilibrium (a worked pH calculation also follows this list).
- Takeaway: Causation and correlation can diverge; causal processes can be masked by system dynamics.
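A minimal simulation sketch of the heating example (my own illustrative numbers, not from Barrowman or the lecture): fuel burned is what causally keeps the house warm, yet because a thermostat-style feedback loop pins the indoor temperature to its set point, fuel use ends up nearly uncorrelated with indoor temperature and strongly correlated with outdoor temperature instead (Python 3.10+ for statistics.correlation):

```python
import random
import statistics as stats

random.seed(1)

# Toy thermostat model (illustrative numbers): each hour the house loses heat in
# proportion to the indoor-outdoor gap, and the boiler burns just enough oil to
# replace that loss and nudge the room back towards the set point.
SET_POINT = 21.0   # target indoor temperature (deg C)
LOSS_RATE = 0.8    # heat lost per degree of indoor-outdoor difference
GAIN = 0.5         # how aggressively the controller corrects deviations
NOISE_SD = 0.2     # small controller/sensor imperfection

indoor_temp, fuel_burned, outdoor_temp = [], [], []
temp_in = SET_POINT
for hour in range(2000):
    temp_out = random.uniform(-10.0, 15.0)           # the real driver of fuel use
    loss = LOSS_RATE * (temp_in - temp_out)
    burn = loss + GAIN * (SET_POINT - temp_in) + random.gauss(0, NOISE_SD)
    temp_in = temp_in - loss + burn                   # burning fuel causally sets the indoor temperature
    outdoor_temp.append(temp_out)
    fuel_burned.append(burn)
    indoor_temp.append(temp_in)

# The causal driver (fuel) barely correlates with the outcome it maintains (indoor
# temperature), because the feedback loop holds that outcome near-constant.
print("corr(fuel, indoor temp) :", round(stats.correlation(fuel_burned, indoor_temp), 3))
print("corr(fuel, outdoor temp):", round(stats.correlation(fuel_burned, outdoor_temp), 3))
```

Running this typically prints a near-zero correlation for fuel vs indoor temperature and a correlation close to -1 for fuel vs outdoor temperature, which is exactly the "causal processes masked by system dynamics" point above.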
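For the buffer case, a worked Henderson-Hasselbalch calculation (illustrative concentrations; assuming an acetic acid/acetate buffer with pKa ≈ 4.76) makes the same point numerically: the added acid reacts, but the pH barely moves, whereas the same dose in unbuffered water shifts the pH by several units:

```python
import math

def buffer_ph(pka, acid_mol, base_mol):
    """Henderson-Hasselbalch: pH = pKa + log10([A-]/[HA])."""
    return pka + math.log10(base_mol / acid_mol)

# Acetic acid / acetate buffer in 1 L, 0.10 mol of each species (illustrative numbers).
pka, ha, a_minus = 4.76, 0.10, 0.10
added_acid = 0.005  # mol of strong acid (e.g., HCl); it converts A- into HA

ph_before = buffer_ph(pka, ha, a_minus)
ph_after = buffer_ph(pka, ha + added_acid, a_minus - added_acid)

# Same dose of acid added to 1 L of pure water, for comparison.
water_before = 7.0
water_after = -math.log10(added_acid)

print(f"buffer:     {ph_before:.2f} -> {ph_after:.2f}  (delta {ph_after - ph_before:+.2f})")
print(f"pure water: {water_before:.2f} -> {water_after:.2f}  (delta {water_after - water_before:+.2f})")
```

The buffer's pH drops by only about 0.04 units while the unbuffered water drops by roughly 4.7 units: plenty of causal chemistry, almost no observable change in the outcome variable.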
Evidence-Based Medicine (EBM) and the Big Picture
- EBM foregrounds a particular methodological path to evidence: not just correlations or simple experiments, but randomized controlled trials (RCTs) and aggregations of RCTs.
- Bradford Hill criteria historically influenced causality discussions in medicine, but they have limitations.
- Question raised: Is RCT-based evidence the very best form of science for all fields?
- EBM vision: a hierarchy of evidence that places RCTs and systematic reviews at the top; clinical judgment is redefined as judgment grounded in the highest-quality available evidence.
The Evidence Hierarchy: The Pyramid
- The hierarchy (one widely used version) includes eight levels:
- 1) Systematic reviews & meta-analyses of RCTs
- 2) Randomized controlled trials (RCTs)
- 3) Controlled trials without randomization (quasi-experimental)
- 4) Single non-experimental studies (cases, correlations) – analytic
- 5) Systematic reviews of descriptive studies
- 6) Single descriptive study
- 7) Authority & expert statements
- 8) Mechanistic reasoning/explanations
- Note: there are several pyramid variants; some omit level 8 entirely, but mechanistic explanation (level 8) remains a key point of contention in what follows.
- Visual cue: the top (level 1) is the strongest form of evidence; the bottom levels (7-8) are expert opinion and mechanistic reasoning.
1) Systematic reviews & meta-analyses of RCTs
- Purpose: synthesize all RCTs on a treatment to obtain a broader, less biased estimate of efficacy (a pooling sketch follows this subsection).
- Rationale: aggregating data across trials should dilute the biases and random errors of individual studies.
- Still requires careful human interpretation: the power of a meta-analysis depends on the quality and heterogeneity of the included studies.
- Ioannidis and related debates: the strengths and limitations of meta-analysis, weighing its statistical power against its susceptibility to bias.
- Key caveat: even systematic reviews can propagate bias if the included studies are biased; quality control remains essential.
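As a rough sketch of what "pooling" means statistically, here is a generic fixed-effect, inverse-variance meta-analysis over made-up effect estimates (the trial names and numbers are hypothetical, not from any study discussed above):

```python
import math

# Hypothetical per-trial effect estimates (e.g., log risk ratios) and standard errors.
trials = [
    ("trial A", -0.30, 0.15),
    ("trial B", -0.10, 0.25),
    ("trial C", -0.45, 0.20),
    ("trial D",  0.05, 0.30),
]

# Fixed-effect inverse-variance pooling: each trial is weighted by 1 / SE^2,
# so large, precise trials dominate the pooled estimate.
weights = [1 / se ** 2 for _, _, se in trials]
pooled = sum(w * est for (_, est, _), w in zip(trials, weights)) / sum(weights)
pooled_se = math.sqrt(1 / sum(weights))

print(f"pooled effect = {pooled:.3f} "
      f"(95% CI {pooled - 1.96 * pooled_se:.3f} to {pooled + 1.96 * pooled_se:.3f})")
# Pooling narrows the confidence interval, but if the individual estimates are
# biased, the pooled estimate inherits (and can entrench) that bias.
```

Note how the pooled confidence interval is narrower than any single trial's; that is the statistical appeal, and also why biased inputs can yield a confidently wrong output.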
2) Randomized controlled trials (RCTs)
- Core idea: randomize subjects into treatment vs control groups to balance known and unknown confounders (see the simulation sketch at the end of this subsection).
- Randomization aims to ensure that post-intervention differences can be attributed to the treatment rather than to other factors.
- Blinding is typically used so that neither researchers nor participants know who is in which group, further reducing bias.
- Terminology note: “controlled” refers to comparison against a control group, with other conditions held as constant as possible apart from the intervention; RCTs are usually quantitative, with clearly defined outcomes.
- Historical touchpoints:
- Fisher’s work on randomization (agriculture) laid groundwork for modern randomization concepts.
- Bradford Hill popularized randomization in medical trials (1948); he emphasized it as a safeguard against bias.
- Context: RCTs are not without limitations; they require careful design to balance groups and to interpret results in light of real-world variability.
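A toy simulation (hypothetical numbers) of the balancing argument: an unmeasured confounder (baseline severity) drives the outcome, and if sicker patients preferentially receive the treatment, the naive group comparison is biased; random assignment balances severity across arms and recovers roughly the true effect:

```python
import random
import statistics as stats

random.seed(42)
TRUE_EFFECT = -2.0   # the treatment really lowers the outcome by 2 units
N = 10_000

def outcome(severity, treated):
    # Outcome worsens with baseline severity; treatment has a fixed causal effect.
    return 5.0 * severity + (TRUE_EFFECT if treated else 0.0) + random.gauss(0, 1)

def estimated_effect(assignment_rule):
    treated, control = [], []
    for _ in range(N):
        severity = random.random()          # unmeasured confounder in [0, 1]
        is_treated = assignment_rule(severity)
        (treated if is_treated else control).append(outcome(severity, is_treated))
    return stats.mean(treated) - stats.mean(control)

# Observational-style assignment: sicker patients are more likely to be treated.
print("self-selected:", round(estimated_effect(lambda sev: random.random() < sev), 2))
# Randomized assignment: a coin flip, independent of severity.
print("randomized:   ", round(estimated_effect(lambda sev: random.random() < 0.5), 2))
```

With these settings the self-selected comparison typically lands around -0.3 (confounding masks most of the benefit), while the randomized comparison lands near the true -2.0.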
3) Controlled trials without randomization (quasi-experimental)
- Used when randomization is unethical or impractical (e.g., social interventions, public health policies).
- Still aims at causal inference, but with greater potential bias than RCTs.
- Examples include some public health and education interventions where random assignment isn’t feasible.
4) Single non-experimental studies (analytic)
- Includes case studies, cohort and cross-sectional studies, and correlational analyses, sometimes with informal comparison groups or retrospective data.
- These are observational in nature and can be useful for exploring associations and generating hypotheses, but are more vulnerable to bias and confounding.
5) Systematic reviews of descriptive/qualitative studies
- Synthesize descriptive or qualitative data to draw broader inferences about phenomena.
- Descriptive studies can be large and global; they may capture broad patterns but are less tightly controlled.
6) Single descriptive study
- No intervention; no predefined comparisons.
- Can be long-term and provide useful information about effects over time, side-effects, genetic differences, etc.
- Serves as a basis for generating future experimental work.
7) Authority & expert statements (opinion, committee reports)
- Historically influential but criticized for over-reliance on expert opinion without empirical support.
- Not unique to medicine; similar issues arise in other fields (ecology, conservation biology, etc.).
- Recent debates highlight the need to ground expert opinions in higher levels of evidence when possible.
8) Mechanistic reasoning/explanations
- Involves causal mechanisms: what components are involved, what they do, and in what order.
- A mechanistic account explains the “how” and “why” behind observed outcomes, filling in the “black box.”
- Within the EBM hierarchy, mechanistic explanations are often treated as weaker evidence than RCTs because they are inferences about how an effect should arise, not observations of the effect itself.
- Debate: mechanistic explanations can be strong (informing trial design, interpretation, generalization) but can also mislead if the mechanism is context-specific or speculative.
- Example visualization: a speculative mechanism for ivermectin’s action against COVID-19; such diagrams illustrate mechanistic thinking but may lack robust evidence and can be retracted when data contradict them.
The Oxford EBM Table and the Place of Mechanisms
- The Oxford table (from the Oxford Centre for Evidence-Based Medicine, a major EBM centre) places mechanisms and case-based reasoning at the lower levels, while still acknowledging them as part of the broader evidence landscape.
- The bottom levels historically include case studies and mechanism-based reasoning; the placement of mechanisms in the pyramid has been debated.
- Acknowledgment: we will revisit mechanisms in Week 6; some argue that mechanisms should be integrated higher in the evidential hierarchy.
The Historical and Philosophical Context
- EBM has been framed as part of a broader shift between rationalist (mechanistic explanations) and empiricist (data-driven) approaches in medicine.
- Historical pendulum: medicine has swung between emphasis on explanatory mechanisms and emphasis on observation and data, sometimes depicted as a pendulum.
- Some readings argue that EBM represents a late-empiricist stage; others argue for a more integrated view that includes both mechanism and data.
- For further reading: discussions on rationalism vs empiricism, and the possible paradigm shifts in medicine.
- Core distinctions (Damman & Smart, 2019; summarized in lecture slides):
- Data: symbols (numbers, text, images, sounds) representing phenomena.
- Information: data in a specific context, processed for a purpose.
- Evidence: information used to evaluate explanations, models, hypotheses; comparisons across bodies of information against explanations.
- Knowledge: evidence-based understanding, justified by evaluation and community consensus; broader than facts.
- Important distinction: data, information, evidence, and knowledge are related but not identical; evidence in EBM is about the evaluation of explanations, often requiring comparison and synthesis across studies.
- Methods and evaluation interplay with evidence: methods gather data; evaluation judges how well data support explanations.
Why a Pyramid? Qualities of Evidence Across Levels
- The pyramid is often portrayed because high-quality evidence is rarer and more resource-intensive to obtain than lower-level evidence.
- Common questions: what makes evidence strong or weak? What counts as bias, and how does it influence the hierarchy?
- The answer is not fixed: criteria for strength and weakness can vary (quality, bias, relevance, generalizability).
- The hierarchy is a heuristic, not an absolute rule; context matters for what counts as appropriate evidence in a given field.
Bias, Causality, and the Role of Randomization
- Randomization is designed to mitigate confounding by balancing both known and unknown differences across groups (in expectation).
- Eliminative induction: a logical approach to ruling out alternatives to isolate the effect of interest (e.g., “not this, not that, only this”).
- Limits of randomization: perfect balance is never guaranteed in the real world; actual RCTs may deviate from ideal conditions.
- Some contexts may benefit more from case studies and mechanistic reasoning than from RCTs alone; broadening evidence sources can be valuable in certain decision contexts.
Limits and Critiques of the Hierarchy
- Borgerson’s critique (2009):
- The ranking is not obviously justified: higher-ranked methods are not necessarily less biased or better at identifying causal relationships.
- The hierarchy may overstate the objectivity of the higher levels and understate the value of mechanistic reasoning and other approaches.
- The strongest claims of EBM (the “strong version”): only randomized trials establish genuinely causal relationships; lower levels capture mere correlations.
- In essays and debates, distinguishing between assertion (claims without reasoning) and argument (claims supported by reasoning and evidence) is crucial.
- There are calls for a more nuanced approach: EBM+ advocates incorporating mechanistic studies alongside probabilistic evidence to improve causality inferences.
- Meta-analyses and systematic reviews sit at the top of the pyramid but raise concerns:
- They pool biases of individual studies, potentially amplifying biases if the included studies are flawed.
- Some accompanying summaries or press materials steer readers away from critical engagement with the data.
- Notable cautionary examples: homeopathy and ADHD meta-analyses illustrating how biased selections can mislead conclusions.
Supplementary Readings: Expanding the Debate
- Cartwright (2007): Are RCTs the gold standard? A nuanced defense of when RCTs work well and when they do not; introduces hypothetico-deductivism and other philosophical concepts.
- Greenhalgh et al. (2022): Advocates for EBM+; argues that RCTs alone are not sufficient for decision making, especially in rapid health decisions; suggests integrating mechanistic studies.
- Jureidini & McHenry: Critiques of industry-funded trials and the broader RCT paradigm; highlights external (non-methodological) influences on trial outcomes.
- Streptomycin TB example (Clarke et al.): Demonstrates how RCTs can mislead if mechanistic resistance is ignored; illustrates need for mechanism-aware interpretation.
- Worrall and other philosophers’ contributions: discuss limits of randomization and the role of explanatory reasoning in identifying causality.
Practical Implications: How to Use Evidence Wisely
- RCTs are essential but not sufficient in many cases; a broader evidence base improves decision making, especially in public health and complex interventions.
- Public health perspectives argue for broader evidence sources beyond RCTs when deciding on population-level interventions.
- In medicine and health policy, a balanced approach (EBM+) that includes mechanistic reasoning, observational data, and tailored context is increasingly advocated.
Writing and Academic Practice
- Thesis and argumentation: when outlining essays, start with a clear thesis, then structure arguments and counterarguments, citing main readings and supplementary readings.
- Distinguish between data, evidence, and knowledge; cite primary sources rather than relying solely on lectures.
- Use a structured outline, avoid overreliance on passive constructions, and allow time for revision and tutor feedback.
- Extensions and special consideration: short academic extensions are available; apply for special consideration when a longer deadline is needed.
Concluding Outlook: Week 6 and Beyond
- The course will explore Big Data and its relationship to theory and causal explanation, continuing the debate about how best to establish causality across disciplines.
- The ongoing conversation about EBM, its hierarchy, and the role of mechanisms suggests a pluralistic, context-sensitive approach to evidence in science and health.
Key Terms and Concepts (Glossary)
- Evidence-Based Medicine (EBM): a movement promoting the primacy of randomized trials and their aggregations as the basis for clinical decision-making.
- Bradford Hill criteria: a set of considerations used to infer causality from observational data, including temporality, strength, consistency, specificity, dose-response, plausibility, coherence, experimentation, analogy.
- Randomized Controlled Trial (RCT): a study design in which participants are randomly assigned to a treatment or control group, often with blinding and predefined outcomes.
- Quasi-experimental: non-randomized controlled trials used when randomization is unethical or impractical.
- Mechanistic reasoning: explanations that describe the biological or physical mechanisms by which an intervention produces an effect.
- Eliminative induction: a method of reasoning that rules out alternative explanations to isolate a causal claim.
- Mechanistic vs probabilistic explanations: mechanistic explanations describe how a system works; probabilistic explanations describe the likelihood of outcomes given certain factors.
- EBM+: an approach that integrates mechanistic and probabilistic evidence to inform causal inferences.
- Bias: systematic error that distorts study results; includes selection bias and ascertainment bias; mitigated by allocation concealment and blinding.
- Meta-analysis: a statistical technique for combining results from multiple studies to derive a pooled estimate of effect.
- Systematic review: a comprehensive, pre-planned review of the literature using explicit methods to minimize bias.
- Data, Information, Evidence, Knowledge: data are raw symbols; information is data in context; evidence is information used to evaluate explanations; knowledge is justified, consensus-based understanding.