Experimental Design and Causality: Quick Notes (Caffeine Reaction Time Lab)
Key Concepts
- Experimental research involves making comparisons between groups; true experiments use random assignment to control groups, while quasi-experiments do not.
- True experiments vs quasi experiments:
- True: manipulate the independent variable; compare an experimental group to a control group; random assignment used.
- Quasi: groups are pre-existing or not randomly assigned; harder to infer causation due to confounds.
- Association vs causation:
- Association: a relationship or correlation between variables (e.g., caffeine and reaction time).
- Causation requires three conditions: association, temporal precedence, and non-spuriousness.
- Operationalization in the caffeine study: caffeine vs decaf vs nothing measured via reaction time (roller drop task).
- placebo effect: improvement due to expectation rather than the active treatment; controlled with placebo group (decaf) and blinding.
- Blinding: reducing bias by concealing group assignment; single, double, and triple blinding vary who is unaware (participants, experimenters, or data analysts).
- Gold standard: randomized controlled trials (RCTs) with random assignment, placebo controls, and blinding; ideally double blinded.
- Experimental designs:
- Between groups (between-subjects): different participants in each condition.
- Within groups (within-subjects, repeated measures): same participants measured under each condition, with a baseline first measurement.
- Mixed design: combines between and within features.
- Baseline measurement: a pre-treatment measure used for comparison in within-subjects or interrupted time series designs.
- Non-spuriousness: elimination of alternative causes (confounds) via randomization, control groups, blinding.
- Ethical and practical limits: some causal variables (trauma, brain injury) cannot be manipulated; researchers rely on other designs.
Experimental Designs and Their Features
- Between groups design:
- Two or more separate groups; one receives treatment, others serve as controls.
- Example: caffeine vs decaf; sometimes include a no-treatment group.
- Within groups design (repeated measures):
- All participants experience all conditions; measures taken before and after treatment.
- Advantage: controls for individual differences; disadvantage: practice effects.
- Mixed design:
- Combines between and within aspects; two measurements (pre and post) across groups.
- Allows both between-group and within-group comparisons.
- Quasi-experimental designs:
- Non-equivalent groups or interrupted time series without full random assignment.
- Stronger threats to internal validity due to confounding group differences.
Randomization and Control of Confounds
- Random assignment:
- Equal chance of ending up in experimental or control group.
- Cancels out individual differences (age, motivation, fatigue, etc.) across groups.
- Essential for eliminating alternative causes (non-spuriousness).
- Why not just randomize to a no-treatment group?
- To control for placebo effects; use a placebo control (e.g., decaf coffee) to ensure equal experience across groups.
- Placebo controls:
- Placebo: an intervention with no active ingredient but with expected effect.
- In psychology/medicine, helps isolate the effect of the active treatment from expectations.
- Blinding:
- Single blind: participants unaware of their group.
- Double blind: both participants and experimenters unaware.
- Triple blind: also analysts/data interpreters unaware.
- In this study, double blind was used; there was also a test for blinding effectiveness (survey showing ~54% could guess correctly, near chance).
Baseline, Time, and Alternative Explanations
- Baseline measurement:
- A pre-treatment measurement to compare against post-treatment results.
- Helps attribute changes to the treatment rather than pre-existing differences.
- Potential confounds (examples discussed):
- Sleep prior to lab, regular caffeine use, distraction, other substances, time of day, sugar in coffee, ADHD status, age, fatigue, motivation, etc.
- Eliminating confounds:
- Random assignment distributes confounds evenly.
- Placebo control accounts for expectancy effects.
- Blinding reduces biases in treatment administration and data interpretation.
Causation Framework in the Lab
- Hypothesis testing form:
- If caffeine increases alertness, then RT<em>extcaffeine<RT</em>extnocaffeine.
- Three criteria for causation:
- Association: caffeine and faster reaction time should be related.
- Temporal precedence: caffeine exposure precedes the faster reaction time.
- Non-spuriousness: alternative explanations are ruled out (via randomization, placebo, blinding).
- How the caffeine study fulfills (or falls short of) these:
- Random assignment helps satisfy non-spuriousness by balancing confounds.
- Placebo and blinding control for expectation effects.
- A second control group (no drink) is quasi-experimental, introducing some uncertainty about causality for that group.
Practical Takeaways and Design Variants
- True experiments are the gold standard for establishing causation when feasible, often labeled randomized controlled trials (RCTs).
- Limitations of true experiments: ethical, practical, or cost constraints may prevent manipulation of certain variables.
- Recognize variations:
- Between groups: compare separate groups.
- Within groups: compare the same participants across conditions with potential learning effects.
- Mixed: combine both approaches to balance advantages.
- Ethical note: not all variables can be manipulated; alternative designs are used to infer causal relationships where possible.
Quick Reference: Key Terminology
- Association: correlation between A and B.
- Causation: association + temporal precedence + non-spuriousness.
- Random assignment: equal chance of group allocation; balances confounds.
- Placebo effect: improvement due to expectations, not the active treatment.
- Blinding: concealing group assignment to reduce bias; single/double/triple.
- Between groups vs within groups vs mixed designs.
- Baseline measurement: pre-treatment measure for comparison.
- Gold standard: randomized, placebo-controlled, ideally double-blinded experiments.
- Quasi-experiment: lacks full randomization; more vulnerable to confounds.
- Interrupted time series: quasi analogue where measurements occur before and after an event.
- Reaction time: a common dependent variable for alertness studies; lower is faster.