Understanding the Data: Study Design & Bias in Research

Lecture 4: Understanding Data in Research

Study Design & Bias in Research

  • Course: KHPM324 Chronic Diseases of Modern Society

  • Instructor: Hannah Oh, ScD

  • Institution: Division of Health Policy & Management, College of Health Sciences, Korea University

Review Test

  • Access test at www.socrative.com

    • Student log-in required

    • Enter name for participation points

    • Room name: [To be titled in class]

  • A total of 3 questions on the test

Research Question Design Example

  • Scenario: Testing whether a new drug (Drug A) reduces the risk of stroke.

  • Study Design Question: How would you design a study to answer this question?

Randomized Experiment Design

  • Components:

    • Drug A (new drug) vs. Placebo

    • Randomized assignment of subjects into groups

  • Question: What would be the 1-year risk of stroke for both groups?

Epidemiologic Study Designs

  • Experimental Studies:

    • Involves intervention by the investigator in treatment assignments.

    • Example: Randomized Controlled Trial (RCT)

  • Observational Studies:

    • Involves observing outcomes based on natural treatment assignments.

    • Types:

    • Cross-Sectional Study

    • Cohort Study

    • Case-Control Study

Randomization in Studies

  • Process:

    • Divide subjects into two groups: Group 1 & Group 2

    • Random assignment of treatment (e.g., drug A or placebo)

  • Outcome of Randomization:

    • Ensures comparability between groups

    • 1-year risk in both groups remains the same regardless of treatment assignment

Study Types: Observational Study Designs

Cross-Sectional Study
  • Focus: Disease frequency comparison (exposed vs. unexposed) at a specific time

  • Limitations: Provides a snapshot, lack of temporal data

Cohort Study
  • Structure:

    • Groups: Exposed group (at risk) vs. Unexposed group

  • Outcome: Measure incidence of disease over time

Case-Control Study
  • Definition:

    • Comparison of cases (with disease) and controls (without disease)

  • Outcome: Frequency of exposure comparisons between cases and controls

Study Design Phases

Prospective Studies
  • Definition: Exposures measured before outcome events occur

  • Characteristics:

    • More costly and time-consuming

Retrospective Studies
  • Definition: Exposures measured after outcome events have occurred

  • Characteristics:

    • Prone to incorrect information gathering

Errors in Research

  • Categories:

    • Random (chance errors)

    • Systematic (bias errors)

Implications of Errors
  • Systematic Error:

    • Flaws in study design, data collection, or analysis can lead to incorrect conclusions

    • Broad types of bias include:

    • Confounding

    • Selection Bias

    • Information Bias (measurement error or misclassification)

Confounding Bias Examples

Example 1: Down Syndrome & Birth Order
  • Question: Is birth order a cause of Down syndrome?

  • Finding: Maternal age influences both factors (risk).

Example 2: Aspirin & Stroke
  • Observation: People taking aspirin may show a higher stroke rate.

  • Confounding Variable: The presence of atherosclerosis increases both stroke risk and the likelihood of aspirin use—thus misleading correlation.

    • Atherosclerosis is defined as the buildup of fats, cholesterol, and other substances in artery walls which can restrict blood flow.

Understanding Confounders

  • Definition:

    • A confounding variable is both causally associated with the outcome and either causally or non-causally associated with the exposure without being an intermediate variable.

Reducing Confounding in Research

  • Continuation:

    • Randomization provides similar risk factor distributions across groups.

    • Other methods:

    • Standardization (e.g., age-standardized measures)

    • Restriction and stratification (e.g., comparisons within the same age group)

    • Covariate adjustment in regression models

Selection Bias

  • Description:

    • A systematic error occurring due to biased recruitment or retention of study participants.

    • Examples of analysis illustrating selection bias through disproportionate representation of smokers and non-smokers and their respective disease statuses are illustrated with ratios and percentages reflecting risks.

Recall Bias

  • Definition:

    • A type of information bias resulting from inaccurate recall of past exposure.

  • Example: In cases of birth defects, mothers may recall pregnancy exposures differently based on whether their child has a defect, leading to differential accuracy in recall.

Reducing Recall Bias

  • Strategies:

    • Frame survey questions to improve recall accuracy.

    • Utilize a control group composed of mothers of babies born with defects unrelated to the study.

    • Gather historical data from medical records or biomarkers recorded pre-outcome.

Conclusion

  • Recap and anticipation for the next lecture: See you next week!