Lecture Notes: Critical Thinking in Statistics (1.1–1.2)

The Sixth Sense for Thinking Statistically

  • Six core principles to apply when evaluating statistics (the “sixth sense”):

    • Data beat anecdotes; belief is not a substitute for arithmetic. Always ask: Is there data backing the claim? Who are the authors/credentails? Is it peer-reviewed?
    • Aim for a bird's eye view (data from many) rather than a worm's eye view (one anecdote). Consider broad, representative data rather than a single story.
    • Beware the data source: where data come from matters. Distinguish random samples (more representative) from opt-in polls (potential bias).
    • Watch for lurking variables (alternative explanations). Confounding factors can explain apparent relationships.
    • Variation is everywhere; uncertainty is a given. Distinguish common-cause variation from special-cause variation; expect random error.
    • Conclusions are not certain; report and consider confidence/uncertainty. Use ranges (e.g., a
      CICI) rather than a single point estimate when possible.
    • Data reflect social and political values. Measurement choices and what gets studied can change over time and across cultures.
  • Examples used to illustrate the six sense:

    • Power lines and cancer debate: anecdotal claims vs large case-control studies; importance of distinguishing causation from correlation; media use of anecdotes vs data.
    • Birth month and height: large observational studies can show statistically significant differences that may be practically negligible; consider sample context and potential confounders.
    • Left-handedness and longevity: potential confounding (historical practices forcing hand-switching) can distort conclusions; sample composition matters.
    • OECD obesity graph: self-reported vs measured data differ; context of population and data collection method matters.
    • Bus arrival times: illustrated random variation vs special causes (accidents) and the need to separate them.
    • Mammograms and mortality risk: single figure vs confidence interval; snapshots vs over-time interpretation.
    • WEIRD and data representativeness: most data are from Western, educated, industrialized, rich democracies; avoid overgeneralizing to other populations.
    • Data sources like social media or app-based data can introduce sampling bias (who uses the platform).
    • Potholes app in a city can over-represent wealthier areas if uptake is higher there.
    • Lies, damn lies, and statistics: data sources matter; some numbers in the wild are wrong or misinterpreted.
  • Key takeaway: develop a habit of asking what data exist, where they came from, who is included, what is being measured, and what might be missing or confounded.

Data provenance, sampling, and representativeness

  • Data origins matter for trustworthiness:

    • Market research firms (random samples) vs website polls (opt-in, self-selected).
    • Random samples aim for representativeness; opt-in polls risk bias.
    • Reporting should clearly label polls as scientific or non-scientific when presenting results.
  • Missing and unrepresentative data:

    • Historically, many studies focused on men; data sets may be WEIRD (Western, Educated, Industrialized, Rich Democracies).
    • Big data sources (social media, streaming subscribers) exclude non-users and can misrepresent populations.
    • Real-world examples show bias in data collection (e.g., pothole reporting skewed toward wealthier areas).
  • Practical note on data credibility:

    • Data from reputable, randomly sampled sources are generally more trustworthy than opportunistic online polls.
    • Always check whether data are drawn from a random sample and whether the sample is representative of the population of interest.

Hidden variables and alternatives explanations

  • Lurking variables can explain apparent associations:

    • Example: owning at least two cars correlates with longevity, likely due to SES/income confounding.
    • Confounding factors can masquerade as causal relationships.
  • Observational studies vs experiments:

    • Observational evidence can show associations but not causation; experiments are needed for causal claims.
  • Anecdotes vs data:

    • Individual stories are compelling but not reliable evidence of general patterns.

Variation, uncertainty, and how to read results

  • Variation is inherent in data:

    • Random (common-cause) variation vs special-cause variation (e.g., accidents).
    • Expect bus arrivals to vary around a typical time; occasional deviations happen.
  • Uncertainty in conclusions:

    • Example: mammogram study reported a
      26%26\% reduction in mortality in a snapshot; best practice is to report a confidence interval around the estimate, not a single point value.
    • A single figure is less reliable than a range of plausible values.
  • Data interpretation depends on context:

    • Does the reported difference matter in practice (practical significance) as well as statistically significant?

Data and values: social and political influences

  • Data measurement reflects values and priorities:

    • How poverty, unemployment, homelessness, bullying, and workplace safety are defined and measured changes over time.
    • COVID-19 death counting practices changed during the pandemic, affecting death rate estimates.
  • The role of funding and governance:

    • Longitudinal studies (e.g., Dunedin) can face funding pressures linked to political winds, which can influence research continuity.
  • Questions to reflect on in reading and evaluating data:

    • Why do we hear so much bad news? How are perceptions formed?
    • Do funders influence research topics or methods?
    • Do media portray a lopsided view of reality?
    • How do we spot fake or misleading statistics in reporting?
  • Reading and resources mentioned for critical scrutiny:

    • Guardian article: nine-point guide to spotting a dodgy statistic.
    • TED talk: three ways to spot a bad statistic.

Guidelines and worry questions for evaluating a study (overview)

  • A two-page document titled Guidelines and Worry Questions for Evaluation of a Study is provided for Week 1:

    • Aim: build a structured approach to critically assess studies.
    • Eight steps in the guidelines, used to evaluate a study.
  • Step 1: Determine the type of study. Types listed include: sample survey, experiment, observational study, anecdote, or combinations thereof.

  • Step 2: Identify the seven critical components (to be stepped through). This section indicates there are eight steps total and that step 2 covers seven components.

  • Note: The details of steps 3–8 are not listed in the transcript, but the framework emphasizes systematic evaluation of study design and evidence.

Short recap and upcoming focus

  • Today and tomorrow: focus on reading the news critically and evaluating statistical claims in articles, which will feed into Assignment 1 (due end of Week 6).
  • Readings and in-class discussions will cover identifying data sources, sampling, bias, and appropriate interpretation of results.

Reading and assignments reminder

  • Article referenced for Week 1 page: critical reading of news items and statistical claims.
  • Practice expectations: ability to distinguish data-driven conclusions from anecdotes, assess sampling, and interpret uncertainty.