Statistics in Psychology and Everyday Life Study Notes

Module 8: Statistical Reasoning in Everyday Life

Learning Targets

  • 8-1 Explain why we need statistics in psychology and in everyday life.
  • 8-2 Describe descriptive statistics.
  • 8-3 Explain how we describe data using the three measures of central tendency.
  • 8-4 Discuss the relative usefulness of the two measures of variation.
  • 8-5 Describe inferential statistics.
  • 8-6 Explain how we determine whether an observed difference can be generalized to other populations.

The Need for Statistics

8-1 Why do we need statistics in psychology and in everyday life?
  • Role in Psychology:
    • Psychologists use statistics in descriptive, correlational, experimental, and other research designs to measure variables and interpret results.
  • Importance of Statistical Understanding:
    • Accurate statistical understanding is crucial for everyone, enhancing clarity and critical thinking about data in daily life.
    • Memorizing complicated formulas is not necessary to think critically about statistical data.
  • Risks of Misinformation:
    • Quick estimates can lead to misrepresentation of reality, creating public misinformation.
    • Examples of misinformation include assumptions about the percentage of people who are gay and the usage of brain capacity:
    • Big claims such as “10 percent of people are gay” compared to “3 to 4 percent” from national surveys.
    • The idea that we only use “10 percent of our brain,” which is misleading as we utilize nearly all parts.
  • Doubt Big Numbers:
    • Question the validity of large, round numbers presented without documentation (e.g., claims about missing children or homeless individuals) as they might be motivated by bias.
  • Goal-Setting Bias:
    • People are more inclined to set goals with round numbers (e.g., losing 20 pounds rather than 19 or 21)
    • Statistical behavior modification is nearly four times more effective when focusing on the .300 average rather than .299.
  • Statistical Illiteracy and Health Risks:
    • Misinformation can lead to health scares, exemplified by media reporting on contraceptive pills causing a supposed 100% increase in blood clot risks. The actual increased risk was from 1 in 7000 to 2 in 7000, highlighting the need for accurate statistical reasoning.

Descriptive Statistics

8-2 What are descriptive statistics?
  • Definition:
    • Descriptive statistics summarizes and describes characteristics of a group under study, similar to how teachers assess student performance.
  • Data Visualization:
    • Researchers represent data using visual formats like histograms, which display distributions (e.g. truck durability after 10 years).
  • Graphical Representation:
    • Caution: Read scale labels carefully, as misleading graph designs can exaggerate or minimize differences:
    • Example of two graphs differing only in their y-axis scale, where a limited range can make differences appear more pronounced (Graph (a)) than they actually are (Graph (b)).
  • Key Point: Always verify the scale labels when interpreting graphs to comprehend the data accurately.
Measures of Central Tendency
8-3 How do we describe data using the three measures of central tendency?
  • Definition of Central Tendency:
    • A single score representing a set of scores.
    • Three main measures: Mode, Mean, and Median:
    • Mode: The most frequently occurring score or scores.
    • Mean: The arithmetic average, calculated by summing all scores and dividing by the number of scores.
    • Median: The middle score in an ordered distribution; it divides the scores into two equal halves.
  • Symmetrical Distributions:
    • In a symmetrical bell-shaped distribution, the mode, median, and mean are often similar.
  • Skewed Distributions:
    • In skewed distributions, the mean can be disproportionately affected by extreme values, leading to significant discrepancies:
    • Example with income data wherein high incomes disproportionately raise the mean compared to the median.
  • Important Consideration:
    • Recognize that the mean can misrepresent actual data trends, especially in populations with significant outliers.

Measures of Variation

8-4 What is the relative usefulness of the two measures of variation?
  • Understanding Variation:
    • Averages can be misleading if not accompanied by measures of variation, which indicate how scores differ from one another.
  • Importance of Low Variability:
    • Averages based on scores with low variability provide more reliable predictions (e.g., a consistent scoring basketball player).
  • Range as a Measure of Variation:
    • The range (difference between highest and lowest scores) is often imprecise as it can overlook variations in other scores.
  • Standard Deviation:
    • A more useful measure that examines how much individual scores differ from the mean:
    • It takes all scores into account.
    • Example illustrating differences in standard deviation between classes, where one has clustered scores (low variability) and the other has widely dispersed scores (high variability).

Inferential Statistics

8-5 What are inferential statistics?
  • Definition and Purpose:
    • Inferential statistics enables researchers to infer conclusions about a population based on sample data, evaluating the reliability of observed differences.
  • Understanding Noise in Data:
    • Averages may differ between groups not due to genuine differences but chance fluctuations.
  • Reliability of Observed Differences:
    • Inferential statistics help assess how likely it is that observed differences reflect real differences rather than random chance.
When Is an Observed Difference Reliable?
8-6 How do we know whether an observed difference can be generalized to other populations?
  • Principles for Generalization:
    1. Representative Samples:
    • Effective generalizations stem from samples that accurately represent the relevant population rather than exceptional or biased samples.
    1. Variability:
    • Observations with low variability yield more reliable averages than those with high variability (examples illustrate this point).
    1. Sample Size:
    • Larger samples tend to be more reliable than smaller ones.
  • Caution Against Anecdotes:
    • Generalizations from anecdotal evidence or small samples can lead to erroneous conclusions.
  • Statistical Significance and Reliability:
    • Statistically significant findings indicate observable differences are likely not due to chance; however, they must also be practical in importance.
  • Illustrative Examples:
    • Large sample studies that yield statistically significant results but are minor in real-world impact (e.g., social media behavior) must be evaluated for practical relevance.
  • Conclusion:
    • Smart analytical thinking requires distinguishing between statistical significance (likelihood of being due to chance) and practical significance (real-world relevance) in research findings.