Statistical Significance and Related Concepts

Statistical Significance

Overview

  • Course: COMG 102: Everyday Communication with Numbers

  • Topics Covered:

    • Inferential statistics

    • Probability

    • Hypothesis testing and Type 1 Errors

    • Limitations

    • Effect Sizes

    • Practical significance

    • Confidence Intervals

Inferential Statistics

  • Definition: Inferential statistics involve using data collected from a sample to make inferences about a population.

  • Context: After collecting data and examining its characteristics (Descriptive Statistics), we can assess the meaningfulness of the sample data.

  • Tools for inference:

    • Statistical significance

    • Effect sizes

    • Confidence intervals

  • Purpose: Inference allows us to go beyond descriptive statistics and derive conclusions about the broader population.

Statistical Significance

  • Definition: Statistical significance refers to the likelihood or probability that a sample observation reflects what is happening within the entire population.

  • Importance: Establishes two core points:

    • The observed phenomenon in the sample is also occurring in the larger population.

    • The observation is statistically rare—greater than chance—and thus notable and deserving of further inquiry.

Normal Distribution Context
  • Knowledge of Normal Distribution:

    • Most common values in a distribution are near its center.

    • Least frequent values are found at the distribution's tails.

    • Statistically significant values are identified in these tail ends.

Probability in Statistical Significance

  • Relevance: Probability is crucial in inferential statistics.

  • Sampling Distribution of Means:

    • Example context: Selecting an infinite number of random samples from a population (e.g., men’s shoe sizes).

    • Enables calculation of the probability of obtaining specific values within the population.

Hypothesis Testing

  • Types of Hypotheses:

    • Null Hypothesis: suggests no effect, indicating no statistically significant difference in means between the sample and population.

    • Alternative Hypothesis: suggests that a difference does exist.

      • One-tailed hypothesis examples: Sample mean is greater than the population mean.

      • Two-tailed hypothesis examples: Sample mean differs from the population mean without specifying direction.

Assessing Statistical significance
  • Distinction between Difference and Significance: A difference in means does not automatically imply statistical significance due to potential random sampling error.

  • Determining Significance:

    • Important criterion: The difference must reach an alpha level ($ ext{p-value}$) of .05, meaning the chance of a result occurring by chance is less than 5%.

    • Techniques: Statistical tests (e.g., z-score) help determine this probability.

      • Significant if p < .05.

Example of Shoe Size Survey
  • Population Mean: 11 (men’s shoe sizes)

  • Sample Mean: 9

  • Questions posed: Is this difference significant?

    • Evaluate probability:

      • Example probabilities might include p = .30 or p = .01.

Limitations of Statistical Significance

  • Challenges:

    • Forces researchers into binary yes-or-no questions regarding significance.

    • Findings may lack generalizability, truth, meaning, and replicability.

    • Heavily dependent on sample size: Minor increases in sample size might lead to statistical significance.

    • Necessity of additional metrics like effect size and confidence intervals to fully understand results.

Effect Size

  • Definition: Effect size gauges the magnitude of difference between two groups; indicates the level of impact (small vs. large).

  • Illustrative Examples:

    • Strong relationship: Studying and exam scores.

    • Weak relationship: Text message frequency and relationship satisfaction.

  • Stability: Effect size remains consistent regardless of sample size.

  • Complementary Importance: Effect size, combined with statistical significance, provides crucial information to assess the reliability of results.

  • Common Metrics:

    • Cohen’s d: Common formula associated with t-tests to measure effect size.

    • Correlation coefficient (r): Effect size associated with correlation tests.

Practical Significance

  • Definition: Practical significance assesses whether differences between groups hold real-world relevance.

  • Consideration: This is often subjective, influenced by the testing context and ultimate goals.

  • Example Contexts:

    • A 3% change in preference for ice cream flavors may be statistically significant but not practically significant.

    • A 3% mortality rate of a new clinical drug poses real-world implications.

Confidence Intervals

  • Definition: Confidence Intervals (CIs) represent a range of values believed, to a certain confidence level, to contain the population parameter.

  • Accuracy Assessment: Even with unknown population parameters, they provide a better estimate of population representation.

  • Levels of Confidence:

    • 95% confidence intervals are associated with p = .05.

    • 99% confidence intervals are associated with p = .01.

  • Integral Components: For a holistic understanding, statistical significance, effect size, and confidence intervals should be considered together.

Summary

  • Core Topics:

    • Statistical Significance:

      • Probability

      • Hypothesis Testing (including Type 1 Errors)

      • Limitations

    • Effect Sizes:

      • Practical Significance

    • Confidence Intervals