Statistical Significance and the Role of Probability in Statistics
The Role of Probability in Statistics: Statistical Significance
The fundamental learning goal is to understand the concept of statistical significance and the essential role that probability plays in defining it within a statistical study.
Definition of Statistical Significance: A set of measurements or observations in a statistical study is said to be statistically significant if it is unlikely to have occurred by chance.
Qualitative Examples of Statistical Significance
Criminal Investigation in Detroit:
A detective finds that of the guns used in crimes during the past week were sold by the same gun shop.
This finding is considered statistically significant because there are many gun shops in the Detroit area. Consequently, having out of guns originate from a single shop is deemed unlikely to have occurred by chance alone.
Global Temperature Records:
Data Observation 1: In terms of the global average temperature, of the years between and were the five hottest years in the century. This finding is statistically significant.
Data Observation 2: Having the hottest years on record occur all in a row in a data set that goes back approximately years is statistically significant.
Conclusion: Such a streak of hot years is very unlikely to have occurred by chance alone and therefore provides strong empirical evidence of a warming world.
Basketball Win-Loss Records:
Scenario: The team with the worst win-loss record in a basketball league wins a single game against the defending league champions.
Assessment: A single win in this context is not statistically significant. Although a team with a poor record is expected to lose most of its games, it is also expected to win occasionally, even against high-performing teams, due to normal variation in performance. This event is reasonably likely to occur by chance.
Statistical Significance in Experimental Research
Case Study: Herbal Formula for Preventing Colds
Methodology: A researcher conducts a double-blind experiment using a treatment group and a control group.
Participants: randomly selected people in the treatment group (given the herbal formula) and randomly selected people in the control group (given a placebo).
Duration: The study lasted for a three-month period.
Results:
Treatment Group: people developed colds.
Control Group: people developed colds.
Analysis: Whether a person contracts a cold during a three-month period depends on many unpredictable factors. Small differences between groups should be expected.
Conclusion: The difference between individuals and individuals is small enough to be explained by chance. Therefore, the difference is not statistically significant, and the researcher cannot conclude that the herbal formula is effective.
Quantifying Statistical Significance
Necessity of Quantification: Qualitative definitions of significance are too vague for rigorous science. Probability is used to quantify the likelihood that an observed result occurred by chance.
The Significance Threshold Question: Is the probability that the observed difference occurred by chance less than or equal to (also expressed as in )?
Decision Rules:
If the probability is less than or equal to , the difference is said to be statistically significant at the level.
If the probability is greater than , the observed difference is considered reasonably likely to have occurred by chance and is therefore not statistically significant.
Common Probability Levels:
Statistical significance at the level means the probability of the result occurring by chance is ( in or less).
Statistical significance at the level means the probability of the result occurring by chance is ( in or less).
Although is a common choice, it is somewhat arbitrary; statisticians may also use other probabilities such as or depending on the study requirements.
Case Study: Salk Polio Vaccine Significance
Study Parameters:
Treatment Group: children received the Salk polio vaccine.
Control Group: children received a placebo.
Observed Data:
Treatment Group Cases: children developed paralytic polio.
Control Group Cases: children developed paralytic polio.
Probability Calculation:
Researchers calculated that the probability of this specific difference occurring by chance was approximately (or ).
Assessment of Significance:
Because this probability is significantly lower than both and , the results are considered statistically significant at both the and levels.
This extremely low probability—later referred to in statistics as a "P-value"—gave researchers high confidence that the vaccine was truly responsible for the reduction in polio cases rather than random chance.
Questions & Discussion
Think About It Question: Suppose an experiment finds that people taking a new herbal remedy get fewer colds than people taking a placebo, and the results are statistically significant at the level. Has the experiment proven that the herbal remedy works? Explain.
Implicit Explanation: Statistical significance at the level indicates a very low probability (1 in 100 or less) that the results were due to chance, suggesting the remedy is likely effective. However, in statistics, "proof" is rarely absolute; rather, it indicates strong evidence where the likelihood of being wrong is quantified (e.g., a chance the result still happened by luck).