Lecture Notes on Hypothesis Testing
Alpha Level and Medical Questions
Default Alpha Levels:
0.05 for medical questions.
0.01 for non-medical questions is suggested, but not strictly enforced.
Any deviation from these must be explained.
Testing Hypotheses
Alpha and H0 (Null Hypothesis):
Alpha signifies the probability that H0 is true when data suggests otherwise.
Set an alpha level for the hypothesis test and use that for decision-making.
Example of Coin Flipping
Fair Coin Test:
Experiment with 100 flips.
If results fall between a specific range (e.g., 45 heads), consider it fair, otherwise conclude it's not.
Sample Mean and Proportions
Statistical Symbols:
U = population mean; Pi (π) = proportion of successes.
Initial belief about a proportion (e.g., 15% left-handed) is expressed as Pi = 0.15.
Estimating Success:
Use x̄ (sample mean) and π̂ (sample proportion) as estimates.
Hypothesis Testing
Purpose:
To determine if collected data supports or contradicts initial beliefs.
Conclusion Outcome:
Accept or reject the null hypothesis based on statistical calculations.
Essential to understand what data indicates regarding initial beliefs (H0).
Data Validity and Confidence
Data Assumptions:
Default expectation is a simple random sample unless stated otherwise.
Any biases must be documented.
Test Statistic and P-Value
Calculating P-Value:
Test statistics lead to the calculation of p-value.
Compare p-value against alpha to decide on H0.
Reject H0 if p < alpha.
Court Analogy:
A jury might not assert clear guilt or innocence but rather determine if the evidence is sufficient or not to conclude.
Reporting Conclusions
Conclusion Writing:
Write conclusions comprehensibly for non-experts.
Avoid statistical jargon in conclusions: no references to null hypothesis or p-values should be present.
Clear Statements:
Conclusions should directly answer the research question in simple language (e.g., "the insurance company underestimated the average claim amounts").
Assumptions and Concerns
Key Assumptions:
Random sampling is a default assumption unless proven otherwise.
Test for normality is crucial when sample sizes are small.
Concerns:
Document any concerns arising from data collection methods.
Example Case Studies
Insurance Claims
Original belief: average claim = $1,800.
New sample data resulting in a mean of $1,950, indicating potential miscalculation.
Calculate test statistics and compare with alpha to decide on conclusions.
Parking Time Study
Question posed against university’s claim of 30 minutes on average.
Sample data shows lesser time (25 minutes).
Analyze using appropriate test statistics for a valid conclusion.
Quality Control Case
Mean weight of objects supposed to be 16 ounces.
Sample recorded at 15.8 ounces.
Calculate test stats; conclusion indicates potential regulatory actions necessary.
Election Polling
Assessing the likelihood of candidate winning with proportions from surveys.
P-value calculates relation to alpha (0.01) to support or reject the winning prediction.
Final Remarks
Understand the implications of not rejecting the null hypothesis and how it correlates with real-world situations.
Ensure clarity and comprehension in reporting results to facilitate understanding for all audiences, regardless of their statistical knowledge.