Chapter 9 - Hypothesis Testing 138
Chapter 9 - Hypothesis Testing with 2 Samples
1. Introduction to Hypothesis Testing
Definition of hypothesis testing: A statistical method used to make inferences about population parameters based on sample data.
2. Distinction between t-test and z-test
Opening Activity: How do we tell the difference between a t-test and a z-test?
Key Differences:
t-test: Used when the sample size is small (typically less than 30) or when the population standard deviation is unknown.
z-test: Used when the sample size is large (typically 30 or more) and the population standard deviation is known.
3. Testing Between 2 Proportions
3.1 Key Details
HATNC: Hypotheses About Two Population Proportions
Differences:
Hypotheses are not about a number but about 2 population proportions.
Assumptions must be true for both samples, with a twist concerning the hypothesized population proportion (pc).
3.2 Example: Theta Drug Company
The Theta Drug Company claims their new drug is more effective for women than men.
Study parameters:
Sample Size: 100 women, 200 men.
Results: 38% of women caught colds; 51% of men caught the flu.
Objective: Determine if the drug is statistically more effective for women at a significance level of 0.01.
4. Testing Between 2 Means
4.1 Example: Restaurant Manager Performance
Scenario: Mio, the restaurant owner, wants to test if her two managers perform at the same level using data on customer complaints.
Conducting a two-sample test to determine if the mean complaints differ between the two managers.
Key Note: Ensure that all conditions for performing the test are met.
4.2 Caution in Testing
Do not confuse:
Two-sample test with matched pairs test.
Ensure samples are independent and from entirely different populations.
4.3 Example: Effectiveness of Hypnotism
A study investigates hypnotism in reducing pain using matched pairs.
Data shows before and after scores; differences calculated.
Objective: Test at a 5% significance level if average sensory measurements are lower after treatment.
5. Confidence Intervals for the Difference in Parameters
5.1 Overview
Use confidence intervals to determine differences between two population parameters (proportions or means).
6. Confidence Intervals for the Difference in Proportions
6.1 Constructing Confidence Intervals
Sample Scenario:
Age 18-29: 316 users, 26% use Twitter.
Age 30-49: 532 users, 14% use Twitter.
Task: Construct and interpret a 90% confidence interval for the difference in proportions of Twitter users among these age groups.
7. Confidence Intervals for the Difference in Means
7.1 Example with Professional Athletes
Sample 1: 45 professional football players, mean height = 6.18 feet, SD = 0.37.
Sample 2: 40 professional basketball players, mean height = 6.45 feet, SD = 0.31.
Task: Find a 95% confidence interval for the difference in mean heights.
Assess whether football players are generally shorter than basketball players.
8. Matched Pairs t-Interval
Study of high school students to compare sitting vs. standing pulse rates.
Objective: To conclude if sitting pulse rates are significantly lower than standing rates.
9. Calculator Tricks
9.1 Using Pooled Options for t-Intervals
For t-intervals, pooled option is usually insignificant as standard deviations are often unknown.
Recommendation: 99% of the time, leave as "no" for pooled standard deviation.
9.2 Matched Pairs t-Interval
Matched pairs t-intervals analyzed from a single group of subjects measuring differences between treatments.