Definition of hypothesis testing: A statistical method used to make inferences about population parameters based on sample data.
Opening Activity: How do we tell the difference between a t-test and a z-test?
Key Differences:
t-test: Used when the sample size is small (typically less than 30) or when the population standard deviation is unknown.
z-test: Used when the sample size is large (typically 30 or more) and the population standard deviation is known.
HATNC: Hypotheses About Two Population Proportions
Differences:
Hypotheses are not about a number but about 2 population proportions.
Assumptions must be true for both samples, with a twist concerning the hypothesized population proportion (pc).
The Theta Drug Company claims their new drug is more effective for women than men.
Study parameters:
Sample Size: 100 women, 200 men.
Results: 38% of women caught colds; 51% of men caught the flu.
Objective: Determine if the drug is statistically more effective for women at a significance level of 0.01.
Scenario: Mio, the restaurant owner, wants to test if her two managers perform at the same level using data on customer complaints.
Conducting a two-sample test to determine if the mean complaints differ between the two managers.
Key Note: Ensure that all conditions for performing the test are met.
Do not confuse:
Two-sample test with matched pairs test.
Ensure samples are independent and from entirely different populations.
A study investigates hypnotism in reducing pain using matched pairs.
Data shows before and after scores; differences calculated.
Objective: Test at a 5% significance level if average sensory measurements are lower after treatment.
Use confidence intervals to determine differences between two population parameters (proportions or means).
Sample Scenario:
Age 18-29: 316 users, 26% use Twitter.
Age 30-49: 532 users, 14% use Twitter.
Task: Construct and interpret a 90% confidence interval for the difference in proportions of Twitter users among these age groups.
Sample 1: 45 professional football players, mean height = 6.18 feet, SD = 0.37.
Sample 2: 40 professional basketball players, mean height = 6.45 feet, SD = 0.31.
Task: Find a 95% confidence interval for the difference in mean heights.
Assess whether football players are generally shorter than basketball players.
Study of high school students to compare sitting vs. standing pulse rates.
Objective: To conclude if sitting pulse rates are significantly lower than standing rates.
For t-intervals, pooled option is usually insignificant as standard deviations are often unknown.
Recommendation: 99% of the time, leave as "no" for pooled standard deviation.
Matched pairs t-intervals analyzed from a single group of subjects measuring differences between treatments.