Chapter 9 - Hypothesis Testing 138

Chapter 9 - Hypothesis Testing with 2 Samples

1. Introduction to Hypothesis Testing

Definition of hypothesis testing: A statistical method used to make inferences about population parameters based on sample data.

2. Distinction between t-test and z-test

Opening Activity: How do we tell the difference between a t-test and a z-test?
- Key Differences:
  - t-test: Used when the sample size is small (typically less than 30) or when the population standard deviation is unknown.
  - z-test: Used when the sample size is large (typically 30 or more) and the population standard deviation is known.

3. Testing Between 2 Proportions

3.1 Key Details

HATNC: Hypotheses About Two Population Proportions
Differences:
- Hypotheses are not about a number but about 2 population proportions.
- Assumptions must be true for both samples, with a twist concerning the hypothesized population proportion (pc).

3.2 Example: Theta Drug Company

The Theta Drug Company claims their new drug is more effective for women than men.
Study parameters:
- Sample Size: 100 women, 200 men.
- Results: 38% of women caught colds; 51% of men caught the flu.
Objective: Determine if the drug is statistically more effective for women at a significance level of 0.01.

4. Testing Between 2 Means

4.1 Example: Restaurant Manager Performance

Scenario: Mio, the restaurant owner, wants to test if her two managers perform at the same level using data on customer complaints.
Conducting a two-sample test to determine if the mean complaints differ between the two managers.
Key Note: Ensure that all conditions for performing the test are met.

4.2 Caution in Testing

Do not confuse:
- Two-sample test with matched pairs test.
- Ensure samples are independent and from entirely different populations.

4.3 Example: Effectiveness of Hypnotism

A study investigates hypnotism in reducing pain using matched pairs.
Data shows before and after scores; differences calculated.
Objective: Test at a 5% significance level if average sensory measurements are lower after treatment.

5. Confidence Intervals for the Difference in Parameters

5.1 Overview

Use confidence intervals to determine differences between two population parameters (proportions or means).

6. Confidence Intervals for the Difference in Proportions

6.1 Constructing Confidence Intervals

Sample Scenario:
- Age 18-29: 316 users, 26% use Twitter.
- Age 30-49: 532 users, 14% use Twitter.
Task: Construct and interpret a 90% confidence interval for the difference in proportions of Twitter users among these age groups.

7. Confidence Intervals for the Difference in Means

7.1 Example with Professional Athletes

Sample 1: 45 professional football players, mean height = 6.18 feet, SD = 0.37.
Sample 2: 40 professional basketball players, mean height = 6.45 feet, SD = 0.31.
Task: Find a 95% confidence interval for the difference in mean heights.
Assess whether football players are generally shorter than basketball players.

8. Matched Pairs t-Interval

Study of high school students to compare sitting vs. standing pulse rates.
Objective: To conclude if sitting pulse rates are significantly lower than standing rates.

9. Calculator Tricks

9.1 Using Pooled Options for t-Intervals

For t-intervals, pooled option is usually insignificant as standard deviations are often unknown.
Recommendation: 99% of the time, leave as "no" for pooled standard deviation.

9.2 Matched Pairs t-Interval

Matched pairs t-intervals analyzed from a single group of subjects measuring differences between treatments.