Hypothesis Testing: Two Sample Tests
Hypothesis Testing with Two Samples
Null and Alternative Hypotheses
- Null Hypothesis: No significant difference between the sample means of the two groups.
- Alternative Hypothesis: There is a significant difference between the sample means of the two groups.
Critical Region
- The critical region is determined by the alpha level . For in a two-tailed test, the critical values are .
Formulas for Test Statistic
The test statistic is calculated differently for two-sample tests. The general formula is:
Where:
- and are the sample means of the two groups.
- and are the population means of the two groups.
Under the null hypothesis, , so the formula simplifies to:
Expanding the Formula
The standard deviation of the sampling distribution needs to be calculated. The formula depends on whether the population standard deviations are known.
If population standard deviations are known:
Where:
- and are the variances of the two populations.
- and are the sample sizes of the two groups.
If population standard deviations are unknown:
Where:
- and are the sample variances of the two groups.
- and are the sample sizes of the two groups.
Decision Making
- After calculating the test statistic , compare it to the critical values.
- If |z| > 1.96 (for ), reject the null hypothesis.
- If , fail to reject the null hypothesis.
Factors Influencing the Decision
Size of the Difference Between Sample Statistics:
- If the difference between and is large, it is more likely to reject the null hypothesis. This means the calculated value will likely fall beyond the critical value.
- Example: If group 1 scores 90% and group 2 scores 70%, the large difference increases the likelihood of rejecting the null hypothesis.
Alpha Level:
- A larger alpha level (e.g., ) makes it more likely to reject the null hypothesis.
- Analogy: A lenient teacher (higher alpha) is more likely to give higher grades, making it easier to score well.
One-Tailed vs. Two-Tailed Test:
- One-tailed tests are more likely to reject the null hypothesis because the entire alpha level is concentrated in one direction.
- Two-tailed tests split the rejection region, making it more strict.
Sample Size:
- A larger sample size makes it more likely to reject the null hypothesis.
- A larger sample size makes the denominator of the statistic smaller, resulting in a larger value if the numerator (difference in means) is constant.
- If is large and the denominator is small, then is large.
- A survey of 1000 people is more reliable than a survey of 10 people.
Example Question
- Question: Do athletes in different sports (basketball vs. football) vary in terms of their readiness for college, based on college entrance exam scores?
Solving the Example
Problem Statement: Determine if there is a significant difference in the average entrance exam scores of basketball and football players.
Test Selection: Use an independent two-sample z-test.
- The exam scores of football players do not impact the exam scores of basketball players, hence they are independent.
- Sample sizes are greater than 30.
Critical Values:
- Given in a two-tailed test, the critical values are .
Formula:
Calculations:
Given:
- Basketball players: , ,
- Football players: , ,
Standard Deviation of Sampling Distribution:
Z-score:
Decision:
- Since is between and , fail to reject the null hypothesis.
Conclusion:
- There is no evidence to suggest that basketball and football players have significantly different college entrance exam scores. The observed difference is likely due to random chance.