Comparing two means involves similar concepts as comparing two proportions.
For independent random variables, variances add together.
To find the standard deviation for the difference of two sample means:
Use the formula:
Sample Variance 1 / n1
Sample Variance 2 / n2
Add these variances together.
Square root the total to find the standard deviation.
Since population standard deviations for groups are typically unknown, we use standard error instead of standard deviation:
Substitute sample variances for population variances in formulas.
The appropriate model for means is the Student's t-distribution, leading to a:
Two-sample t interval for the difference in means.
Two-sample hypothesis test.
Calculators can perform these tests, specifically:
Option 4: Two-sample t test
Option 0: Two-sample t interval
To find the t value:
Use the formula:
(Difference in sample means - Difference in population means) / Standard error of the difference.
The model follows a Student's t distribution with degrees of freedom estimated via:
Two sample sizes added together minus 2 (n1 + n2 - 2).
Exact degrees of freedom may result in a decimal value, provided by the calculator.
Several assumptions and conditions must be met:
Independence Assumption: Requires randomization, indicating data can be generated from a simple random sample or randomized experiment.
10% Condition: Generally not checked for difference of means unless sample sizes are quite large relative to population size.
Normal Population Assumption: Nearly normal condition must be checked for both groups, confirmed using histograms or normal probability plots.
Independence of Groups Assumption: Groups compared must be independent. For dependent groups, a different method is used.
The formula to calculate confidence intervals:
Difference of sample means ± Margin of error.
Margin of error = Critical t value × Standard error of the difference between means.
Critical value determined by confidence level and degrees of freedom.
The hypothesis test checks:
Null hypothesis (Ho): The difference in population means = 0 (no difference).
Alternative hypothesis (Ha): The difference is not equal to 0 (indicating a difference).
Conduct the t-test with the t statistic formula:
(Difference in sample means - Null hypothesis value) / Standard error.
Obtain the p-value to decide whether to reject Ho.
Example: Compare mean pulse rates of smokers (mean = 80, sd = 5, n = 26) and nonsmokers (mean = 74, sd = 6, n = 32).
Hypotheses:
Ho: Mean smokers - Mean nonsmokers = 0
Ha: Mean smokers - Mean nonsmokers ≠ 0
Confirm conditions:
Random samples, under 10% population, nearly normal data.
Resulting t value found to be approximately 4.15, with a corresponding p-value indicating strong evidence against the null.
Pooled T-test: Assumes equal variance; often unnecessary unless certain conditions apply.
Use caution as equal means do not imply equal variances.
For most tests, it is safer not to pool.
Assumptions made in randomized comparative experiments can allow for pooling but should be approached with caution.
Analyze results to determine if one group is significantly different.
Example analysis of saturated fat content in pizzas:
Conduct two-sample t-test for fat content:
Null: No difference in fat content
Alternate: There is a difference
Results led to conclusions drawn from t-value and p-value calculations, confirming the significant difference in content.
Visual representation through histograms and normal probability plots assists in corroborating findings.
Confidence interval results provide a range to quantify the significant differences.