Comparing Means
Comparing Means
Overview of Comparing Means
Investigates if a child’s birthweight is affected by the mother's marital status.
Analyzes summary measures within the categorical variable of marital status, which is a nominal variable.
Data Presentation
Table 5.16.8: Summary Statistics of Birthweight by Marital Status (Brisbane Babies 1984-1988)
Marital Status: The categories involved are Married, Never Married, Living Together, and Other.
N: Sample size for each category.
Mean (kg): Average birthweight.
Standard Deviation (SD) (kg): Variation in birthweights within each category.
Minimum (kg): Lowest recorded birthweight.
Maximum (kg): Highest recorded birthweight.
Marital Status | N | Mean (kg) | SD (kg) | Minimum (kg) | Maximum (kg) |
|---|---|---|---|---|---|
Married | 363 | 3.57 | 0.42 | 2.33 | 4.85 |
Never married | 70 | 3.64 | 0.44 | 2.34 | 5.06 |
Living together | 53 | 3.38 | 0.42 | 2.27 | 4.18 |
Other | 14 | 3.40 | 0.40 | 2.86 | 4.28 |
Total | 500 | 3.56 | 0.43 | 2.27 | 5.06 |
Comparative Analysis
Differences in Birthweights:
Never Married: Babies are slightly heavier than those born to Married women by 0.07 kg.
Living Together: Babies are lighter by 0.19 kg compared to Married women.
Other: Babies are lighter by 0.17 kg than Married women.
Standard Error of the Mean (SE): Used to assess sampling variability of the mean and derive Confidence Intervals (CIs).
Detailed Birthweight Analysis
Table 5.16.9: Mean Birthweight, SE, and 95% CI by Marital Status
Mean: Reflects average birthweights across groups.
Standard Error (SE): Estimate of the variance of the sample mean.
95% CI: Ranges that estimate the true population mean based on sample data.
Marital Status | N | Mean (kg) | SE (kg) | 95% CI |
|---|---|---|---|---|
Married | 363 | 3.57 | 0.022 | (3.53, 3.62) |
Never married | 70 | 3.64 | 0.052 | (3.53, 3.74) |
Living together | 53 | 3.38 | 0.058 | (3.27, 3.50) |
Other | 14 | 3.40 | 0.108 | (3.16, 3.63) |
Total | 500 | 3.56 | 0.019 | (3.52, 3.59) |
Variability Observations
Standard Errors: Differences in group sizes lead to variation in SE:
Largest group (‘Married’) has the smallest SE.
Smallest group (‘Other’) exhibits the highest SE.
Confidence Intervals:
Overlap significantly between ‘Married’ and ‘Never married’ groups.
‘Married’ group has a narrow CI that does not overlap with the ‘Living together’ category.
The Two-Sample t-Test
Null Hypothesis Formulation
Assumes that true mean birthweights do not vary among different marital status groups.
Examines the difference in birthweights:
Married: 3.57 kg
Never married: 3.64 kg
Difference: 0.07 kg.
Test Statistic Dependency
Influenced by:
Size of the difference between means.
Standard errors of the means.
Degrees of freedom (df) calculation: df = ext{sum of sample sizes} - 2.
Specific Example of t-Test
Comparing birthweights from Married (363) and Never Married (70) mothers.
df = 363 + 70 - 2 = 431.
Results: p-value is greater than 0.10, indicating lack of significant difference.
Analysis of Variance (ANOVA)
Global Test Analysis
Conducts a comprehensive test on four means of marital status with pairwise comparisons examined post hoc.
Null hypothesis states no variation in population means across all groups.
ANOVA F-statistic
Calculate: F-statistic to evaluate variance among means relative to standard errors of the means.
Degrees of Freedom (df) in ANOVA:
Numerator df: Number of groups - 1.
Denominator df: Total sample size - number of groups.
Example Calculation:
F = 14.01.
Degrees of freedom: df = (3, 497).
p-value = 0.0029, indicating significant differences among groups.
Assumptions for t-Test and ANOVA
Normal distribution of the mean is critical for small samples.
Larger sample sizes suitable for Central Limit Theorem to apply for means distribution.
Standard deviations within groups should be approximately equal.
Assumed independence among observations from separate groups of individuals.
Non-Normal Distributions: Transformations and Nonparametric Tests
Transformation Approaches
Positive skewed distributions can be addressed using log transformation.
Non-parametric test alternatives include:
For t-test: Wilcoxon–Mann–Whitney test
For ANOVA: Kruskal–Wallis test
Repeated Measurements of Quantitative Variable
Contextual Analysis
Analyzes whether a variable varies over time, using a dietary intervention trial as an example.
Example Evaluation of Vitamin A Intake
Participants: 17 assessed for Vitamin A intake at baseline and 6 months post-intervention.
Table 5.16.10: Vitamin A Levels
Person | Baseline | 6 Months | Change (6 months - baseline) |
|---|---|---|---|
1 | 120 | 124 | 4 |
2 | 122 | 135 | 13 |
3 | 126 | 140 | 14 |
4 | 131 | 130 | -1 |
5 | 135 | 169 | 34 |
9 | 157 | 189 | 32 |
7 | 160 | 176 | 16 |
8 | 181 | 196 | 15 |
9 | 195 | 223 | 28 |
10 | 200 | 254 | 54 |
11 | 205 | 267 | 62 |
12 | 211 | 289 | 78 |
13 | 215 | 301 | 86 |
14 | 220 | 227 | 7 |
15 | 246 | 305 | 59 |
16 | 255 | 278 | 23 |
17 | 278 | 299 | 21 |
Mean (SD) | 185.7 (49.6) | 217.8 (65.4) | 32.1 (26.3) |
Assessment of Group Independence
Not met; most 6-month values exceed baseline values.
Most appropriate summary statistic for repeated measurements: mean change within individuals.
Mean Change in Vitamin A Levels: 32.1 micrograms with a range from -1 to +86 micrograms, SD of 26.3 micrograms.
Confidence Interval Calculation
Calculate 95% CI for the mean change:
ext{95% CI} = ( ext{mean change} - 1.96 imes SE( ext{mean change}), ext{mean change} + 1.96 imes SE( ext{mean change}))
Result: (19.5, 44.6), with the lower limit above zero.
Paired or One-Sample t-Test Application
Assumes paired measures; evaluates significance of the change within pairs.
t-statistic calculation with N pairs (df = N - 1).
Calculated t = 5.02, with 16 df and p = 0.0001, indicating statistically significant changes in vitamin A levels.