M2.Lecture 4: T Test for Paired Data (NURS 8004 Module 2, Session 4)

  • Topic: t Test for Paired Data (Paired t-test / t test for dependent groups)

  • Context: Comparing means of two paired groups; useful when the same subjects are measured twice or when subjects are naturally paired (e.g., twins, matched pairs).

  • Key assumptions:

    • Normality of the difference scores (the distribution of the paired differences is approximately normal).
    • DV is measured at interval/ratio level.
    • Pairs are dependent; observations within a pair are matched.
  • Alternative name: t Test for Dependent Groups.

  • General steps for hypothesis testing (as reviewed):
    1) Develop null and research hypotheses.
    2) Choose a level of significance (alpha).
    3) Determine which statistical test is appropriate.
    4) Run analysis to obtain test statistic and p value.
    5) Make a decision about rejecting or failing to reject the null hypothesis.
    6) Make a conclusion.

  • Example study (from transcript):

    • Design: 12 sets of identical twins; one twin attended preschool the year before kindergarten, the other stayed at home.
    • Outcome: IQ measured later.
    • This is a paired design because each twin pair forms a natural match.
  • Reported results (example data):

    • Preschool group mean IQ = 103, SD = 4.17 (n = 12).
    • Home group mean IQ = 104.9, SD = 3.95 (n = 12).
    • Test statistic: t(11) = 2.110, p = 0.059.
    • Result: p > α (0.05); fail to reject the null hypothesis.
  • Practical takeaway: Although the test statistic is reasonably large and the p-value is close to 0.05, with α = 0.05 there is not enough evidence to conclude a difference in IQ between preschool and home conditions in this paired sample.

  • Important caveat: Failure to reject H0 does not prove there is no difference; it may reflect limited power or small sample size (n = 12 pairs).

  • Relevance to study design: Paired designs control for between-subject confounds (e.g., genetics in twins) and focus on within-pair differences.

  • Ethical/interpretive note: When interpreting results from paired designs, especially with twins or sensitive outcomes like IQ, ensure ethical considerations (consent, privacy) and avoid overgeneralization beyond the matched context.

  • Formulae and calculations (key relationships):

    • Paired difference for each pair: di = X{i, ext{preschool}} - X_{i, ext{home}}
    • Mean difference: ar{d} = rac{1}{n} \, \sum{i=1}^{n} di
    • Standard deviation of differences: sd = ext{sd}(d1, d2, …, dn)
    • Paired t-statistic: t = \frac{\bar{d}}{s_d / \sqrt{n}}
    • Degrees of freedom: df = n - 1
    • Test interpretation: compare observed t to the critical value from the t distribution with df degrees of freedom, or use the p-value, e.g., p = 0.059 for this example.
  • Specific data interpretation from the transcript:

    • Given: t(11) = 2.110, \quad p = 0.059 and alpha \alpha = 0.05.
    • Decision: Since p > \alpha, fail to reject the null hypothesis.
    • Null hypothesis in this context: H0: \mud = 0 (no mean difference in IQ between preschool and home groups).
    • Alternative hypothesis in this context (two-tailed as implied): Ha: \mud \neq 0 (there is a difference in mean IQ between the two conditions).
    • Conclusion wording (per transcript): There will be no difference in IQ between the preschool group and the home group at the 0.05 significance level.
  • Connections to foundational principles:

    • This is a classic example of using a matched-pairs design to control for confounding variables (e.g., genetic and early-life factors in twins).
    • Demonstrates the logic of null vs. alternative hypotheses, signficance testing, and interpretation of p-values in the context of paired data.
    • Illustrates how a non-significant p-value near the threshold can occur even with a relatively large t-statistic, highlighting the role of sample size and variance in statistical power.
  • Practical implications and extensions:

    • If power were a concern, increasing the sample size (more twin pairs) could reduce the standard error of the difference and potentially yield a significant result if a true difference exists.
    • Reporting effect size for paired data (e.g., Cohen's d for paired samples) can provide information about practical significance beyond p-values (requires the standard deviation of the differences).
    • Researchers should check the normality of the difference scores (e.g., via Q-Q plots or Shapiro-Wilk test) to validate the assumption.
  • Notes on formatting and presentation in reports:

    • Always report: test statistic, degrees of freedom, and p-value (e.g., t(11) = 2.110, \; p = 0.059).
    • Include sample sizes for each condition and the mean and standard deviation for each set of paired observations.
    • When interpreting results, distinguish between statistical significance and practical significance; discuss potential limitations (e.g., small sample size, measurement error).
  • Summary takeaway:

    • The t-test for paired data compares mean differences within matched pairs.
    • In this example, the observed difference in IQ between preschool vs home conditions was not statistically significant at the 0.05 level (p = 0.059), given 12 pairs.