Confidence Intervals and Hypothesis Testing in Biostatistics

Confidence Intervals

Formulas for Test 2

Point Estimate (PE) and Margin of Error (MOE)
- Formula:
  \text{Point Estimate} + \text{Margin of Error} = \text{PE} + \text{MOE}
- Lower Confidence Interval:
  \text{Lower C.I.} = \text{PE} - \text{MOE}
- Upper Confidence Interval:
  \text{Upper C.I.} = \text{PE} + \text{MOE}

### The equations for estimating parameters

\text{PE} = \text{(mean or proportion)}
- Margin of Error (MOE)
For proportions:
\text{MOE} = \text{z} \times \sqrt{\frac{p(1-p)}{n}}
For means:
\text{MOE} = t_{\frac{\alpha}{2}} \times \frac{s}{\sqrt{n}}

Inference Using Single Sample and Two Samples

Dependent Samples

Confidence Interval:
- Formula:
  \text{C.I.} = d \pm \frac{t{\alpha/2} \cdot Sd}{\sqrt{n}}
- Where,
- $d$ = difference of sample means
- $S_d$ = standard deviation of differences
- $n$ = sample size
- Test Statistic:
  T = \frac{\bar{x} - d0}{Sd / \sqrt{n}}

Independent Samples

Non-Pooled is used when variances are unequal
- Test Statistic:
- T = \frac{\bar{x1} - \bar{x2}}{Sp \cdot \sqrt{\frac{1}{n1} + \frac{1}{n_2}}}
- Where:
- Sp = \sqrt{\frac{(n1 - 1)S1^2 + (n2 - 1)S2^2}{n1 + n_2 - 2}}

Bartlett’s F Test for Variances

Formula:
- F = \frac{S1^2}{S2^2} where $S1^2 > S2^2$
Degrees of freedom
- df{numerator} = n{1} - 1
- df{denominator} = n{2} - 1

Quick TI cheats

Confidence Interval:
- Use STAT
Single Sample Tests:
- For mean: TINTERVAL for confidence interval
- For proportion: 1-PROPZINT
Two Independent Samples:
- Pooled: 2-SAMPTTEST
- Non-Pooled: 2-SAMPTTEST

Principles of Biostatistics - Test 2 Instructions

Show all work for credit.
Using calculators:
- Allowed for checking work; must show commands and round answers to 4 decimal places.

Hypothesis Testing Example: Comparing Average Salaries

Department A vs. Department B

Sample Size from A:
- n_A = 40
Average Salary from A:
- \bar{x}_A = 55000
Standard Deviation from A:
- S_A = 5000
Sample Size from B:
- n_B = 35
Average Salary from B:
- \bar{x}_B = 53500
Standard Deviation from B:
- S_B = 4800

Hypotheses

State the hypotheses:
- Null hypothesis:
- H0: \sigma{A}^2 = \sigma_{B}^2
- Alternative hypothesis:
- HA: \sigma{A}^2 > \sigma_{B}^2

Test Statistic

Calculate the test statistic using the F-test formula:
- F = \frac{SA^2}{SB^2}

Decision and Interpretation

Determine p-value from the F-distribution.
Decision:
- Reject $H_0$
- Fail to reject $H_0$

Orange Yield Comparison Example: Florida vs. California

Sample Means from Table 1:
- Florida Mean: \bar{x}_{FL} = 31.67
- California Mean: \bar{x}_{CA} = 28.73
Hypothesis Test:
- Test if yield in FL is more than in CA.
Statistical outputs:
- p-value from Bartlett's Test:
- p = 0.179

Hypotheses for Yield Comparison

State the hypotheses:
- Null: H0: \mu{FL} \leq \mu_{CA}
- Alternative: HA: \mu{FL} > \mu_{CA}

Interpretation of Results

Decision:
- If p > 0.05, fail to reject $H_0$
- If p \leq 0.05, reject $H_0$

Daily Step Count Analysis from Fitness Study

Study Details

Sample Size: 16 individuals
Mean Daily Step Count: 779.38 mg/dL

Data Input

Daily Step Count Before and After table.
Analyze possible shifts in average step count.

Confidence Interval Calculation

Construct a 95% confidence interval:
- Compute t_{\frac{\alpha}{2}} & proposals for CI
- Results: Calculate and round as necessary.

Evaluation of New Study Results

Findings suggest:
Confidence Interval for weight loss:
- Reported as (0.51, 0.69)
Calculate:
- Point Estimate (P.E.)
- Margin of Error (M.O.E.)
- \text{P.E.} = \frac{0.51 + 0.69}{2} = 0.601
- \text{M.O.E.} = \frac{0.69 - 0.51}{2} = 0.09

Interpretation of the Results

Interpretation of Confidence Interval:
- We are 90% confident that the true, unknown population mean of weight loss μ is within the interval (0.51, 0.69).
- The results suggest a reasonable expectation about weight loss under the new program
Overall conclusion of tests must adhere to statistical significance criteria, coupled with clear articulation of conditions and methodology utilized during analyses.

Hypothesis Testing Example: Comparing Average Salaries

Department A vs. Department B

Sample Size from A:
n_A = 40
Average Salary from A:
\bar{x}_A = 55000
Standard Deviation from A:
S_A = 5000
Sample Size from B:
n_B = 35
Average Salary from B:
\bar{x}_B = 53500
Standard Deviation from B:
S_B = 4800

Hypotheses

State the hypotheses:
- Null hypothesis:
  H0: \sigmaA^2 = \sigma_B^2
- Alternative hypothesis:
  HA: \sigmaA^2 > \sigma_B^2

Test Statistic

Calculate the test statistic using the F-test formula:
- Formula:
  F = \frac{SA^2}{SB^2}
- Substitute the values:
  F = \frac{(5000)^2}{(4800)^2}
- Calculation step:
  F = \frac{25000000}{23040000} \approx 1.087

Decision and Interpretation

Determine the p-value from the F-distribution based on the calculated F statistic and degrees of freedom.
Decision:
- If p-value < significance level (e.g., 0.05), reject $H_0$.
- If p-value ≥ significance level, fail to reject $H_0$.

Orange Yield Comparison Example: Florida vs. California

Sample Means from Table 1:
Florida Mean:
\bar{x}_{FL} = 31.67
California Mean:
\bar{x}_{CA} = 28.73
Hypothesis Test:
- Test if yield in FL is more than in CA.

Hypotheses for Yield Comparison

State the hypotheses:
- Null:
  H0: \mu{FL} \leq \mu_{CA}
- Alternative:
  HA: \mu{FL} > \mu_{CA}

Interpretation of Results

Decision:
- If p > 0.05, fail to reject $H_0$
- If p \leq 0.05, reject $H_0$

Daily Step Count Analysis from Fitness Study

Study Details

Sample Size: 16 individuals
Mean Daily Step Count: 779.38 mg/dL

Data Input

Daily Step Count Before and After table.
Analyze possible shifts in average step count.

Confidence Interval Calculation

Construct a 95% confidence interval:
- Compute t_{\frac{\alpha}{2}} & proposals for CI.
- Results: Calculate and round as necessary.

Evaluation of New Study Results

Findings suggest:
- Confidence Interval for weight loss:
- Reported as (0.51, 0.69)
- Calculate:
- Point Estimate (P.E.)
- Margin of Error (M.O.E.)
- \text{P.E.} = \frac{0.51 + 0.69}{2} = 0.601
- \text{M.O.E.} = \frac{0.69 - 0.51}{2} = 0.09

Interpretation of the Results

Interpretation of Confidence Interval:
- We are 90% confident that the true, unknown population mean of weight loss μ is within the interval (0.51, 0.69).
- The results suggest a reasonable expectation about weight loss under the new program.
- Overall conclusion of tests must adhere to statistical significance criteria, coupled with clear articulation of conditions and methodology utilized during analyses.