Section 3.3: Constructing Bootstrap Confidence Intervals
Course: Statistics 101, University of Canterbury
Unit B: Understanding Inference
Confidence Intervals
Sampling Distributions
Understanding and Interpreting Confidence Intervals
Constructing Bootstrap Confidence Intervals
Bootstrap Confidence Intervals using Percentiles
Hypothesis Testing
Introducing Hypothesis Tests
Measuring Evidence with P-values
Determining Statistical Significance
A Closer Look at Testing
Key Skills:
Selecting a bootstrap sample and computing a bootstrap statistic
Using technology to create a bootstrap distribution
Estimating the standard error from bootstrap distributions
Constructing a 95% confidence interval based on the sample statistic and standard error from a bootstrap distribution
Standard Error (SE):
Standard deviation of the sampling distribution
Margin of Error (MoE):
For a 95% CI: MoE = 2 × SE
Confidence Interval:
formula: statistic ± MoE
Calculate statistic for each sample
Concern: Only one sample available, leading to uncertainty on how sample statistics vary.
Bootstrapping as a method to assess this variability.
Metaphor: "Pull yourself up by your bootstraps" illustrates achieving the seemingly impossible without outside help—applies to statistical inference.
Learning goal: Where might the "actual" proportion (p) of blue M&Ms be found?
Assumptions needed: Sample must be representative of the hypothetical population to generalize results.
Random Sampling: Essential for accurate statistical inference.
Visualizes the concept of sampling from a hypothetical population composed of the original sample.
How repeated samples from a hypothetical population are simulated.
Practicality: Sample drawn with replacement from the existing sample elements.
Scenario: Random sample of 9 dogs used to demonstrate bootstrapping.
Illustrates the generation of a bootstrap sample from an original sample.
Bootstrap Sample: Random sample with replacement of the same size as the original sample.
Bootstrap Statistic: Statistic computed from a bootstrap sample.
Bootstrap Distribution: Distribution of numerous bootstrap statistics.
Example provided for testing if generated samples can be valid bootstrap samples from the original values.
Samples vs. Statistics: Original sample, bootstrap samples, bootstrap statistics visualized.
Dotplot representation of bootstrap samples and comparative statistics (mean and standard error).
Sampling distribution centered around population parameter, whereas bootstrap distribution centers on sample statistic.
Bootstrapping used to assess precision of original estimates—not to improve point estimates.
Variability in bootstrap statistics reflects variability in sample statistics.
Standard error can be estimated from bootstrap distribution's standard deviation.
Formula for confidence intervals: statistic ± 2 × SE, where SE is derived from bootstrap distribution.
Standard Error: Standard deviation of the bootstrap distribution.
Margin of Error: MoE = 2 × SE, for 95% CI.
Construct confidence interval: statistic ± MoE.
Bootstrapping can assess uncertainty in various sample statistics:
Proportion (p)
Difference in means (µ1 - µ2)
Difference in proportions (p1 - p2)
Scenario: Randomly sampled n = 25 Mustangs to estimate average price.
Calculated statistics from sampled Mustangs:
Sample Size (n) = 25
Mean price estimate (x̄) = $15.98K
Standard deviation (s) = $11.11K
Precision assessment follows.
Procedure:
Means calculated from bootstrap samples.
Repetitive calculations yield distribution data.
Dotplot for bootstrap distribution showing mean and standard error.
Survey on 2,251 individuals was conducted regarding beliefs on global warming.
Result showed 1,328 saying "yes", leading to confidence interval analysis.
Bootstrap analysis showing mean proportions and standard error from 1,000 samples.
Comparison of beliefs in global warming by political party
79% of Democrats vs. 38% of Republicans.
Confidence interval needs determination for the difference in proportions.
Distribution representation for the difference in belief proportions between Democrats and Republicans.
Steps:
Generate bootstrap samples by systematic sampling with replacement.
Compute bootstrap statistics.
Assemble bootstrap statistics into a distribution.
Resulting distributions can estimate confidence intervals if broadly symmetrical.