Understanding Statistical Inference: Confidence Intervals and Hypothesis Testing
Focus on Logic, Not Memorization of Proofs
The primary goal is to understand the logic behind statistical results, not to memorize complex proofs.
Proofs are useful to understand once to grasp the underlying meaning of a result, such as knowing that X∗ is a random variable with a specific distribution and the logic for calculating a confidence interval from it.
The emphasis is on the practical implications and understanding what the results mean, rather than the exact mathematical derivations.
Population Parameters and Sampling
Population too big: Often, we deal with populations that are too large to measure every member.
Unknown Parameters: We don't know the true parameters of these populations (e.g., mean (μ), variance (σ2), standard deviation (σ)).
Comparison of Populations: We might want to compare parameters between two or more populations (e.g., μ<em>x1 vs. μ</em>x2 or ratio of variances).
Sampling: To learn about population parameters, we take a sample, which is a set of observations.
Random Event: The act of drawing a sample is a random event, which can be represented by random variables.
Example: Choosing 10 students from MSU results in different sets of students each time.
Point Estimators
Definition: A single value (a statistic derived from a sample) used to estimate an unknown population parameter.
Formation: Sample data is 'put together' (e.g., summed, averaged) to estimate parameters.
Examples:
Sum: ∑Xi
Average (Sample Mean): $\bar{X} = \frac{1}{n} \sum{i=1}^{n} Xi
Sample Variance: s2=n−1∑(Xi−Xˉ)2 (an unbiased estimator for population variance).
Alternative Sample Variance: s<em>b2=n∑(X</em>i−Xˉ)2 (often called the maximum likelihood estimator, but it is a biased estimator for population variance).
Properties of Estimators:
Unbiasedness: An estimator θ^ is unbiased if its expected value equals the true population parameter θ (i.e., E[θ^]=θ).
Example: E[Xˉ]=μX (sample mean is an unbiased estimator of population mean).
Example: E[s2]=σ2 (sample variance with n−1 in denominator is an unbiased estimator of population variance).
Limitations of Point Estimators: Even if unbiased, a single point estimate doesn't convey the quality or accuracy of the estimation. Increasing sample size (n) generally improves quality, but we need more precise measures.
Interval Estimators (Confidence Intervals)
Purpose: To provide a range of values (an interval) within which the true population parameter is likely to lie, along with a specified level of confidence.
Formulation: Finding two estimators (a lower bound θ^<em>1 and an upper bound θ^</em>2) such that the probability of the true parameter θ being between them is 1−α (e.g., P(θ^<em>1≤θ≤θ^</em>2)=1−α).
Example: The mean height of MSU students is between 5.7 and 5.8 feet with 99.95% confidence.
Interpretation: If we repeat the sampling and interval calculation many times, 1−α of these intervals would contain the true population parameter. (e.g., if 1−α=0.9995, then in 10,000 universities, 9995 of the calculated intervals would contain the actual mean height).
Derivation for Population Mean (μ) with Known Population Standard Deviation (σ):
Assumption: For sufficiently large sample sizes (n≥30), the sample mean Xˉ is approximately normally distributed: Xˉ∼N(μ,nσ2).
Standardization: Convert Xˉ to a standard normal variable (Z-score): Z=σ/nXˉ−μ.
Probability Statement: From the Z-distribution table, we find critical values ±Z<em>α/2 such that: P(−Z</em>α/2≤Z≤Zα/2)=1−α
Rearranging to find Confidence Interval for μ: P(Xˉ−Z<em>α/2nσ≤μ≤Xˉ+Z</em>α/2nσ)=1−α
The lower bound is θ^<em>1=Xˉ−Z</em>α/2nσ and the upper bound is θ^<em>2=Xˉ+Z</em>α/2nσ.
Derivation for Population Mean (μ) with Unknown Population Standard Deviation (σ) or Small Sample Size (n<30):
Use the t-distribution instead of the Z-distribution.
Replace population standard deviation σ with sample standard deviation s: T=s/nXˉ−μ with n−1 degrees of freedom.
The process for finding the interval is otherwise similar.
Derivation for Population Variance (σ2):
Uses the Chi-squared (χ2) distribution.
Test Statistic:σ2(n−1)s2∼χn−12.
Probability Statement: Find critical values χ2<em>α/2 and χ2</em>1−α/2 from the Chi-squared table such that: P(χ2<em>α/2≤σ2(n−1)s2≤χ2</em>1−α/2)=1−α
* (Note: The speaker clarifies that for the upper bound of the interval for χ2, one typically uses χ2<em>1−α/2, which corresponds to the smaller critical value on the left tail, and χ2</em>α/2 for the right tail, since the inequality is inverted when solving for σ2)
Rearranging to find Confidence Interval for σ2: P(χ2<em>1−α/2(n−1)s2≤σ2≤χ2</em>α/2(n−1)s2)=1−α
Controlling Confidence Interval Properties:
Width vs. Confidence: To increase confidence (1−α) while keeping sample size (n) fixed, the interval must become wider.
Width vs. Sample Size: To keep the interval width fixed while increasing confidence, or to narrow the interval for fixed confidence, the sample size (n) must be increased.
This trade-off helps in deciding optimal sample size based on desired precision and confidence.
Other Parameters: Similar procedures exist for other parameters like proportions (p<em>1−p</em>2) or ratios of variances ($\sigma1^2 / \sigma2^2 which uses the F-distribution, as seen in Fisher's technique).
Hypothesis Testing
Core Idea: To check if a claim or belief, formulated as a hypothesis, is supported by available data and evidence.
Example: A belief that MSU students are taller than Missouri students.
Formulating Hypotheses:
Null Hypothesis (H0): A statement about a population parameter that is assumed to be true until evidence suggests otherwise. It typically includes an equality.
Example: The mean height of MSU students is 5.6 feet (H0:μ=5.6).
Alternative Hypothesis (H<em>a or H</em>1): A statement that contradicts the null hypothesis. It can be:
Two-sided: The parameter is not equal to a specific value (e.g., Ha:μ=5.6).
One-sided: The parameter is greater than or less than a specific value (e.g., H<em>a:μ>5.6 or Ha: \mu < 5.6).
The null and alternative hypotheses should collectively cover all possibilities in the parametric space.
Logic of Hypothesis Testing:
Assumption: Assume the null hypothesis (H0) is true.
Sampling and Test Statistic: Take a sample and calculate a test statistic (e.g., Xˉ).
Probability under H<em>0: Determine the probability of observing a test statistic as extreme as, or more extreme than, the one obtained, assuming H</em>0 is true.
Decision: If this probability is very low (below a predetermined significance level α), then it casts doubt on the initial assumption that H<em>0 is true, and we reject H</em>0. Otherwise, we fail to reject H0.
Critical Region (Rejection Region):
Significance Level (α): A chosen threshold (e.g., 0.05 or 5%). It represents the maximum probability of making a Type I error.
Critical Values: Based on α and the distribution of the test statistic (assuming H0 is true), we find critical values that define the boundaries of the critical region.
Decision Rule: If the calculated test statistic falls within the critical region, we reject H0.
Example for Mean (μ) with Known σ (Two-sided test H<em>0:μ=μ</em>0 vs. H<em>a:μ=μ</em>0):
Test statistic: Z=σ/nXˉ−μ0.
Critical Region: Reject H<em>0 if Z<−Z</em>α/2 or Z > Z_{\alpha/2}.