Lesson 9.2: Conditions and P-values for Significance Tests for a Proportion

Significance Tests for a Proportion: Overview and Methodology

  • Introduction to Significance Tests: Significance tests are employed to determine if experimental or observational results provide enough evidence to reject a null hypothesis (H0H_0) in favor of an alternative hypothesis (HaH_a). Previously, simulations were used to estimate P-values; however, standardized formulas allow for more precise calculations.
  • Example Case: Mrs. Gallas’s Free Throw Accuracy:     - Mrs. Gallas claims an 80% free throw shooting percentage (p=0.80p = 0.80).     - In a sample of 5050 free throws, she makes 3232, resulting in a sample proportion (p^\hat{p}) of 3250=0.64\frac{32}{50} = 0.64.

Defining Hypotheses and Parameters

  • Parameter Definition: The parameter pp represents the true proportion of free throw makes for Mrs. Gallas.
  • The Null Hypothesis (H0H_0): The claim being tested, assumed to be true until proven otherwise.     - H0:p=0.80H_0: p = 0.80
  • The Alternative Hypothesis (HaH_a): The claim we are looking for evidence for.     - Ha:p<0.80H_a: p < 0.80

Conditions for Performing a Significance Test

Before calculating the test statistic and P-value, three conditions must be satisfied to ensure the validity of the inference:

  • 1. Random Condition: The data must come from a random sample or a randomized experiment.     - Significance: This allows us to generalize the results to the population of all free throws.     - Status in Mrs. Gallas Case: Yes, the sample is assumed to be representative.
  • 2. 10% Condition: When sampling without replacement, the sample size (nn) should be less than or equal to 10% of the population (NN).     - Formula: n0.10Nn \leq 0.10N     - Application: 5010% (all free throws)50 \leq 10\% \text{ (all free throws)}. This confirms that sampling without replacement is acceptable.
  • 3. Large Counts Condition: Both the expected number of successes and failures must be at least 1010.     - Calculations:         - np=50×0.80=4010np = 50 \times 0.80 = 40 \geq 10         - n(1p)=50×(10.80)=1010n(1-p) = 50 \times (1 - 0.80) = 10 \geq 10     - Significance: Meeting this condition ensures that the sampling distribution of p^\hat{p} is approximately normal.

The Sampling Distribution of p^\hat{p}

  • Mean (μp^\mu_{\hat{p}}): Equivalent to the hypothesized population proportion.     - μp^=p=0.80\mu_{\hat{p}} = p = 0.80
  • Standard Deviation (σp^\sigma_{\hat{p}}): Calculated using the formula for the standard deviation of a proportion.     - σp^=p(1p)n=0.80(0.20)50=0.057\sigma_{\hat{p}} = \sqrt{\frac{p(1-p)}{n}} = \sqrt{\frac{0.80(0.20)}{50}} = 0.057
  • Distribution Notation: The sampling distribution can be modeled as approximately normal: N(0.80,0.057)N(0.80, 0.057).
  • Normal Curve Labeling: The mean is centered at 0.800.80. Standard deviations (SD) are marked as follows:     - 1 SD:0.743-1 \text{ SD}: 0.743     - 2 SD:0.686-2 \text{ SD}: 0.686     - 3 SD:0.629-3 \text{ SD}: 0.629     - +1 SD:0.857+1 \text{ SD}: 0.857     - +2 SD:0.914+2 \text{ SD}: 0.914     - +3 SD:0.971+3 \text{ SD}: 0.971

Standardized Test Statistics and P-values

  • Test Statistic Formula: A measure of how far the sample statistic deviates from the null hypothesis parameter in units of standard deviation.     - z=StatisticParameterStandard Deviation of Statisticz = \frac{\text{Statistic} - \text{Parameter}}{\text{Standard Deviation of Statistic}}     - z=p^pp(1p)nz = \frac{\hat{p} - p}{\sqrt{\frac{p(1-p)}{n}}}
  • Calculation for Mrs. Gallas (n=50n=50, p^=0.64\hat{p}=0.64):     - z=0.640.800.057=2.81z = \frac{0.64 - 0.80}{0.057} = -2.81     - Interpretation: The observed value p^=0.64\hat{p} = 0.64 is 2.812.81 standard deviations below the hypothesized mean (0.800.80).
  • P-value Calculation: The probability of obtaining a sample result as extreme or more extreme than the one observed, assuming H0H_0 is true.     - P(p^0.64p=0.80)=0.002P(\hat{p} \leq 0.64 \mid p = 0.80) = 0.002
  • Conclusion Criteria: Generally, if the P-value is less than the significance level (α=0.05\alpha = 0.05), we reject H0H_0.     - Result: Because 0.002<0.050.002 < 0.05, we reject H0H_0. We have convincing evidence that Mrs. Gallas is a less than 80% free throw shooter.

Scenario Analysis: Alternative Sample Result

  • Hypothetical Case: Suppose Mrs. Gallas made 36/5036/50 shots.     - Sample Proportion (p^\hat{p}): 3650=0.72\frac{36}{50} = 0.72     - Test Statistic (zz): z=0.720.800.057=1.40z = \frac{0.72 - 0.80}{0.057} = -1.40     - P-value: 0.0810.081
  • Interpretation of P-value: Assuming H0H_0 is true (p=0.80p = 0.80), there is a 0.0810.081 probability of getting a sample proportion of 0.720.72 or less purely by chance.
  • Decision: Because 0.081>0.050.081 > 0.05, we fail to reject H0H_0.
  • Conclusion: We do not have convincing evidence that Mrs. Gallas is a less than 80% free throw shooter.

QuickNotes: Lesson 9.2 Summary

  • Conditions for Significance Test (LTA):     - Random: Random sample or assignment.     - 10%: n0.10Nn \leq 0.10N.     - Large Counts: np10np \geq 10 and n(1p)10n(1-p) \geq 10.
  • Calculation (LTAZ):     - z=p^pp(1p)nz = \frac{\hat{p} - p}{\sqrt{\frac{p(1-p)}{n}}}
  • Interpretation of P-value: The probability of observing a result at least as extreme as the sample statistic by chance, given that the null hypothesis is true.

Case Study: Cilantro Flavor Perception

  • Context: Scientists believe 10%10\% of the population perceives a soapy flavor in cilantro. Ebise believes the proportion is higher among teenagers.
  • Study Details: Ebise takes a random sample of n=200n = 200 teenagers and finds 2525 identify the soapy flavor.
  • 1. Hypotheses:     - H0:p=0.10H_0: p = 0.10     - Ha:p>0.10H_a: p > 0.10     - Parameter definition (pp): The true proportion of all teenagers who get a soapy flavor for cilantro.
  • 2. Evidence Assessment:     - p^=25200=0.125\hat{p} = \frac{25}{200} = 0.125 (12.5%12.5\%     - This provides some evidence for HaH_a because 12.5%>10%12.5\% > 10\%.
  • 3. Check Conditions:     - Random: The problem states a "random sample of 200 teenagers," allowing generalization to the population.     - 10%: 20010% (all teenagers)200 \leq 10\% \text{ (all teenagers)}, so sampling without replacement is acceptable.     - Large Counts:         - 200(0.10)=2010200(0.10) = 20 \geq 10         - 200(0.90)=18010200(0.90) = 180 \geq 10         - Conclusion: The sampling distribution of p^\hat{p} is approximately normal.
  • 4. Standardized Test Statistic and P-value:     - z=0.1250.100.10(0.90)200=1.18z = \frac{0.125 - 0.10}{\sqrt{\frac{0.10(0.90)}{200}}} = 1.18     - P-value: 0.11930.1193
  • 5. Interpretation of P-value: Assuming H0H_0 is true (p=0.10p = 0.10), there is a 0.11930.1193 probability of getting a sample proportion of 0.1250.125 or greater purely by chance.
  • 6. Conclusion:     - Compared against α=0.05\alpha = 0.05.     - Because 0.1193>0.050.1193 > 0.05, we fail to reject H0H_0. We do not have convincing evidence that more than 10% of teenagers get a soapy flavor for cilantro.