Chapter 9: Testing a Claim - Significance Tests
Chapter 9: Testing a Claim
Section 9.1: Significance Tests: The Basics
Learning Targets
STATE appropriate hypotheses for a significance test about a population parameter.
INTERPRET a P-value in context.
MAKE an appropriate conclusion for a significance test.
INTERPRET a Type I error and a Type II error in context, providing a consequence of each error in a given setting.
Stating Hypotheses
Confidence intervals and significance tests are the two most common methods of statistical inference.
Confidence Intervals: Provide a range of values for the population parameter based on sample data.
Significance Tests: Weigh evidence for or against a particular claim using observed data.
The Process of a Significance Test
A significance test (or hypothesis test) is a formal procedure used to decide between two competing claims, known as hypotheses.
null hypothesis (H0): The claim that we weigh evidence against (e.g., no effect or no difference).
alternative hypothesis (Ha): The claim we are trying to find evidence for (indicating some effect or difference).
Example: A free-throw shooter claims that his proportion of made free throws is $p = 0.8$ ; we suspect exaggeration, setting:
Null hypothesis: $H0: p = 0.80$
Alternative hypothesis: $Ha: p < 0.80$
Types of Alternative Hypotheses
The alternative hypothesis can be:
One-Sided: Indicates that a parameter is either greater than or less than the null value.
Examples:
$H0: p = 0.80$, $Ha: p < 0.80$ (less than)
$H0: p = 0.80$, $Ha: p > 0.80$ (greater than)
Two-Sided: States that a parameter is different from the null value (could be either direction).
Example: $H0: p = 0.80$, $Ha: p
eq 0.80$
Defining Hypotheses in Context
Practical application: For the Hawaii Pineapple Company, where the mean weight of pineapples last year was 31 ounces, with a new irrigation system installed:
Appropriate hypotheses:
$H0: µ = 31$ (null hypothesis: mean weight remains the same)
$Ha: µ
eq 31$ (alternative hypothesis: mean weight has changed)where $µ$ = the true mean weight of all pineapples grown in the field this year.
Importance of Hypotheses
Hypotheses should express beliefs or suspicions prior to analyzing data.
Always refer to population parameters, not sample statistics.
Incorrect: $H0: ar{p}= 0.80$ or $Ha: ar{x}= 31$
Interpreting P-values
A player claiming to make $80\%$ of his free throws makes only $ar{p} = 0.64$ in a random sample of 50 trials.
This serves as evidence against $H0: p = 0.80$ and supports $Ha: p < 0.80$.
P-value Definition: Probability of observing evidence as strong or stronger than the current evidence if $H0$ is true.
Example numerical calculation:
$P-value ext{ approx } rac{3}{400} = 0.0075$
Indicates how likely it is for an $80\%$ shooter to achieve $64\%$ or lower simply by chance.
Hypothetical Research Problem: Calcium Intake
The NIH recommends $1300\text{ mg}$ daily for teenagers.
A test is constructed with:
Null hypothesis: $H0: µ = 1300$
Alternative hypothesis: $Ha: µ < 1300$
Sample: 20 teens report average intake $ar{x} = 1198$ mg, with standard deviation $s_x = 411$ mg.
Resulting P-value obtained: $0.1404$.
Interpretation:
(a) If $H0$ is true, average intake is $1300$ mg.
(b) If average intake is truly $1300$ mg, there's a $0.1404$ probability of getting a sample average of $1198$ mg or less by random chance.
Making Conclusions from P-values
Decisions in significance tests based on P-value strength:
Reject $H0$ if the result is unlikely due to chance alone (small P-value).
Fail to reject $H0$ if the result is not unlikely (large P-value).
Guidelines for Drawing Conclusions
If $P-value < 0.05$, reject $H0$ and conclude evidence for $Ha$.
If $P-value \, ext{ is not small}$, fail to reject $H0$ (not convincing evidence for $Ha$).
Significance Level (α)
Significance level $ ext{α}$ defines the boundary for determining statistical significance.
Common levels are $ ext{α = 0.05}$, $ ext{α = 0.01}$, and $ ext{α = 0.10}$.
Preference varies depending on whether Type I or II errors are more serious:
Type I Error: Rejecting $H0$ when it's true.
Probability is equal to $ ext{α}$.
Type II Error: Failing to reject $H0$ when $Ha$ is actually true.
Type I and II Errors in Context
Example: Potato chip production scenario, testing for blemishes in potatoes.
Type I Error: Concludes >8% blemished when it’s really 8%.
Consequence: Rejects good shipment, wasting resources.
Type II Error: Fails to find evidence for >8% blemishes when they exist.
Consequence: Produces bad chips, harms reputation and reduces sales.
Summary
Hypotheses define the framework for significance tests.
P-values assist in interpreting evidence against the null hypothesis.
Conclusions guide decision-making based on statistical significance levels and the implications of Type I and Type II errors.