Module 8: One-Sample Hypothesis Testing (Part 1)
Exam and Course Logistics
Office Hours Opportunities:
Two Webex office hours this week.
After next Monday's class, two in-person office hours (likely Monday and Tuesday at in the classroom) will be offered before Wednesday.
Mock Exam: A mock exam will be held next Monday during class.
An answer key and in-person availability will follow the mock exam.
Instructor's Availability/Efficiency:
The instructor will be available for questions, emphasizing efficient meetings (asking specific questions).
Instructor is moving further away but remains committed to supporting students.
Group Study: Highly recommended for efficiency; the instructor is more willing to meet with groups of students (Webex or in-person) than individuals.
Cheat Sheet Template: A template document to aid in preparing cheat sheets will be made available on Wednesday.
Attendance Tracking: Students should check their AT, AC, and AP records. AT tracks absences (good numbers: ). AC and AP track missed active learning items, costing from the activity grade per miss.
Choosing Between Z-test and T-test for Means
Key Condition: The decision to use a Z-test or T-test depends solely on whether the population standard deviation () is known.
Z-test Usage: Use a Z-test only if the question explicitly states that the population standard deviation () is known (e.g., "the population standard deviation is equal to ").
T-test Usage: If the population standard deviation () is not mentioned or unknown, assume a T-test is required.
Clarification on Sample Size ():
The rule that for one can use a Z-test (due to T and Z distributions overlapping for large sample sizes) is not the primary condition for choosing between Z and T in this class.
The only decisive factor is knowledge of the population standard deviation.
Applicability: This rule applies currently to confidence intervals for means and will be extended/modified for proportions (covered in Lecture ).
Lecture 8: One-Sample Hypothesis Testing (Part 1)
This is the concluding lecture for the statistical portion of the course.
After Lecture , the course will transition to more algebraic topics.
Understanding Hypothesis Testing
Definition: A hypothesis is a statement about population parameters (e.g., population mean, ).
Example (Cafeteria Rating):
Parameter: The average rating () for the campus cafeteria.
Population: All individuals who have used the cafeteria service (not just students, but also staff, visitors, etc.).
Practicality: It's impossible to collect data from the entire population, so sampling is necessary.
Setting Up Hypotheses
Two complementary statements are used:
Null Hypothesis (): Always includes the equality sign.
E.g., (hypothesized population mean, ).
Alternative Hypothesis (): States the opposite of the null hypothesis.
E.g., .
Complementary Nature: Together, and cover all possibilities ().
Types of Hypothesis Tests
Two-Sided (Two-tailed) Test:
Used when we want to test if a parameter is different from a specific value.
Form: vs. .
Focus of today's lecture.
One-Sided (One-tailed) Test:
Used when we are interested if a parameter is less than or greater than a specific value.
Form: vs. H1: \mu > \mu0 (or vs. H1: \mu < \mu0).
Will be discussed on Wednesday.
Three Approaches to One-Sample Hypothesis Testing
When testing a hypothesis, there are three equivalent approaches to draw conclusions:
Approach 1: Critical Value Approach
This method compares a calculated test statistic to critical values that define rejection regions.
State the Null () and Alternative () Hypotheses:
Example: For cafeteria ratings,
(The average rating is )
(The average rating is not )
This is a two-tailed test because of the "" in . The hypothesized value is .
Specify the Desired Level of Significance () and Sample Size ():
Significance Level (): The probability of rejecting a true null hypothesis. By default, (), but it can be changed. Recall .
Sample Size (): Determines the degrees of freedom. For the example, a convenience sample of students was used.
Determine the Appropriate Test Statistic (Z or T):
Based on whether the population standard deviation () is known.
Example: Since is unknown for the cafeteria rating, a T-test is used.
Determine the Critical Value(s):
Degrees of Freedom (): For a one-sample T-test, . For , .
Using Minitab to find critical values (Slide 4):
Navigate to
Graph > Probability Distribution Plot > View Probability > OK.Select
T Distributionand enterDegrees of freedom: 5.Go to
Shaded Area, selectProbability, chooseBoth Tails.Enter
Probability = 0.05().The output shows the critical values. For and (both tails), the critical values are . These values define the rejection regions (the tails) and the non-rejection region (the middle area).
Calculate the Test Statistic (T-stat):
Formula: (where is sample mean, is hypothesized population mean, is sample standard deviation, is sample size).
Example Calculations:
First, get sample descriptive statistics using Minitab (
Stat > Basic Statistics > Display Descriptive Statistics).Sample mean () of collected ratings: .
Sample standard deviation () of collected ratings: .
.
.
Make a Decision and Draw a Conclusion:
Compare the calculated to the critical values ().
If falls into a rejection region (i.e., less than or greater than ), reject .
If falls into the non-rejection region (between the critical values), do not reject .
Example: The calculated falls between and . It is in the non-rejection region.
Conclusion: Do not reject at the significance level. (This implies the sample mean of is not statistically different from the hypothesized mean of ).
Approach 2: P-Value Approach
This method compares the probability of observing the test statistic (or more extreme) to the significance level.
Steps 1-3: Same as the Critical Value Approach.
Calculate the P-value:
Use the calculated (e.g., ).
Using Minitab to find P-value:
Navigate to
Graph > Probability Distribution Plot > View Probability > OK.Select
T Distributionand enterDegrees of freedom: 5.Go to
Shaded Area, selectX value, chooseLeft Tail.Enter
X value = -0.716.The output gives the probability for the left tail. For , the left tail probability is approximately .
For a two-tailed test, double this probability: P-value = .
Make a Decision and Draw a Conclusion:
Rule:
If P-value : Reject .
If P-value > \alpha: Do Not Reject .
Example: P-value () is greater than ().
Conclusion: Do not reject at the significance level.
Consistency: This conclusion is consistent with the critical value approach.
Interpretation of P-value (brief): The P-value is the probability of observing a test statistic as extreme as, or more extreme than, the one calculated, assuming the null hypothesis is true. (This definition will be revisited for deeper understanding later).
Approach 3: Confidence Interval Approach
This method determines if the hypothesized population parameter falls within the calculated confidence interval.
Steps 1-3: Same as the Critical Value Approach.
Construct the Confidence Interval (CI):
Formula (for T-distribution CI for means): (where corresponds to the desired confidence level and ).
Example Calculations:
(for confidence and , from Step 4 of Critical Value Approach).
Confidence Interval: (lower bound approx. , upper bound approx. ).
Make a Decision and Draw a Conclusion:
Rule:
If the hypothesized population mean () falls within the confidence interval: Do Not Reject .
If the hypothesized population mean () falls outside the confidence interval: Reject .
Example: The hypothesized population mean () is within the calculated confidence interval .
Conclusion: Do not reject at the significance level.
Consistency: This conclusion is consistent with both the critical value and P-value approaches.
Using Minitab for All-in-One Hypothesis Testing
Minitab can perform all these calculations simultaneously, providing the T-stat, P-value, and confidence interval directly.
Steps:
Go to
Stat > Basic Statistics > 1-Sample t...Select
Summarized data(orOne or more samples, each in a columnif you have raw data).Enter the
Sample size (n),Sample mean (X bar), andStandard deviation (S).Check
Perform hypothesis testand enter theHypothesized mean (Mu)().Click
Options:Set the
Confidence level(e.g., for ).Set the
Alternative hypothesistoMean not equal to hypothesized meanfor a two-sided test.
Click
OK.
Output: Minitab will display the T-value (), P-value, and the confidence interval.
Missing Information: Minitab's
1-Sample tfunction does not directly provide the critical values. These must be obtained separately usingGraph > Probability Distribution Plot.Educational Value: Understanding the manual steps behind each approach is crucial before relying solely on software outputs.
Practice Example: Exam Average (One-Sample T-test)
Scenario: Instructor believes the class average for the first exam is .
Hypotheses:
Sample Data: A sample of students yields a sample mean and a sample standard deviation .
Significance Level: (or confidence level).
Test Type: T-test (since population standard deviation is unknown).
Minitab Application (
Stat > Basic Statistics > 1-Sample t...with summarized data):Sample size: 10Sample mean: 83.4Standard deviation: 5.7Hypothesized mean: 85Options:Confidence level: 93.0,Alternative hypothesis: Mean not equal to hypothesized mean.
Minitab Results:
T-Value ():
P-Value:
Confidence Interval ( CI): (rounded)
Conclusions from Each Approach:
P-value Approach: P-value () > \alpha ().
Decision: Do not reject at the level.
Confidence Interval Approach: The hypothesized mean () falls within the CI .
Decision: Do not reject at the level.
Critical Value Approach:
Degrees of Freedom: .
Finding Critical Values (Minitab:
Graph > Probability Distribution Plot): For and (both tails), the critical values are .Comparison: The calculated falls between and (non-rejection region).
Decision: Do not reject at the level.
Overall Conclusion: All three approaches consistently lead to the conclusion to not reject the null hypothesis that the class average is .