Hypothesis Testing: P-Value, Steps, and Interpretation

Administrative Information for Course Review

  • Course Review Deadline: The process to review or opt-out from the course must be completed by September 29. Students should follow the instructions provided in an email sent earlier today.

  • Cost: While the exact charge is uncertain, it is estimated to be around 20orpotentiallymore,butnotexcessivelyhigh.Studentswhodonotneedthereviewshouldoptout.</p></li></ul><h3collapsed="false"seolevelmigrated="true">PValueDeterminationandInterpretation</h3><ul><li><p><strong>PValueLocation:</strong>Thelocationofthepvalueisalwaysdeterminedbythedirectionalsignofthealternativehypothesis(20 or potentially more, but not excessively high. Students who do not need the review should opt out.</p></li></ul><h3 collapsed="false" seolevelmigrated="true">P-Value Determination and Interpretation</h3><ul><li><p><strong>P-Value Location:</strong> The location of the p-value is always determined by the directional sign of the alternative hypothesis (H_A).</p><ul><li><p><strong>Example:</strong>If).</p><ul><li><p><strong>Example:</strong> IfH_Ausesa"lessthan"(uses a "less than" (<)sign,thepvalueistheareatotheleftoftheteststatistic.</p></li><li><p><strong>Concept:</strong>Thepvaluerepresentstheprobabilityofobservingateststatisticasextremeas,ormoreextremethan,whatwasobservedinthesample,<em>assumingthenullhypothesis() sign, the p-value is the area to the left of the test statistic.</p></li><li><p><strong>Concept:</strong> The p-value represents the probability of observing a test statistic as extreme as, or more extreme than, what was observed in the sample, <em>assuming the null hypothesis (H_0)istrue</em>.</p></li></ul></li><li><p><strong>ConceptualizingHypothesisTests(The10,000LivesAnalogy):</strong></p><ul><li><p>Imaginethateveryhypothesistestyouperformhas,intheory,beenconducted<strong>10,000times</strong>bysomeoneelsebeforeyou.</p></li><li><p><strong>IllustrationwithCoinFlipApplet:</strong></p><ul><li><p><strong>NullHypothesis:</strong>Probabilityofgettingheads(p)fromafaircoinis) is true</em>.</p></li></ul></li><li><p><strong>Conceptualizing Hypothesis Tests (The 10,000 Lives Analogy):</strong></p><ul><li><p>Imagine that every hypothesis test you perform has, in theory, been conducted <strong>10,000 times</strong> by someone else before you.</p></li><li><p><strong>Illustration with Coin Flip Applet:</strong></p><ul><li><p><strong>Null Hypothesis:</strong> Probability of getting heads (p) from a fair coin isP=0.5.</p></li><li><p><strong>SampleSize:</strong>.</p></li><li><p><strong>Sample Size:</strong>n=60peopleinaclassroom.</p></li><li><p><strong>Experiment:</strong>Eachofthe60peopleflipsacoinonce,andthenumberofheads(successes)iscounted.</p></li><li><p><strong>Simulation:</strong>Theappletsimulatestakingsamplesofsize60fromapopulationwherepeople in a classroom.</p></li><li><p><strong>Experiment:</strong> Each of the 60 people flips a coin once, and the number of heads (successes) is counted.</p></li><li><p><strong>Simulation:</strong> The applet simulates taking samples of size 60 from a population wherep=0.5andcalculatingtheproportionofheads(and calculating the proportion of heads (p_{hat})<strong>10,000times</strong>.</p></li><li><p><strong>ExpectedOutcome:</strong>Ifyourepeatthisexperimentmanytimes,thedistributionof) <strong>10,000 times</strong>.</p></li><li><p><strong>Expected Outcome:</strong> If you repeat this experiment many times, the distribution ofp_{hat}valueswillformabellshaped(normal)curvecenteredaroundvalues will form a bell-shaped (normal) curve centered aroundP=0.5.</p></li><li><p><strong>Likelihood:</strong>The<strong>taller</strong>aspecific.</p></li><li><p><strong>Likelihood:</strong> The <strong>taller</strong> a specificp{hat}valueisonthiscurve,themorelikelyitistooccurifvalue is on this curve, the more likely it is to occur ifH0istrue.Conversely,the<strong>shorter</strong>avalueis,thelesslikelyitis.</p></li><li><p><strong>ClassroomExample:</strong></p><ul><li><p>Getting<strong>30outof60heads</strong>(i.e.,is true. Conversely, the <strong>shorter</strong> a value is, the less likely it is.</p></li><li><p><strong>Classroom Example:</strong></p><ul><li><p>Getting <strong>30 out of 60 heads</strong> (i.e.,p_{hat} = 0.5)isverylikelyandwouldbeatthepeakofthecurve.</p></li><li><p>Getting<strong>12outof60heads</strong>isveryunlikelyandwouldbeashortpointonthecurve.</p></li><li><p>Getting<strong>60outof60heads</strong>wouldbeextremelyunlikely.</p></li></ul></li></ul></li></ul></li><li><p><strong>InterpretingaLowPValue:</strong>Ifyourobservedsampleresult(e.g.,gettingonly22headsoutof60,resultinginaverylowpvaluelike) is very likely and would be at the peak of the curve.</p></li><li><p>Getting <strong>12 out of 60 heads</strong> is very unlikely and would be a short point on the curve.</p></li><li><p>Getting <strong>60 out of 60 heads</strong> would be extremely unlikely.</p></li></ul></li></ul></li></ul></li><li><p><strong>Interpreting a Low P-Value:</strong> If your observed sample result (e.g., getting only 22 heads out of 60, resulting in a very low p-value likeP-value = 0.0192)isveryunlikelyundertheassumptionthatthenullistrue,thenyourejectthenullhypothesis.Itsuggeststhatyoursampleresultistooextremetobeexplainedbychancealone.</p></li><li><p><strong>CrucialPValueInterpretationRule:</strong>Wheninterpretingapvalue,it<strong>always</strong>doneundertheassumptionthatthenullhypothesis() is very unlikely under the assumption that the null is true, then you reject the null hypothesis. It suggests that your sample result is too extreme to be explained by chance alone.</p></li><li><p><strong>Crucial P-Value Interpretation Rule:</strong> When interpreting a p-value, it <strong>always</strong> done under the assumption that the null hypothesis (H0)is,inreality,true.Thisholdstrueregardlessofwhetheryoureject) is, in reality, true. This holds true regardless of whether you rejectH0orfailtorejector fail to rejectH_0.</p></li></ul><h3collapsed="false"seolevelmigrated="true">SixStepsofaHypothesisTest</h3><h4collapsed="false"seolevelmigrated="true">Step1:DraftaResearchQuestion</h4><ul><li><p><strong>Purpose:</strong>Toclearlystatethequestionthehypothesistestaimstoanswer.</p></li><li><p><strong>Strategy:</strong>Paraphrasetherelevantsentencefromtheproblemstatement,usuallytheoneprecedingtherequestto"performahypothesistest."</p></li><li><p><strong>Example1(BankAccountsinArrears):</strong></p><ul><li><p><strong>ProblemStatementSnippet:</strong>"determineiflessthan1.</p></li></ul><h3 collapsed="false" seolevelmigrated="true">Six Steps of a Hypothesis Test</h3><h4 collapsed="false" seolevelmigrated="true">Step 1: Draft a Research Question</h4><ul><li><p><strong>Purpose:</strong> To clearly state the question the hypothesis test aims to answer.</p></li><li><p><strong>Strategy:</strong> Paraphrase the relevant sentence from the problem statement, usually the one preceding the request to "perform a hypothesis test."</p></li><li><p><strong>Example 1 (Bank Accounts in Arrears):</strong></p><ul><li><p><strong>Problem Statement Snippet:</strong> "…determine if less than 1% of all accounts held by the bank are in arrears."</p></li><li><p><strong>Research Question:</strong> "Is less than 1% of all accounts held by the bank in arrears?"</p></li></ul></li><li><p><strong>Example 2 (Diet Soda Taste):</strong></p><ul><li><p><strong>Problem Statement Snippet:</strong> "…determine if the majority of all diet soda drinkers now like the taste of their soda."</p></li><li><p><strong>Research Question:</strong> "Do the majority (i.e., more than half) of all diet soda drinkers now like the taste of their soda?"</p></li></ul></li></ul><h4 collapsed="false" seolevelmigrated="true">Step 2: Formulate Hypotheses (H0andandHA)</h4><ul><li><p><strong>AlternativeHypothesis()</h4><ul><li><p><strong>Alternative Hypothesis (H_A):</strong></p><ul><li><p>Mustalwaysalignwiththe<strong>directionalsign</strong>impliedbytheresearchquestion(e.g.,"lessthan,""morethan,""differentfrom").</p></li><li><p><strong>Example1():</strong></p><ul><li><p>Must always align with the <strong>directional sign</strong> implied by the research question (e.g., "less than," "more than," "different from").</p></li><li><p><strong>Example 1 (p < 0.01):</strong>Iftheresearchquestionis"lessthan1):</strong> If the research question is "less than 1%," thenH_A: p < 0.01.</p></li><li><p><strong>Example2(.</p></li><li><p><strong>Example 2 (p > 0.5):</strong>Iftheresearchquestionis"majority(morethanhalf),"then):</strong> If the research question is "majority (more than half)," thenH_A: p > 0.5.</p></li></ul></li><li><p><strong>NullHypothesis(.</p></li></ul></li><li><p><strong>Null Hypothesis (H_0):</strong></p><ul><li><p>Theparametersymbol(e.g.,):</strong></p><ul><li><p>The parameter symbol (e.g.,pforproportion)mustbethesameasinfor proportion) must be the same as inH_A.</p></li><li><p>Thevalueoftheparameter(e.g.,.</p></li><li><p>The value of the parameter (e.g.,0.01oror0.5)mustbethesameasin) must be the same as inH_A.</p></li><li><p>Thedirectionalsymbolfor.</p></li><li><p>The directional symbol forH_0<strong>mustalwaysincludeanequalsign</strong>.</p><ul><li><p><strong>Optionsfor<strong>must always include an equal sign</strong>.</p><ul><li><p><strong>Options forH_0:</strong></p><ul><li><p>:</strong></p><ul><li><p>H_0: p = ext{value}(mostcommonandalwaysacceptable)</p></li><li><p>(most common and always acceptable)</p></li><li><p>H0: p ext{ (greater than or equal to) value}(if(ifHAislessthan)</p></li><li><p>is less than)</p></li><li><p>H0: p ext{ (less than or equal to) value}(if(ifHAisgreaterthan)</p></li></ul></li></ul></li></ul></li><li><p><strong>Example1(BankAccounts):</strong></p><ul><li><p>is greater than)</p></li></ul></li></ul></li></ul></li><li><p><strong>Example 1 (Bank Accounts):</strong></p><ul><li><p>H_0: p = 0.01</p></li><li><p></p></li><li><p>H_A: p < 0.01</p></li></ul></li><li><p><strong>Example2(DietSoda):</strong></p><ul><li><p></p></li></ul></li><li><p><strong>Example 2 (Diet Soda):</strong></p><ul><li><p>H_0: p = 0.5</p></li><li><p></p></li><li><p>H_A: p > 0.5</p></li></ul></li></ul><h4collapsed="false"seolevelmigrated="true">Step3:CheckConditions</h4><ul><li><p><strong>TypeofTest:</strong>Identifyiftheprobleminvolvesproportionsormeans.(Discussionfocusesonproportionshere,indicatedbypercentagesor"majority").</p></li><li><p><strong>ConditionsforProportions(TwoMethods):</strong></p><ol><li><p><strong>Method1(UsingPopulationProportionunder</p></li></ul></li></ul><h4 collapsed="false" seolevelmigrated="true">Step 3: Check Conditions</h4><ul><li><p><strong>Type of Test:</strong> Identify if the problem involves proportions or means. (Discussion focuses on proportions here, indicated by percentages or "majority").</p></li><li><p><strong>Conditions for Proportions (Two Methods):</strong></p><ol><li><p><strong>Method 1 (Using Population Proportion underH_0):</strong></p><ul><li><p>):</strong></p><ul><li><p>np ext{ (greater than or equal to) } 10</p></li><li><p></p></li><li><p>n(1-p) ext{ (greater than or equal to) } 10</p></li><li><p>Where</p></li><li><p>Wherepisthevaluefromthenullhypothesis(is the value from the null hypothesis (P_0).

  • Method 2 (Using Sample Counts):

    • Number of successes $\ge 10$

    • Number of failures $\ge 10$

    • Note: Both methods yield the same pass/fail result. You can use either one.

  • Example (Diet Soda):

    • Sample Size: n=75</p></li><li><p><strong>NullPopulationProportion:</strong></p></li><li><p><strong>Null Population Proportion:</strong>P_0 = 0.5</p></li><li><p><strong>Successes:</strong>39peoplelikedthetaste.</p></li><li><p><strong>Failures:</strong>36peopledislikedthetaste(</p></li><li><p><strong>Successes:</strong> 39 people liked the taste.</p></li><li><p><strong>Failures:</strong> 36 people disliked the taste (75 - 39 = 36).</p></li><li><p><strong>CheckingConditions:</strong></p><ul><li><p><strong>Method1:</strong></p><ul><li><p>).</p></li><li><p><strong>Checking Conditions:</strong></p><ul><li><p><strong>Method 1:</strong></p><ul><li><p>n P_0 = 75 imes 0.5 = 37.5 ext{ (greater than or equal to) } 10(Passes)</p></li><li><p>(Passes)</p></li><li><p>n (1-P_0) = 75 imes (1-0.5) = 75 imes 0.5 = 37.5 ext{ (greater than or equal to) } 10(Passes)</p></li></ul></li><li><p><strong>Method2:</strong></p><ul><li><p>Successes:(Passes)</p></li></ul></li><li><p><strong>Method 2:</strong></p><ul><li><p>Successes:39 ext{ (greater than or equal to) } 10(Passes)</p></li><li><p>Failures:(Passes)</p></li><li><p>Failures:36 ext{ (greater than or equal to) } 10(Passes)</p></li></ul></li></ul></li><li><p><strong>ConclusionforConditions:</strong>Conditionsaremet.</p></li></ul></li><li><p><strong>RandomSample:</strong>Theproblemmuststatethatarandomsamplewastaken.(e.g.,"randomsampleof75dietsodadrinkers").</p></li></ul><h4collapsed="false"seolevelmigrated="true">Step4:CalculateTestStatistic(Zscore)andPvalue</h4><ul><li><p><strong>KeyConsiderationfor(Passes)</p></li></ul></li></ul></li><li><p><strong>Conclusion for Conditions:</strong> Conditions are met.</p></li></ul></li><li><p><strong>Random Sample:</strong> The problem must state that a random sample was taken. (e.g., "random sample of 75 diet soda drinkers").</p></li></ul><h4 collapsed="false" seolevelmigrated="true">Step 4: Calculate Test Statistic (Z-score) and P-value</h4><ul><li><p><strong>Key Consideration forp{hat}:</em></strong><em>Thesampleproportion(:</em></strong><em> The sample proportion (p{hat})mustrefertothesameconceptasthepopulationproportion() must refer to the same concept as the population proportion (p)inthehypotheses.If) in the hypotheses. IfHAisaboutlikingthetaste,is about liking the taste,p{hat}mustbetheproportionwho<em>liked</em>thetaste.</p></li><li><p><strong>Example(DietSoda):</strong></p><ul><li><p>must be the proportion who <em>liked</em> the taste.</p></li><li><p><strong>Example (Diet Soda):</strong></p><ul><li><p>P0 = 0.5(from(fromH0)</p></li><li><p>)</p></li><li><p>n = 75</p></li><li><p><strong>ObservedSuccesses:</strong>39peoplelikedthetaste.</p></li><li><p><strong>Calculate</p></li><li><p><strong>Observed Successes:</strong> 39 people liked the taste.</p></li><li><p><strong>Calculatep{hat}:</em></strong><em>:</em></strong><em>p{hat} = 39/75</p></li><li><p><strong>TestStatistic(Zscore):</strong>(Detailsforcalculationnotprovidedintranscript,butitwasstatedtobe</p></li><li><p><strong>Test Statistic (Z-score):</strong> (Details for calculation not provided in transcript, but it was stated to beZ = 0.35).</p></li></ul></li><li><p><strong>FindingthePvaluefromZTable:</strong></p><ul><li><p>TheZtablealwaysprovidestheareatothe<strong>left</strong>oftheZscore.</p></li><li><p><strong>Example:</strong>For).</p></li></ul></li><li><p><strong>Finding the P-value from Z-Table:</strong></p><ul><li><p>The Z-table always provides the area to the <strong>left</strong> of the Z-score.</p></li><li><p><strong>Example:</strong> ForZ = 0.35:</p><ul><li><p>Lookup:</p><ul><li><p>Look upZ=0.35intheZtable:yieldsin the Z-table: yields0.6368</p></li><li><p>Thisvalue(</p></li><li><p>This value (0.6368)istheareatotheleftof) is the area to the left ofZ=0.35.</p></li></ul></li><li><p><strong>Adjustingfor.</p></li></ul></li><li><p><strong>Adjusting forH_ADirection:</strong></p><ul><li><p>RecallthatthepvaluelocationisdictatedbyDirection:</strong></p><ul><li><p>Recall that the p-value location is dictated byHA.Inthedietsodaexample,. In the diet soda example,HA: p > 0.5(greaterthansign).</p></li><li><p>Therefore,thepvalueistheareatothe<strong>right</strong>oftheteststatistic.</p></li><li><p><strong>PvalueCalculation:</strong>(greater than sign).</p></li><li><p>Therefore, the p-value is the area to the <strong>right</strong> of the test statistic.</p></li><li><p><strong>P-value Calculation:</strong>1 - ( ext{Area to the left}) = 1 - 0.6368 = 0.3632.</p></li><li><p><strong>Result:</strong>Pvalue.</p></li><li><p><strong>Result:</strong> P-value= 0.3632.</p></li></ul></li><li><p><strong>InterpretationofPvalue(.</p></li></ul></li><li><p><strong>Interpretation of P-value (0.3632):</strong>Thispvalueisrelativelylarge,indicatingthatobservingasampleproportionlike):</strong> This p-value is relatively large, indicating that observing a sample proportion like39/75(ormoreextreme)isquitelikelyifthenullhypothesis((or more extreme) is quite likely if the null hypothesis (p=0.5)weretrue.Itcorrespondstoarelatively"tall"pointonthenormalcurve.</p></li></ul></li></ul><h4collapsed="false"seolevelmigrated="true">Step5:Decision</h4><ul><li><p><strong>DecisionRule:</strong>Comparethepvaluetothesignificancelevel(alpha,) were true. It corresponds to a relatively "tall" point on the normal curve.</p></li></ul></li></ul><h4 collapsed="false" seolevelmigrated="true">Step 5: Decision</h4><ul><li><p><strong>Decision Rule:</strong> Compare the p-value to the significance level (alpha,\alpha).

      • If P-value $\le \alpha$, then Reject H_0.

      • If P-value $> \alpha$, then Fail to Reject H_0</strong>.</p></li></ul></li><li><p><strong>AlphaLevel:</strong>Alpha(</strong>.</p></li></ul></li><li><p><strong>Alpha Level:</strong> Alpha (\alpha)isalwaysprovidedintheproblemstatement(e.g.,"performahypothesistestatalpha) is always provided in the problem statement (e.g., "perform a hypothesis test at alpha0.05").</p><ul><li><p><strong>Example(DietSoda):</strong>").</p><ul><li><p><strong>Example (Diet Soda):</strong>\alpha = 0.05</p></li></ul></li><li><p><strong>ApplyingtheRule:</strong></p><ul><li><p>Pvalue</p></li></ul></li><li><p><strong>Applying the Rule:</strong></p><ul><li><p>P-value= 0.3632</p></li><li><p></p></li><li><p>\alpha = 0.05</p></li><li><p>Since</p></li><li><p>Since0.3632 > 0.05(Pvalueisgreaterthanalpha),thedecisionisto<strong>FailtoReject(P-value is greater than alpha), the decision is to <strong>Fail to RejectH_0</strong>.</p></li></ul></li></ul><h4collapsed="false"seolevelmigrated="true">Step6:Conclusion(Notfullydetailedintranscriptbutimpliedafterdecision)</h4><ul><li><p>Statethedecisionintermsoftheresearchquestionandthecontextoftheproblem.Ifyoufailtoreject</strong>.</p></li></ul></li></ul><h4 collapsed="false" seolevelmigrated="true">Step 6: Conclusion (Not fully detailed in transcript but implied after decision)</h4><ul><li><p>State the decision in terms of the research question and the context of the problem. If you fail to rejectH0,itmeansthereisnotenoughevidencetosupport, it means there is not enough evidence to supportHA.Ifyoureject. If you rejectH0,itmeansthereissufficientevidencetosupport, it means there is sufficient evidence to supportHA.</p></li></ul><p><strong>Example(ConcludingforDietSodaProblem):</strong>(BasedonFTR.</p></li></ul><p><strong>Example (Concluding for Diet Soda Problem):</strong> (Based on FTRH_0)</p><ul><li><p>Thereisnotsufficientevidenceatthe)</p><ul><li><p>There is not sufficient evidence at the\alpha = 0.05$$ significance level to conclude that the majority of all diet soda drinkers now like the taste of their soda.