Overview: In practice, we look at the p-value for the test statistic when applying z-tests or t-tests in software like JASP. Several effect-size measures can be reported, e.g. Pearson's correlation coefficient, Cohen's d, omega, or omega-squared.
One simple example setup (z-test):
Population of interest: all clinical IP scores with population mean μ (unknown) and population standard deviation known as part of the example: σ=15.
Small sample: n = 25 students from a class, sample mean xˉ=110.
Null hypothesis: H0:μ=100 (population mean of IP scores assumed to be 100 under the null).
Goal: test whether the sample provides evidence that the population mean differs from 100.
Rationale: The z-test compares the sample mean to the population mean using knowledge of the population standard deviation.
Example summary: With the numbers above, the z-statistic would be computed to evaluate whether the observed sample mean could arise if μ=100.
How to compute the z-statistic (for the sample mean):
For a single observation x: the z-score is z=σx−μ
For a sample mean, assuming known population standard deviation, the distribution of the sample mean is centered at μ with standard deviation σXˉ=nσ, so
Z-statistic for the sample mean: z=σ/nxˉ−μ0
Under the null, z follows the standard normal distribution N(0,1).
In the example: with xˉ=110 , μ<em>0=100, σ=15, and n=25, z=15/25110−100=310≈3.33 which yields a very small p-value (two-sided, about p≈0.0009$).Ifthechosenalphais0.05,wewouldrejectH0.</p></li></ul></li><li><p>Hypotheses,alpha,anddecisionregions(two−tailedexample):</p><ul><li><p>Nullhypothesis:H0: \mu = \mu0(e.g.,\mu_0 = 100).</p></li><li><p>Alternative:dependingontheresearchquestion(oftentwo−sided,unlessspecifiedotherwise).</p></li><li><p>Significancelevel:\alpha = 0.05(example).</p></li><li><p>Criticalregionforatwo−sidedtest:thecentrallimittheoryforthestandardnormalimplies<br>\alpha/2 = 0.025ineachtail,sothecriticalz−valuesareapproximately\pm z_{\alpha/2} = \pm 1.96.</p></li><li><p>Connectiontothe68−95−99.7rule:about95\pm 2standarddeviationsofthemeanunderthenormaldistribution,whichexplainsthe95H_0atthechosen\alpha.</p></li></ul></li><li><p>Keylimitationofthez−test(unknownsigmainpractice):</p><ul><li><p>Thez−testassumesthatthepopulationstandarddeviation\sigmaisknown,whichisveryunrealisticinmostreal−worldsettings.</p></li><li><p>Because\sigmaisrarelyknown,thedistributionusedisnotstrictlystandardnormalfortheteststatisticbasedonthesample;thisleadstothet−distributionratherthanthestandardnormal.</p></li></ul></li><li><p>Thet−distribution:motivationandproperties</p><ul><li><p>When\sigmaisunknownandweestimateitwiththesamplestandarddeviations,theteststatisticbecomes</p></li><li><p>One−samplet−statistic: t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}} </p></li><li><p>Degreesoffreedom: df = n - 1 </p></li><li><p>Thet−distributionissymmetricbutheavier−tailedthanthestandardnormal;itsexactshapedependsonthedegreesoffreedom.</p></li><li><p>Asthesamplesizegrows,thet−distributionapproachesthestandardnormal:forlargedf > 30(roughly),t\approx zandsapproximates\sigma.</p></li><li><p>Inpractice,youcomparethecalculatedttoat−distributionwithdf = n - 1toobtainap−value.</p></li></ul></li><li><p>Whatdoesthep−valuetellyouhere?</p><ul><li><p>Asmallp−value(e.g.,p < 0.05)indicatesthattheobservedsamplemeanisunlikelyunderthenullandleadstorejectionofH_0atthechosenlevelofsignificance.</p></li><li><p>Intheexample,azofabout3.33yieldsap−valuewellbelow0.05,supportingrejectionofH_0.</p></li></ul></li><li><p>Extendingtotwomeans(two−samplet−test,independentsamples,equalvariances):</p><ul><li><p>Whencomparingtwosamples,youcanuseatwo−samplet−testifyouwanttoassesswhetherthetwopopulationmeansdiffer.</p></li><li><p>Assumptionsforthetwo−samplet−testwithequalvariances(pooled−variancet−test):</p></li><li><p>Eachpopulationisnormallydistributed.</p></li><li><p>Thetwopopulationstandarddeviationsareequal(homogeneityofvariance):\sigma1 = \sigma2(wedonotknowthem;weestimate).</p></li><li><p>Teststatisticforindependentsampleswithequalvariances:</p></li><li><p>Pooledstandarddeviation(sp):<br> sp = \sqrt{\frac{(n1 - 1)s1^2 + (n2 - 1)s2^2}{n1 + n_2 - 2}} t−statistic: t = \frac{\bar{x}1 - \bar{x}2}{sp \sqrt{\frac{1}{n1} + \frac{1}{n_2}}} Degreesoffreedom: df = n1 + n2 - 2 </p></li><li><p>Interpretation:largerabsolutevalueoftleadstoalargerdifferencebetweenthetwosamplemeansrelativetothepooledvariability,andasmallerp−value.</p></li></ul></li><li><p>Effectsizefort−tests:Pearson′scorrelationcoefficientr</p><ul><li><p>Acommonwaytoquantifythemagnitudeoftheeffectinat−testisviathecorrelationcoefficientr,whichcanbederivedfromthet−statisticanditsdegreesoffreedom:</p></li><li><p>Formula: r = \frac{t}{\sqrt{t^2 + df}} </p></li><li><p>Signofrmatchesthesignofthet−statistic,reflectingthedirectionoftheeffect.</p></li><li><p>Note:Somesoftware(e.g.,JASP)reportsthet−statisticanddf,andyoucancomputerbyhandifdesired.</p></li></ul></li><li><p>Practicalreportingandsoftwareconsiderations</p><ul><li><p>SoftwarelikeJASPreportsthep−valuefortheteststatistic(zort)andthedegreesoffreedom;itmaynotalwaysoutputtheeffectsize(e.g.,r)directly,soyoumaycomputeityourselffromtanddf.</p></li><li><p>Interpretationhingesonbothstatisticalsignificanceandpracticalsignificance:ap−valuecanbesmallwithaverylargesampleevenfortinyeffects.</p></li></ul></li><li><p>Therelationshipbetweenp−valuesandsamplesize(p−hackingwarning)</p><ul><li><p>Akeyproperty:thep−valueissensitivetosamplesize.Increasingsamplesizecanshrinkthep−valueeveniftheeffectsizeremainstiny.</p></li><li><p>Thiscanleadto“p−hacking”orfishingforsignificancebysimplyaccumulatingmoreobservations.</p></li><li><p>Caution:whilelargersamplesincreasepowertodetectrealeffects,theycanalsoproducestatisticallysignificantresultsthatarepracticallymeaninglessiftheeffectsizeistrivial.</p></li></ul></li><li><p>Takeawaynotes</p><ul><li><p>Usethez−testwhenthepopulationstandarddeviation\sigmaisknown;otherwise,usethet−testwithsasanestimateofvariability.</p></li><li><p>Forone−sampletests,use t = \frac{\bar{x} - \mu0}{s / \sqrt{n}} \; (df = n - 1) or z = \frac{\bar{x} - \mu0}{\sigma / \sqrt{n}} \; (df = \text{not applicable if } \sigma \text{ is known}) dependingondataconditions.</p></li><li><p>Fortwoindependentsampleswithequalvariances,usethepooled−variancet−test;otherwise,separate−variance(Welch)t−testmaybeused(notdetailedherebutcommonlyneededwhenvariancesdiffer).</p></li><li><p>Alwaysreporteffectsize(e.g.,r$$) in addition to p-values to convey practical significance.
Be mindful of sample size: large samples can yield statistically significant results for negligible effects without substantive importance.