STATS FINAL: T-tests and Mann-Whitney U test (Wilcoxon Rank)

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/32

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

33 Terms

1
New cards

What is a student’s T-test?

It’s the same thing as a T-test

The Student's t-test (or simply the t-test) is an inferential statistical test used to determine if there is a statistically significant difference between the means of two groups.

The T-test is a statistical tool that takes the difference between the averages of two groups, calculates a T-value, and then compares that T-value against a special "t-curve" to figure out if the difference you saw is a real effect or just random luck.

<p>It’s the same thing as a <strong>T-test</strong></p><p><span>The Student's t-test (or simply the </span><strong>t-test</strong><span>) is an </span><strong>inferential statistical test</strong><span> used to determine if there is a </span><span style="color: red;"><strong><span>statistically significant difference</span></strong></span><span> between the </span><strong>means</strong><span> of </span><strong>two groups</strong><span>.</span></p><p></p><p>The<strong> T-test is a statistical tool that takes the difference between the averages of two groups</strong>, calculates a T-value, and then compares that <strong>T-value against a special "t-curve" to figure out if the difference</strong> you saw is a <strong>real effect</strong> or just <strong>random luck.</strong></p><p></p>
2
New cards

Who developed the T-test?

William Sealy Gosset ==> developed the T-stat

3
New cards

T-test vs. Z-test:

The population standard deviation is unknown, it’s estimated from the sample instead

<p>The population standard deviation is unknown, it’s <strong>estimated from the sample instead</strong></p>
4
New cards

What does the T-distribution look like?

The t-distribution is flatter and has heavier tails than the normal (Z) distribution

  • when the sample size gets larger, that’s when the distribution starts to look relatively similar to the Z.

<p>The <strong>t-distribution</strong> is flatter and has heavier tails than the normal (Z) distribution</p><ul><li><p>when the sample size gets larger, that’s when the distribution starts to look relatively similar to the Z. </p></li></ul><p></p>
5
New cards

What is a Test Statistic?

Test Statistic is the single, crucial number you calculate from your data to see if your results are significant (helps us determine P value = aka. significance level)

  • A test statistic summarizes your observed data into a single number.

  • Think of it as summarizing all the information about your sample (like the average, the variability, and the size) into one number.

6
New cards

Test statistics differ depending on the statistical test we’re using:

uA test statistic follows a certain distribution.

<p><span style="font-family: &quot;Wingdings 3&quot;;"><span>u</span></span><span style="font-family: &quot;Trebuchet MS&quot;;"><span>A test statistic follows a certain distribution.</span></span></p>
7
New cards

What are the steps to calculating a test statistic?

The test statistic is calculated right in the middle of your statistical process:

  1. Formulate hypotheses (decide what you're testing: H0 and H1).

  2. Choose the right statistical test (e.g., T-test, ANOVA).

  3. Calculate the test statistic corresponding to that test.

  4. Determine the p-value (based on where your calculated test statistic falls on the distribution curve).

8
New cards
<p>A larger T-value from the T-test means:</p><ul><li><p>X1 and X2 are the sample means</p></li><li><p>N1 and N2 are the sample sizes for each groups</p></li><li><p>Sp2 is pooled variance</p></li></ul><p></p>

A larger T-value from the T-test means:

  • X1 and X2 are the sample means

  • N1 and N2 are the sample sizes for each groups

  • Sp2 is pooled variance

A larger difference in the numerator means a larger t-value, which suggests a stronger case for a significant result.

<p>A<strong> </strong><span style="color: rgb(0, 0, 0);"><strong><mark data-color="#fff5b1" style="background-color: rgb(255, 245, 177); color: inherit;">larger difference in the numerator means a larger </mark></strong></span><span style="color: rgb(0, 0, 0);"><strong><mark data-color="#fff5b1" style="background-color: rgb(255, 245, 177); color: inherit;"><span>t</span></mark></strong></span><span style="color: rgb(0, 0, 0);"><strong><mark data-color="#fff5b1" style="background-color: rgb(255, 245, 177); color: inherit;">-value</mark></strong></span>, which suggests a stronger case for a significant result.</p>
9
New cards

The bigger the T-stat, the better because:

The t-statistic is your metric for convincing yourself (and others) that the difference you found is a true signal and not just statistical noise.

**The bigger the t-value, the more compelling your evidence is that the two groups are different

10
New cards

What is the difference between the 3 different T-tests:

One sample T-test: comparing population mean to sample mean

Example: Comparing the mean of a single sample to a known value or a theoretical mean

  • (Does the average height of students in your class differ significantly from the national average height of 5'7"?

Two sample indepedant T-test: Classic

Purpose: Compares the means of two separate, independent groups.

e.g., comparing the test scores of two different groups of students

Paired/dependant T-test:

Purpose: Compares the means of two related or paired groups

(e.g., comparing students' scores before and after taking a prep course).

11
New cards

T-tests difference simplified:

One sample ==> testing sample mean to a known “popular mean”

Two-sample (independant) ==> comparing the mean between 2 indepedant groups

Paired T-test ==> comparing the mean between 2 dependant/paired groups (pre and post)

<p><strong>One sample</strong> ==&gt; testing sample mean to a known “popular mean”</p><p><strong>Two-sample (independant)</strong> ==&gt; comparing the mean between 2 indepedant groups</p><p><strong>Paired T-test ==&gt; </strong>comparing the mean between 2 dependant/paired groups (pre and post)</p>
12
New cards

How do we determine if an indepedant t-test is appropriate?

  1. Assumption of independence

  2. Dependant/responding variable MUST BE continuous (interval or ratio)

  3. The manipulated/indepedant variable must be binary categorical (women vs. men, treatment vs. control)

*so basically the manipulated variable must be a binary category and the responding variable must be continuous like blood pressure between men vs. women

13
New cards

What are the assumptions for conducting a two-sample indepedant T-test?

  1. random representative sample

  2. continuous DV

  3. independant observation

  4. normality (normal distribution)

  5. Homogeneity of Variance (equal variance) ==> the spread should be around the same

Homogeneity/Equal Variance means the spread of data should be around the same, and if you were to draw a distribution curve (like a bell curve) for each of your two independent groups, they would look similar in their width (or flatness) .

14
New cards

How do we check normality and equal variance for an indepedant T-test, like checking the assumptions before we input datat?

Normality is tested ==> via. Shapiro-wilk test

Equal variance is tested ==> via. Levene’s Test

15
New cards

What do we use if the variance’s are unequal for an indepedant T-test?

Welch’s T-test ==> like the gummy bears.

I hate gummy bears and so we want a homogeneity of variance to avoid Welch’s

16
New cards

What are degrees of freedom and what’s the difference between small vs. large df (degrees of freedom)

Degrees of freedom (df) are essentially the number of values in a final calculation that are free to vary.

  • Small df: The T-distribution is wider and shorter (fatter tails), reflecting greater uncertainty because the sample size is small.

  • Large df: The T-distribution becomes narrower and looks almost exactly like the normal Z-distribution. This happens as the sample size gets large, and your estimate of the population standard deviation becomes more reliable.

17
New cards

How do we determine P value for an independent T-test?

  1. degrees of freedom

  2. determine critical value and rejection regions via. T-distribution table

  3. Compare T-stat to critical value

18
New cards

What is critical value?

A critical value is the value of the test statistic that defines the Rejection Region (or Critical Region).

1. The Simple Concept: The "Line in the Sand"

Think of the critical value as the boundary line drawn on the distribution curve that separates results that are considered common from results that are considered rare (and therefore statistically significant).

19
New cards

When does a T-value fall into the rejection region?

When the T-value is more extreme than the critical value = there IS a significance = reject null

Conclusion: Your result is rare enough to conclude that it did not happen by random chance under the null hypothesis. You reject the null hypothesis (H0).

<p>When the <span style="color: red;"><strong>T-value is more extreme than the critical value = there IS a significance = reject null</strong></span></p><p></p><p><strong>Conclusion:</strong> Your result is rare enough to conclude that it did not happen by random chance under the null hypothesis. You <strong>reject the null hypothesis (</strong><span><strong><span>H0</span></strong></span><strong>)</strong>.</p>
20
New cards

How do we find critical value?

Based on the degrees of freedom we calculate

the # of data -1 in group A and B added

<p>Based on the degrees of freedom we calculate</p><p></p><p>the<strong> # of data -1</strong> in group A and B added</p>
21
New cards

If the critical value isn’t available in the table we:

round down

So we look at:

  1. whether we’re doing one-tail or two-tail

  2. we look for our alpha value/significance levels

  3. Go down to the degrees of freedom

AND THAT’S OUR CRITICAL VALUE, SO IF WE PASS IT WE REJECT NULL

<p><strong>round down</strong></p><p></p><p></p><p><strong>So we look at:</strong></p><ol><li><p><strong>whether we’re doing one-tail or two-tail</strong></p></li><li><p><strong>we look for our alpha value/significance levels</strong></p></li><li><p><strong>Go down to the degrees of freedom</strong></p></li></ol><p></p><p><span style="color: green;"><strong>AND THAT’S OUR CRITICAL VALUE, SO IF WE PASS IT WE REJECT NULL</strong></span></p><p></p>
22
New cards

Two-tailed vs. One-tailed

1. Two-Tailed Test (Non-Directional)

  • When to Use It: Use a two-tailed test when you are simply asking, "Is there a difference?"

    • You are open to the possibility that the average of Group 1 could be higher OR lower than the average of Group 2.

2. One-Tailed Test (Directional)

  • When to Use It: Use a one-tailed test when you are asking, "Is one group significantly greater than (or less than) the other?" You are only looking for a difference in one specific, predicted direction.

<p><span style="color: red;"><strong>1. Two-Tailed Test (Non-Directional)</strong></span></p><ul><li><p><strong>When to Use It:</strong> Use a two-tailed test when you are simply asking, "<strong>Is there a difference?</strong>" </p><ul><li><p>You are open to the possibility that the average of Group 1 could be <strong>higher</strong> OR <strong>lower</strong> than the average of Group 2.</p></li></ul></li></ul><p></p><p>2. One-Tailed Test (Directional)</p><ul><li><p><strong>When to Use It:</strong> Use a one-tailed test when you are asking, "<strong>Is one group significantly greater than (or less than) the other?</strong>" You are only looking for a difference in <strong>one specific, predicted direction</strong>.</p></li></ul><p></p><p></p>
23
New cards

Be cautious when selecting a one-tailed test because:

One-tailed tests have more power to detect differences, but also more like to make Type I error (false positive).

<p><span style="font-family: &quot;Trebuchet MS&quot;;"><span>One-tailed tests have more power to detect differences, but also more like to </span><strong><span>make Type I error (false positive).</span></strong></span></p>
24
New cards

What are the characteristics of a Mann-Whiteny U test (Wilcox Rank-sum test)

The Mann-Whitney U test (or Wilcoxon Rank-Sum test) is the non-parametric alternative to the Independent Samples T-test.

  • Do not assume normality, this is for skewed data or data with outliers

  • Ranks instead of means, so the test ranks all the values from both groups together and compares a sum between the group

  • Works with ordinal and continuous data

<p>The Mann-Whitney U test (or Wilcoxon Rank-Sum test) is the <span style="color: red;"><strong>non-parametric alternative to the Independent Samples T-test.</strong></span></p><p></p><ul><li><p><strong>Do not assume normality,</strong> this is for skewed data or data with outliers</p></li><li><p><span style="color: rgb(168, 93, 0);"><strong>Ranks instead of means</strong></span>, so the test ranks all the values from both groups together and compares a sum between the group</p></li><li><p><span style="font-family: &quot;Trebuchet MS&quot;;"><strong><span>Works with ordinal and continuous data </span></strong></span></p></li></ul><p></p>
25
New cards

Independant observations still apply for the Wilcoxon Rank-Sum test, yes or no?

YES

The assumption of independent observations still apply: the two groups being compared should be independent (no repeated measures, paired data, or matched samples).

26
New cards

What statistic do we calculate for the Wilcoxon rank-sum/Mann U whitney?

U-statistic

H₀: The distributions (ranked sum) of the two groups are equal.

Ha: The distributions (ranked sum) of the two groups are not equal (for a two-tailed test).

27
New cards

What are paired T-tests used for?

comparing the means (responding variable) of 2 groups that are dependant

Longitudinal: repeated measures from same individuals over time

Cross-sectional: data from pairs of people that are likely to be similar for reasons such as genetics (twins), home environment (married couples, patient-family caregiver dyads )

28
New cards

What are the assumptions of paired T-tests?

  • Subjects are presumed to be randomly sampled (representative sample)

  • Continuous DV and Binary IV

  • Normality ==> The differences between paired values are approximately normally distributed.

29
New cards

What would the null and alternative hypothesis look like for a paired T-test?

Null = no difference pre- and post treatment

Alternative = Pre and post not equal (two-tailed test)

30
New cards

What is the non-parametric to a paired T-test?

Wilcoxon SIGNED rank test

  • no normality

  • ranking instead of mean difference

  • ordinal and continuous data

31
New cards

How do we determine if there is no difference between pre and post treatment (paired T-test)?

The average difference between the two groups is = zero

so we do post minus pre and then take the average of those points

<p><strong>The average difference between the two groups is = zero </strong></p><p></p><p></p><p><strong>so we do post minus pre and then take the average of those points </strong></p>
32
New cards

For the wilcoxon-signed rank test, how do we determine mean difference?

The median is the appropriate measure of the center.

The null hypothesis for the Wilcoxon Signed-Rank test is that the median of the population difference scores is zero.

33
New cards

When can we not do a wilcoxon signed rank test, even with pairs?

When there are less than 5 pairs, so we only have 5 people per group basically

<p><strong>When there are less than 5 pairs, so we only have 5 people per group basically</strong></p>