RMS - Non-parametric

\ \ Central Limit Theorem makes it so that we can study all kinds of distributions because their sampling distributions converge into a normalising distribution

→ this means that we only need a limited set of statistical tools to draw inferences about our data. * Parametric statistics

\ It is not always though, that sample is normally distributed.

Data may be skewed
Test is not robust to violation of assumption

→ small samples

\ Non-parametric → unable to apply normal sampling distribution. We cannot assume that the sampling distribution is normal.

\ Non-parametric testing

Converting data to ranks
Eliminates outliers
Transform the (quantitative) data to rank orders
Sample space of rank orders can be easily determined: it ranges from 1 to n
This gives the sampling distribution of the test statistic

Score	Rank
87	1
512	4
353	2
468	3

^ The lowest score gets rank 1

Score	Rank
87	1
512	5
^^353^^	^^2.5^^
468	4
^^353^^	^^2.5^^

^^^ when two score are tied:^^

^^Scores share 2 places now, 2nd and 3rd. Because they are tied they get rank 2.5; average value of the places they share.^^

Wilcoxon test

Non-parametric, two-sample t-test
Example used: A clinical trial of a rare disease, 5 patients: 3 receive treatment, 2 control

Condition	Score
Treatment	22, 28, 24
Control	29, 32

Small sample → hard to determine whether assumption of normality is met

We look across all groups when it comes to ranking!

Wilcoxon test steps

Assumptions:

1. Data are rank-ordered 2. Two independent samples 3. No assumption with respect to distribution

Hypotheses

1. H0: The population distributions of the (quantitative) score are identical

1. → average rank in the groups is thew same 2. HA: The populations distributions of the (quantitative) score differ in such a way that the expected average ranks also differ

1. This implies that the average rank in the group differs

Statement about the distributions of the two populations → equality between two groups

Test statistic

1. Just the difference in average rank

Condition	Score	Average rank
Treatment	1,3,2	2
Control	4,5	4,5

\ 2 - 4,5 = -2.5.

Is this an extreme outcome, given the sample space?

\ Sample space: set of all possible outcomes

\ Under H0: all events are equally likely

Some events have same outcomes

\ W = difference in average rank

P-value:

1. Test statistic: -2.5 2. P(diff=-2.5 or more extreme) = 0.1 3. Two-sided = 0.1 x 2 = 20

P > 0.05 → can’t reject H0

Limited Power? * After all, more extreme outcomes were not even possible in this example!

\ 1 sided statement:

which of these is stated, what am I interested in
How is test statistic defined
Distribution drawing

Kruskal-Wallis Test

Non-parametric ANOVA
Example: 18 students, have to memorise words while listening to 3 types of music and recalls as many words as possible next day * Condition 1: 6 Bach * 2: 6 Venom * 3: 6 silence
Small sample → non-parametric
Convert data into ranks * Lowest value: lowest rank (1)

We look across all groups when it comes to ranking!

\ Steps

Compute average overall rank R
Compute average rank for the individual groups
Fill in KW-formula
Chi-squared distribution

1. df = groups - 1

Sign Test

Non-parametric paired t-test
Wilcoxon: independent, sign test: independent

\ Steps

Hypotheses:

1. H0: The proportion of ‘‘+’’P

\ \ \ \