IPR - WEEK 7 - statistical testing theory

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/24

flashcard set

Earn XP

Description and Tags

- Hypothesis testing -

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

What is a hypothesis?

A clear, testable statement or prediction about what you expect to find i.e. about the variables or the outcomes

proposes a potential explanation or effect that can be examined through your experiment / analysis

2
New cards

Key features of a hypothesis?

  1. testable - it must be possible to test this prediction using the experiment / observation (i.e. the variables you selected)

  2. Specific: Should clearly define the variables and expected effect or relationships

  3. falsifiable: should have a way to prove the hypothesis wrong if its incorrect (i.e., testable and realistic)

3
New cards

What are the types of hypothesis?

  • Null Hypothesis (H0)

  • Alternative Hypothesis (H1)

    • One tailed

    • Two-tailed

4
New cards

Null hypothesis

There will be no effect or relationship

e.g. no difference in wellbeing scores between the placebo and treatment group

5
New cards

Alternative hypthesis?

  • There will be an effect or relationship

  • one-tailed = effect is expected direction is specified e.g. students in the treatment group wull have higher wellbeing scores

  • two-tailed = an effect is ecpected but direction not specified e.g. wellbeing scores will differ between the treatment and placebo group

6
New cards

When should ur hypothesis be one or two-tailed?

  • one tailed = rare, when there is specific direction or strong theoretical justification. only used if an opposite direction is impossible or irrelevant.

  • two-tailed = when you predict a difference but not the direction. Direction could go wither way

7
New cards

To test a hypothesis, we need to analyses, what do most analyses test provide?

  • Test statistic e.g. Z-scores - summarise s how dar samole result is from H0 expectations

  • ie.e., differences between your observed data (your effect) and what is expected under the Null Hypothesis (there beong no effect)

  • Each statistic has an associated p-value

8
New cards

Whats a p-value?

Probavility of observing data as extreme (or more extreme) assuming there is no effect (if the null hypothesis is true)

it determined the strength of evidence against the Null Hypothesis

  • the compared to a set criterion: the significance alpha level (a) to determine if you can reject the Null Hypothesis

9
New cards

P levels compared to the sognifocance alpha level (a), to determine if you can reject the Null Hypothesis?

  • If p < a → reject H0 (result is statistically significant

  • If p ≥ a → fail to reject H0

10
New cards

If there is a small p value?

data are rare under H0 → reject H0

strong evidence against H0 - EFFECT LIKELY TO EXIST

does not tell you your Alternative Hypothesis is true

11
New cards

Phrasinf of null hypothesis in write ups:

If p < α → “We reject the null hypothesis” (evidence suggests there is
some difference)
If p ≥ α → “We fail to reject the null hypothesis” (insufficient evidence to
conclude there is a difference)

12
New cards

What is the significance / alpha level ( a) ?

Set threshold / criteria that quantifies the strength of evidence against the Null hypothesis, i.e. threshold of deciding whether to reject H0

13
New cards

What is the typical alpha level (a)

.05 (other values acceptable, .01 annd .001

14
New cards

what does the a value represent?

The maximum (acceptable) probability of rejecting the null hypothesis (H0) when the null hypothesis is actually true

i.e. if you accept up to a 5% risk of claiming there is an effect when there is not

15
New cards

p value > .05

Not rare under null hypothesis → fail to reject null hypothesis ( H0)

16
New cards

P-value < .05

rare under null hypothesis → reject H0

17
New cards

Significance level for two tailed distribution?

The alpha (.05) is divided equally between the 2 of them

a/2 = 0.25

still interpreted as p < 0.5 = reject h0

18
New cards

significance level for one tailed?

critical region (tail) is on one specific side

if p < .05 = reject h0

will ignore anything extreme on the other side

19
New cards

When a = .05 and p < a, the result falls in the critical region (5% tail) suggesting:

  • Our result is rare and under the assumption of no diffference and inconsistent eith the middle 95% of values we would expect

  • if the null hypothesis were true (no difference), there is a 5% probability of getting this extreme (rare)

  • This suggests the result would be very unlikely if H0 were true. We have evidence against H0 and reject it.

20
New cards

What is a TYPE I ERROR?

When there is NO effect and we say '“we reject the null hypothesis”

no effect or difference exists, but say there is

probability of making a Type I error is the a level

i.e., fales positive

21
New cards

What is a TYPE II ERROR?

When there is an effect and we say “We fail to reject the null hypothesis”

An effect or difference exists, which we miss

probability of making a Type II erro ris denoted as B (beta)

i.e., false negative

22
New cards

Statistical power

  • if an effect truly exists in the population, power is the likelihood your test will detect it

  • so power is about correctly rejecting the null hypothesis when the alternative hyothesis is true

  • probability of avoiding a TYPE II error

  • Power = 1 - B

23
New cards

whats a commonly used statisitcal power

.80 (80%)

24
New cards

Fsctors that affect power?

  • small effect between conditions (larger effects → higher power)

  • sample sie is too small (bigger samples → higher power)

  • alpha level threshold is strict (e.g., .01 - > lower power)

  • variability in the data ( more noise → lower power)

  • Test type/hypothesis: one-tailed tests → higher power than two tailed

25
New cards

effect size

an idea of the sixe of the effect we found