11 inferential stats

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/25

flashcard set

Earn XP

Description and Tags

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

26 Terms

1
New cards
null hypothesis significance testing (NHST)

1. formulate a set of hypotheses and assume the null hypothesis
2. collect data
3. calculate probability (*p-value*) and t-obt value
4. decide whether to reject or retain null hypothesis
2
New cards
null hypothesis (Ho)
baseline conservative assumption, IV doesn’t affect DV, no relationship between 2 variables
3
New cards
research hypothesis (H1 or Ha)
IV does affect DV, there IS a relationship between 2 variables, usually the hypothesis you hope will turn out to be true
4
New cards
mutually exclusive
cannot have both hypotheses be true at the same time
5
New cards
exhaustive
must account for all possible outcomes
6
New cards
alpha level
most common cut-off choice in psychology: 5%/0.05, if p < alpha then reject null hypothesis, vice versa

* big alpha = easier it is to find data consistent with research hypothesis, thus greater power
7
New cards
type i error
rejecting the null hypothesis when is it actually true (ie. false alarm, false positive)
8
New cards
publication bias
bias that emerges from the fact that statistically significant data is far more likely to be published than non-significant data
9
New cards
type ii error
retaining the null hypothesis when it is actually false (ie. miss, false negative)
10
New cards
power
probability of correctly rejecting the null hypothesis, ability to detect an effect if one truly exists
11
New cards
sample size
the greater the sample size the greater the power, less error in data to detect effect
12
New cards
magnitude of effect (effect size)
the larger the difference is in population, the easier it is to detect, therefore greater power → as effect size increase, type 2 error decreases, but type 1 error has NO change
13
New cards
william gosset
inventor of t-test, aka Student’s t-test
14
New cards
obtained values (t-obt)
a statistic that captures the r effect observed in your study → calculate t-obt in a t-test
15
New cards
t-obt equation
difference between means/variability in data
16
New cards
examples of noise
poorly worded questions, effect of uncontrolled variables, small sample size
17
New cards
degree of freedom
total number of participants minus number of groups → (n1 + n2 - 2)

* if mean is known then n-1 is the degree of freedom as the last value has to be fixed to match the mean
18
New cards
relationship between p-values & t-obt
p-value can be visualized as an amount of area in the t-distribution that corresponds to a particular value to t-obt, __the higher the t-obt, the lower the p-value = good!__
19
New cards
relationship between alpha level & critical value
alpha levels correspond to critical values (t-crit) the same way that p-values correspond to t-obt
20
New cards
if t-obt < t-crit
retain null hypothesis, p-value > alpha
21
New cards
if t-obt > t-crit
reject null hypothesis, p-value < alpha
22
New cards
p-hacking
set of thically questionable practices researcher use to get significant results, ie. keep adding participants to run study until p-value becomes statistically significant
23
New cards
\+ ways to p-hack
* toss out participants that disagree with your hypothesis
* use many measures of same construct, only report results that show significance
* find/report random interactions as though you predicted them, even if you didn’t
24
New cards
F-test (ANOVA)
extension of the t-test that can be used in many more research designs, usually used when there are more than 2 levels of an IV or when a factorial design with 2 or more IVs has been used
25
New cards
systematic variance
aka between-group variance, numerator in equation, deviation of the group means from the grand mean (mean score of all participants across all conditions)
26
New cards
error variance
aka within-group variance, denominator in equation, captures how much individual scores in each group deviate from respective group means