all quantitative

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/47

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 11:21 AM on 5/20/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

48 Terms

1
New cards

what is a correlational design

Testing associations of X and Y using a single group of participants (based on observations of what is going on - this naturally occurs)

  • regression allows for predicitions to be made

2
New cards

how do we prove causation instead of just correlation

you need to manipulate variables to test for causation

3
New cards

what is covariance

how much the variables vary in relation to one another

4
New cards

what is variance

the average amount the data varies around the mean

5
New cards

what is the third variable problem

in correlation you can’t assume causality as there is likely a third variable effecting the outcome that isnt being measured

6
New cards

what is spurious correlations

there is no obvious link between variables but yet a strong correlation is present

e.g. naming your child olivia and how many potatoes grew in that year

7
New cards

What is pearsons r

  • standard correlation coefficient

  • continous and parametric

  • in correlation DOF is N-2

8
New cards

what is R vs R squared

R = correlation coefficient = -1 to 1

R2 = how much the variance in one variable is explained by the variance in the other variable

  • ranges from 0-1 (0 = there is no explanatory power, 1 is a perfect fit)

9
New cards

what are the non parametric tests

Spearmans Rho and Kendalls tau for correlation

  • we don’t assume the data is normally distriubuted

10
New cards

how do correlational tests work for non parametric testing

  • principle of ranking the data - e.g.the lowest score = 1

  • a similar number of highs and lows between groups suggests dependent variable doesnt differ between groups

<ul><li><p>principle of ranking the data - e.g.the lowest score = 1</p></li><li><p>a similar number of highs and lows between groups suggests dependent variable doesnt differ between groups</p></li></ul><p></p>
11
New cards

What is Spearmans Rho

  • pearsons correlation test on ranked ordinal data (does not use raw data)

  • stronger relationship than Kendals

  • less sensitive to outliers than Pearsons

E.g. does a high score on X correlate with a high score on Y

12
New cards

What is Kendal’s tau

  • better than Spearmans for small sample

  • tied ranks e.g. is there agrrements between observation

E.g. does one judges scores for contestants correlate with the other judges scores

13
New cards

when in PHI used

  • categorical → correlation between two categorical variables based on chi squared

  • the chi squared tells us if there is a relationship then look for PHI for the correlation coefficient

  • ranges from -1 to 1

  • when both variables are binary (yes/no)

14
New cards

when is point biserial correlational

one categorical (binary) and one continious

looks the same as pearsons but sign doesnt tell us the direction

15
New cards

What is partial correlation

Measures the relationship but adjusting for the effect of the third variable (on both variables)

  • on SPSS the bottom R value gives you the score when controlling for a third variable (e.g. time spent revising)

<p>Measures the relationship but adjusting for the effect of the third variable (on both variables)</p><ul><li><p>on SPSS the bottom R value gives you the score when controlling for a third variable (e.g. time spent revising)</p></li></ul><p></p>
16
New cards

What is semi-partial correlation

Only applies to regression, remove the additional variance in one of our predictor variables

17
New cards

what is the difference between correlational research and experimental research

correlational = we observe naturally what is going on

experimental = we manipulate variables to test something

18
New cards

What is regression

  • we make a model to predict the DV using the IV

19
New cards

What is the linear model of regression

Y = b1X + B0

  • slope (gradient) = B1 → direction of the relationship

  • Intercept = B0

20
New cards

how do you draw a line of best fit through regression slope

  • each point will not sit perfectly on the line - this is the residual error

  • some points are above some are below - these would cancel

  • so we sqaure them then add → known as SSR (residual sum of squares)

We want a model with this value as low as possible

21
New cards

What is SST, SSR and SSM

SST (total sum of squares) = all the variation in your data, how much it is spread throughout the mean

SSR(residual/error) = variation your model fails to explain

SSM (model) = variation your model does explain → we want this number to be high

SST = SSM + SSR

22
New cards

What is R squared

SSM/SST → how good your model is overall at explaining the variation

23
New cards

what is the F statistic

is the model actually better than just guessing the mean

  • large F which is sig. model explains more than no model

  • ratio of how much mean variance can be explained by how much mean error

24
New cards

What are the assumptions of regression

  • independence

  • continous dependent variable / continous or categorical independent variable

  • sample size

    • Liberal - 15 participants per predictor (stevens)

    • Conservative = N = 50 + 8m (ivs) (green)

  • Linearity

  • Outliers

  • Multicollinearity

25
New cards

What is a univariate outlier

only present in one variable - one participant varies from the rest → can be seen in box plot/ stem and leaf diagram

26
New cards

what is a multivariant outlier

  • result from the combinaion of two or more variables

    • e.g. they are in the same range as the others but their pattern varies

27
New cards

how to find out a multivariant outlier

  • standerdised residuals, if it is +3 or -3 in the ANOVA spreadsheet on SPSS

  • Cooks distance - how each individual influences the overall fit of the model → if it is greater than 1 we don’t want that

28
New cards

what is multicollinearity (VIF)

-exists if predictors are highly correleated

VIF - indicates a strong linear relationship → 0.8 and above but we want this less than 10

29
New cards

what is multiple regression

two or more predictor variables and one outcome variable (DV)

30
New cards

what are the two hypothesis driven ways to enter data into SPSS

Forced Entry (enter) - all the variables are entered at the same time → researcher does not decide the order

Hierarchical - predictors are entered based on previous research are entered first then we add the others

  • good for testing a theory → bad becuase it relies on pre competence of research and researcher

31
New cards

What are the two data driven methods of entry in SPSS

stepwise

  • we want to avoid this in psychology → only used in exploration research as SPSS will intepret any size difference as importanr

Forward

Backward

32
New cards

what does standerdised beta allow for

comparisions because they are both standard deviations

33
New cards

What is a population?

A population is the collection of units to which we want to generalise a set of findings or a statistical model.

34
New cards

What is a sample?

A sample is a smaller but hopefully more representative collection of units from a population used to determine truths about that population.

35
New cards

What is central tendency?

Central tendency is an indication of the typical scores in a sample and is an estimate of the middle point in the scores; examples include the mean, mode, and median.

36
New cards

What is sampling error and why does it occur?

Sampling error occurs because we are not testing all members of the target population, meaning we are likely to overestimate or underestimate the mean.

37
New cards

How does sample size affect sampling error?

The larger the sample, the more likely we are to measure the extremes of the values, which makes the sample mean more likely to be a good estimate of the population mean.

38
New cards

What is sample error?

Sample error is the difference between the population statistics and the sample statistics.

39
New cards

What is dispersion?

how much variation in a dataset which indicates how much variation there is in the sample.

  • range, sum of squared error, variance and standard deviations

40
New cards

What is the sum of squared errors?

adding the total squared deviations from the mean to eliminate the issue of positive and negative deviations cancelling each other out.

41
New cards

How is variance calculated from the sum of squared errors?

Variance is calculated by dividing the sum of squared errors by the number of scores to account for the actual size of the sample.

42
New cards

What is the standard deviation?

The standard deviation is the square root of the variance, returning the measure to the original units.

  • how much the data varies around the mean.

43
New cards

What is the standard error of the mean?

The standard error of the mean indicates how representative the sample is of the population.

44
New cards

What is a sampling distribution?

A sampling distribution is a hypothetical distribution created by selecting an infinite number of samples from a population, calculating a statistic for each sample, and plotting them as a histogram.

45
New cards

Why is a sampling distribution considered hypothetical?

A sampling distribution is hypothetical because we do not have access to the entire population and therefore cannot observe all possible samples.

46
New cards

How does sample size affect a sampling distribution?

Larger samples produce a smaller sampling error within the sampling distribution.

47
New cards

What is standard error?

Standard error refers to the standard deviation of a sampling distribution, specifically the standard deviation of all sample means.

48
New cards

What does standard error tell us about the sample mean?

Standard error tells us how good the sample mean is as a representation of the population mean.