Statistical Tests

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 66

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

67 Terms

1

What assumptions need to be met for chi-squared tests

• The number of cells with expected frequencies less than 5, are less than 20%
• The minimum expected frequency is at the very least 1.

New cards
2

How can we use a regression model to predict values we don't have for our outcome variable?

Use linear regression model-> save-> click prediction intervals individual

New cards
3

What are the three assumptions for Multiple Regression Inference

1. The relationship between the dependent (Y) and each continuous independent variable (x variables) is linear.
2. Residuals or error terms e should be approximately normally distributed.
We can plot a histogram of the error terms
3. Homoscedasticity (stability in variance of residuals)

New cards
4

What are the steps to test mediation effect (Baron and Kenny )

1 test that the IDV(X) is associated with the DV(Y) c
2 test the association between the IDV and mediator(M) (a)
3 test the association between the mediator on the DV (b)
4 test the IDV association with the DV when controlling for the mediator (c')

New cards
5

What are the two methods to test indirect effects

- Sobel test (Normal Theory Approach)
- Nonparametric Sobel test, PROCESS (bootstrapping)

New cards
6

What is the sobel test for

Measuring the indirect effect of the mediator, ie ab

New cards
7

How do you calculate the estimated linear effect of a categorical variable in the interaction

B1+B3 x levels of modifier (0,1)

New cards
8

What is the confidence interval for?

A statistical estimate of how "good" the test statistic is, with lower % confidence being more cautious

New cards
9

When do we use an independent sample t test

for normally distributed continuous data, to test the differences between two groups/ variables

New cards
10

When do we use a Pearsons chi squared test

If two categorical variables are associated, that meet assumptions
To test if according to the current data, the proportions of two groups are significantly different from each other.

New cards
11

What is the McNemar test used for

test for paired categorical data, that meets assumptions, to see if proportions changed between the paired data (eg over time)

New cards
12

What do we need to look at before reporting a pearsons chi squared test

Assumptions:
• The number of cells with expected frequencies less than 5, are less than 20%
• The minimum expected frequency is at the very least 1.

New cards
13

What do we need to look at before reporting an independent samples t test

the Levene's test, if the levenes test is significant t p<.05, use second row- 'equal variances are not assumed"

New cards
14

What does X2 stand for

Chi square

New cards
15

What test do we use for non-parametric data to compare one group to another group

Mann-Whitney U test

New cards
16

What test is used for comparing independent categorical groups that don't meet assumptions

Fisher's exact test

New cards
17

What are the assumptions of the McNemar test

For McNemar's test, we needed at least 25 discordant observations, and paired 2×2 data, if there is less, or 3×3 or more data we look to the McNemar Bowker test

New cards
18

How do we interpret a scatterplot

See if there is a linear relationship (either positive or negative) and if we can add a linear fit line

New cards
19

How do we test the 3 assumptions for Multiple Regression Inference

1. Use scatterplot, residuals of the dependent variable (Y) plotted against residuals of each independent variable (x)
2.We can plot a histogram of the error terms to see if the errors more or less follow a normal distribution, and p plot
3. A scatterplot of standardised residuals epsilon and standardised predicted values shows no pattern. (ZPRED, ZRESID)

New cards
20

How do we test distributions

For continuous variables we use the histogram plot

New cards
21

How to we test our one sample data against a prediction or other values of population?

Use t test or chi square
Before using test make sure to analyse descriptives!! Look at distribution of data

New cards
22

What do we use the One sample chi square test for

Use to test proportion of results in data, default at 50/50
To test if according to the current data, the proportion in the population equals a certain, pre-specified, value.

New cards
23

How do we calculate the 95% confidence interval?

x̄ +/- (1.96 x SE)

New cards
24

How do we calculate the standard error

SE= sample SD / square root of sample size (n)

New cards
25

How do we calculate a confidence interval

x̄ +/- (CI x SE) , repeat for both upper and lower bound

New cards
26

What type of test to we use for a binary categorical Independent variable of interest and a continuous Dependent variable

T test (if parametric)

New cards
27

What type of test to we use for a categorical Independent variable of interest and a categorical Dependent variable

chi squared test (if assumptions met)

New cards
28

What type of test to we use for a continuous Independent variable of interest and a continuous Dependent variable

Simple linear regression

New cards
29

What type of test to we use for multiple continuous and categorical Independent variables of interest and a continuous Dependent variable

Multiple Linear Regression

New cards
30

When do we use a paired sample t test

When comparing two means from the same group of respondents, with both continuous variables being normally distributed

New cards
31

What test do we use for non-parametric data to compare one group to a pre-defined value

Wilcoxon sign rank test

New cards
32

What test do we use for non-parametric data to compare two related groups

Wilcoxon paired/ matched sign rank

New cards
33

When do we split a variable?

To check the distribution of levels of a categorical variable separately (eg gender or ethnicity) against a continuous outcome

New cards
34

When do we merge two variables?

Only merge paired variables when we want to check the distribution/suitability, we use the differences between paired data to check if it is normally distributed

New cards
35

What test do we use categorical data with >20% cells exp count less than 5 to compare one group to a pre-defined value

One sample Binomial Exact test

New cards
36

What test is used for comparing paired categorical groups that don't meet assumptions

McNemar Bowker test/ paired Binomial exact test

New cards
37

What do you report for non-parametric tests?

medians, with min-max, p value and df

New cards
38

What visualisation is used to see if two continuous variables are associated

Scatterplots are used to see the association between two continuous variables

New cards
39

Why do we use a correlation

When we need an objective measure of strength of a linear relationship.
Correlation 'r' is a method to quantify the Direction and Magnitude, of linear association between two continuous variables.

New cards
40

When do we use Pearson's correlation

When we have two continuous variables with a linear relationship and the data is parametric on a histogram

New cards
41

When do we use Spearman's correlation

When we have two continuous variables with a linear relationship and the data is not parametric/ skewed on a histogram

New cards
42

How do we interpret the results of a correlation?

Coefficient r : r value ranges from -1 to 1, the closer to 1/-1, the stronger the linear correlation (magnitude)
, and p value, the probability

New cards
43

What is a simple linear regression for

In statistical modelling, a regression model is a set of statistical processes for estimating the relationships among variables. These models describe the relationship between variables by fitting a line to the observed data.

New cards
44

What is the equation for a simple linear regression

Y=B0+B1 𝒙 + 𝜺

New cards
45

What is the difference between R2 and R2 adjusted

The R2 value indicates how much of the total variation in the dependent variable, the adjusted is this when adjusting for the number of predictors

New cards
46

What do we do when we have a categorical predictor that we want to test

We code it through dummy variables, with one level acting as a reference category (0)

New cards
47

What are dummy variables represented by

0 and 1

New cards
48

How many dummy variables are needed to represent a variable with 3 levels?

2, labelled d1 and d2, one will be the reference category

New cards
49

How do you use dummy variables in a linear regression with a categorical variable with 3 levels

Input coded dummy variables as usual in regression
If you input all dummy variables (including coded reference category) SPSS will choose a reference category and exclude it
The results table shows the dummy variables COMPARED to the reference category, and if there is a significant difference in the outcome

New cards
50

What is the equation for a multiple linear regression

Y=B0+B1 𝒙1 + B2 𝒙2 + 𝜺

New cards
51

What does x1 or x2 represent in the MLR model

x1 and x2 represent the independent variables of interest

New cards
52

What is the multiple linear regression equation that has a categorical predictor with 3 levels

Y= B0+B1 d1+ B2 d2+ 𝜀

New cards
53

What does R2 represent in a MLR model

R2 is often interpreted as the proportion of the variance in the dependent variable that is "explained" by the independent variables in the model.

New cards
54

What does a higher R2 mean

higher values of R2 indicating better prediction.

New cards
55

What does a B1 of 2 tell us?

That for a 1 unit increase in idv 1, there is an increase of 2 for the dv

New cards
56

What does M stand for

M = the mediator

New cards
57

What does a mediator do

A mediator (M) of the causal effect of independent variable (x1) on dependent variable (Y) is a variable x2 on the causal pathway from x1 to Y.

New cards
58

What is the symbol for the pathway of the total effect

c

New cards
59

What is the difference between complete and partial mediation

Partial mediation, there is still a significant effect of the IDV on the DV when controlling for the mediator but the effect is reduced
Complete mediation, the mediator eliminates the significant effect of the IDV on the DV

New cards
60

What is the equation to compute the indirect effect

SE(ab) = sqr a2 Sb2+ b2 Sa2

New cards
61

What is PROCESS used for

Check the 95% Bias-corrected bootstrap confidence interval in the model, both direct and indirect effects

New cards
62

What is needed for the indirect effect to be significant

the confidence interval does not contain 0

New cards
63

Which steps in the Baron Kenny method are essential to establish mediation

Steps 2 and 3 are essential for establishing mediation

New cards
64

What do you do if there might be a modifier

Create a new variable for the interaction between the IDV and the modifier
Then run a multiple linear regression including the new modifier with the previous IDV(s) and DV

New cards
65

What does Z represent

the modifier

New cards
66

What is B3

𝛃𝟑 is interpreted as the difference of the effect of 𝐱𝟏 on Y by levels of 𝐙 variable.

New cards
67

What are the DFBETA and DFFIT used for?

For outliers and strong influencers on the data
Look at both ascending and descending ( both - and + >1)
In the SDF1 column, if >1 has a strong influence on the model, may need to be removed

New cards

Explore top notes

note Note
studied byStudied by 10 people
752 days ago
5.0(1)
note Note
studied byStudied by 8 people
909 days ago
5.0(1)
note Note
studied byStudied by 56 people
899 days ago
5.0(1)
note Note
studied byStudied by 30 people
974 days ago
4.0(1)
note Note
studied byStudied by 1114 people
680 days ago
4.0(6)
note Note
studied byStudied by 58 people
1065 days ago
5.0(1)
note Note
studied byStudied by 6 people
760 days ago
5.0(1)
note Note
studied byStudied by 139196 people
332 days ago
4.8(594)

Explore top flashcards

flashcards Flashcard (49)
studied byStudied by 111 people
543 days ago
4.8(4)
flashcards Flashcard (138)
studied byStudied by 201 people
870 days ago
5.0(4)
flashcards Flashcard (40)
studied byStudied by 21 people
554 days ago
5.0(2)
flashcards Flashcard (60)
studied byStudied by 7 people
15 days ago
5.0(1)
flashcards Flashcard (63)
studied byStudied by 3 people
739 days ago
5.0(1)
flashcards Flashcard (36)
studied byStudied by 30 people
550 days ago
5.0(4)
flashcards Flashcard (28)
studied byStudied by 2 people
729 days ago
5.0(1)
flashcards Flashcard (46)
studied byStudied by 232 people
69 days ago
5.0(1)
robot