Module 6 - Correlation & Regression

studied byStudied by 9 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 44

flashcard set

Earn XP

Description and Tags

45 Terms

1

Correlation

relationship between two continuous variables, measure degree data points cluster around regression line

New cards
2

Line of Best Fit

show general pattern of relationship between dependent & independent variable

New cards
3

Correlation Coefficients

measure direction & magnitude of relationship between independent & dependent variable

New cards
4

Magnitude of Association

Numerical value of correlation coefficient, show strength of association, 0 = no linear association, -/+ 1 = perfect linear association

New cards
5

Direction of Association

If correlation coefficient positive or negative, show directionality of relationship (pos or neg correlation)

New cards
6

Pearson Correlation

measure degree of relationship between linear related variables

New cards
7

Assumptions of Pearson Correlation

scale measurements, normal distribution, 2 variables = paired, no outliers, linearity, homoscedasticity

New cards
8

Linearity

straight line relationship between 2 variables, as x increase -> y increase/decrease, check with scatter plot visualization

New cards
9

Homoscedasticity

equal spread of data around line of best fit, data = homoscedastic or heteroscedastic, check with scatter plot

New cards
10

r

correlation coefficient

New cards
11

Spearman's Rank Order Correlation

Non-parametric Pearson's correlation, ranks data to explore relationship between 2 variables

New cards
12

Assumptions of Spearman's Rank Order Correlation

scale/ordinal data, any distribution, linear relationship between variables

New cards
13

Intraclass Correlation Coefficient (ICC)

used to evaluate inter-rate reliability, test-retest reliability & intra-rater reliability; for data structured as groups (not pairs)

New cards
14

Inter-Rater Reliability

variation between >=2 raters, measuring same event

New cards
15

Test-Retest Reliability

variation in 2 measurements under same conditions

New cards
16

Intra-Rater Reliability

variation within 1 rater across >= 2 trails

New cards
17

Factors Impacting Correlations

Restricting Data Range (can sometimes be good), Heterogenous Samples, Outliers (alter correlation)

New cards
18

Clinical use of correlation

reliability to clinical assessments tools, impact medical decision making

New cards
19

Regression

also explore relationship between variables, how explanatory variable impact response variable

New cards
20

Response variable

variable you are predicting, outcome/dependent/y variable

New cards
21

explanatory variable

variable you use to predict, x/independent variable

New cards
22

residuals

distance observed y lies from regression line

New cards
23

epsilon

error term, represent residual

New cards
24

beta 0

y-intercept

New cards
25

beta 1

slope of regression line

New cards
26

Least Squares Method

regression - chooses values of y-intercept & slope that minimize sum of squared residuals

New cards
27

what does a lower squares value mean?

smaller difference between data points & line of best fit

New cards
28

Regression Assumptions

scale data, residuals of regression line are normally distributed, no outliers, linear relationship between 2 variables, data = homoscedastic

New cards
29

Least Squares Regression Model

Sum of residuals = 0, line of best fit passes through mean of x & mean of y

New cards
30

R square

coefficient of determination, amount of variation in y explained by x

New cards
31
<p>interpret image coefficients</p>

interpret image coefficients

y-intercept = -77.283, slope = 3.33

New cards
32
<p>Interpret model summary</p>

Interpret model summary

R = simple correlation between variables, 57% of variation of y is explained by X

New cards
33
<p>Interpret ANOVA results</p>

Interpret ANOVA results

model significantly predicts y

New cards
34

r squared characteristics

always positive, as approached 0 = low variation in Y determined by x, max = 1 (all variability in y is determined by x

New cards
35

Linear Regression

1 explanatory & 1 response variable, scale measurements

New cards
36

Multiple Linear Regressions

1 response variable, >1 explanatory variable, scale measurements

New cards
37

Logistic Regression

1 explanatory variable, 1 response variable (dichotomous)

New cards
38

statistical significance

probability of event occurring due to random chance

New cards
39

clinical significance

event/difference is meaningful for a clinical reason

New cards
40

biological significance

whether finding has biological relevance

New cards
41

Data reproducibility

ability to reproduce/replicate findings

New cards
42

Replication crisis

our current inability to reproduce scientific results

New cards
43

causes for replication crisis

publication bias, bad study design & power, questionable research practices

New cards
44

Questionable Research Practices

p-hacking, selective reporting, sampling bias, HARKing ( Hypothesis After Result is Known)

New cards
45

Solutions to replicability crisis

preregistration of studies, replication studies, open science, alternative statistical approaches, education

New cards
robot