Home
Explore
Exams
Search for anything
Search for anything
Login
Get started
Home
Math
Statistics
Probability
Module 6 - Correlation & Regression
Studied by 9 people
0.0
(0)
Add a rating
Learn
A personalized and smart learning plan
Practice Test
Take a test on your terms and definitions
Spaced Repetition
Scientifically backed study method
Matching Game
How quick can you match all your cards?
Flashcards
Study terms and definitions
1 / 44
Earn XP
Description and Tags
Statistics
Probability
University/Undergrad
Add tags
45 Terms
View all (45)
Star these 45
1
Correlation
relationship between two continuous variables, measure degree data points cluster around regression line
New cards
2
Line of Best Fit
show general pattern of relationship between dependent & independent variable
New cards
3
Correlation Coefficients
measure direction & magnitude of relationship between independent & dependent variable
New cards
4
Magnitude of Association
Numerical value of correlation coefficient, show strength of association, 0 = no linear association, -/+ 1 = perfect linear association
New cards
5
Direction of Association
If correlation coefficient positive or negative, show directionality of relationship (pos or neg correlation)
New cards
6
Pearson Correlation
measure degree of relationship between linear related variables
New cards
7
Assumptions of Pearson Correlation
scale measurements, normal distribution, 2 variables = paired, no outliers, linearity, homoscedasticity
New cards
8
Linearity
straight line relationship between 2 variables, as x increase -> y increase/decrease, check with scatter plot visualization
New cards
9
Homoscedasticity
equal spread of data around line of best fit, data = homoscedastic or heteroscedastic, check with scatter plot
New cards
10
r
correlation coefficient
New cards
11
Spearman's Rank Order Correlation
Non-parametric Pearson's correlation, ranks data to explore relationship between 2 variables
New cards
12
Assumptions of Spearman's Rank Order Correlation
scale/ordinal data, any distribution, linear relationship between variables
New cards
13
Intraclass Correlation Coefficient (ICC)
used to evaluate inter-rate reliability, test-retest reliability & intra-rater reliability; for data structured as groups (not pairs)
New cards
14
Inter-Rater Reliability
variation between >=2 raters, measuring same event
New cards
15
Test-Retest Reliability
variation in 2 measurements under same conditions
New cards
16
Intra-Rater Reliability
variation within 1 rater across >= 2 trails
New cards
17
Factors Impacting Correlations
Restricting Data Range (can sometimes be good), Heterogenous Samples, Outliers (alter correlation)
New cards
18
Clinical use of correlation
reliability to clinical assessments tools, impact medical decision making
New cards
19
Regression
also explore relationship between variables, how explanatory variable impact response variable
New cards
20
Response variable
variable you are predicting, outcome/dependent/y variable
New cards
21
explanatory variable
variable you use to predict, x/independent variable
New cards
22
residuals
distance observed y lies from regression line
New cards
23
epsilon
error term, represent residual
New cards
24
beta 0
y-intercept
New cards
25
beta 1
slope of regression line
New cards
26
Least Squares Method
regression - chooses values of y-intercept & slope that minimize sum of squared residuals
New cards
27
what does a lower squares value mean?
smaller difference between data points & line of best fit
New cards
28
Regression Assumptions
scale data, residuals of regression line are normally distributed, no outliers, linear relationship between 2 variables, data = homoscedastic
New cards
29
Least Squares Regression Model
Sum of residuals = 0, line of best fit passes through mean of x & mean of y
New cards
30
R square
coefficient of determination, amount of variation in y explained by x
New cards
31
interpret image coefficients
y-intercept = -77.283, slope = 3.33
New cards
32
Interpret model summary
R = simple correlation between variables, 57% of variation of y is explained by X
New cards
33
Interpret ANOVA results
model significantly predicts y
New cards
34
r squared characteristics
always positive, as approached 0 = low variation in Y determined by x, max = 1 (all variability in y is determined by x
New cards
35
Linear Regression
1 explanatory & 1 response variable, scale measurements
New cards
36
Multiple Linear Regressions
1 response variable, >1 explanatory variable, scale measurements
New cards
37
Logistic Regression
1 explanatory variable, 1 response variable (dichotomous)
New cards
38
statistical significance
probability of event occurring due to random chance
New cards
39
clinical significance
event/difference is meaningful for a clinical reason
New cards
40
biological significance
whether finding has biological relevance
New cards
41
Data reproducibility
ability to reproduce/replicate findings
New cards
42
Replication crisis
our current inability to reproduce scientific results
New cards
43
causes for replication crisis
publication bias, bad study design & power, questionable research practices
New cards
44
Questionable Research Practices
p-hacking, selective reporting, sampling bias, HARKing ( Hypothesis After Result is Known)
New cards
45
Solutions to replicability crisis
preregistration of studies, replication studies, open science, alternative statistical approaches, education
New cards