Correlation and regression analysis

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/9

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

10 Terms

1
New cards

Correlations and regression analysis

knowt flashcard image
2
New cards

Correlation analysis what is it, hypothesis, correlation strength

3
New cards

Parametric vs nonparametric correlation

•If numerical variables and big sample -> parametric approach (Pearson correlation)

•If ordinal variables and/or small sample -> nonparametric approach (Spearman, Kendall correlation) Small sample always nonparametric approach

Sometimes Likert scale+big enough sample → parametric approach

4
New cards

Linear regression model

•A relationship between two variables

•Dependent variable (Y) – output

•Independend variable (X) - input

•Dependent variable is numeric in case of linear regression model

•If only one input variable, then simple regression, if more than two independent variables, then multiple regression

•Sample size is very important (at least 10 observations per variable)

•Y=β01X+u

•Y – dependent variable

•X – independent variable

•u – error term (the only source of randomness in Y)

•β0 – Y intercept

•β1 – the slope

5
New cards
<p>Linear regression model results</p><p></p>

Linear regression model results

knowt flashcard image
6
New cards

Diagnostics of regression model

•Normality of residuals

•Outliers

•Heteroscedasticity

•Multicollinearity

•Model specification

7
New cards

Logistic regression models

•Basically the same as linear regression, but dependent (output) variable is binary, not numeric

•For example, dependent variable can be:

•Unemployed (yes or no)

•Has started a company (yes or no)

•Bad loan (yes or no)

•Drives car daily (yes or no)

•Logit model

•Probit model

•Multinomial logistic model (if more than 2 categories)

8
New cards

Logit model coefficients

for coefficients - interpret only the direction positive or negative

Instead is better to interpret the odds ratio

<p>for coefficients - interpret only the direction positive or negative</p><p>Instead is better to interpret the odds ratio</p>
9
New cards

Logit model odds ratio

for odds ratio - easier to interpret for categorical variables

if it is bigger than 1, the effect is positive, if less negative

<p>for odds ratio - easier to interpret for categorical variables</p><p>if it is bigger than 1, the effect is positive, if less negative</p>
10
New cards
<p>Diagnostics of logistic regression model</p>

Diagnostics of logistic regression model

•How well our model classifies (predicts) the output variable - focusing on correctly classified - 81% of correctly classidied cases

•ROC curve (illustrates the diagnostic ability) - the better the model predicts the higher would be the area under ROC curve

<p><span>•How well our model classifies (predicts) the output variable - focusing on correctly classified - 81% of correctly classidied cases</span></p><p><span>•ROC curve (illustrates the diagnostic ability) - the better the model predicts the higher would be the area under ROC curve</span></p><p></p><p></p><p></p>