Binary Logistics Regression in Correlational Research

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/79

flashcard set

Earn XP

Description and Tags

Lecture 11

Last updated 10:45 AM on 1/16/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

80 Terms

1
New cards

Binary Logistic Regression

Predict a binary dependent variable based on one or multiple quantitative or categorical independent variables.

2
New cards

Dependent variable

A variable that is being tested and measured in an experiment.

3
New cards

Independent variable

A variable that is manipulated to observe its effect on the dependent variable.

4
New cards

Continuous variable

A variable that can take an infinite number of values within a given range.

5
New cards

Categorical variable

A variable that can take on one of a limited, fixed number of possible values.

6
New cards

Linear regression

A statistical method for modeling the relationship between a dependent variable and one or more independent variables using a linear equation.

7
New cards

Logistic regression

A statistical method for predicting the outcome of a binary dependent variable based on one or more independent variables.

8
New cards

ANCOVA

Analysis of Covariance, a blend of ANOVA and regression that evaluates whether population means of a dependent variable differ across levels of a categorical independent variable while statistically controlling for the effects of other continuous variables.

9
New cards

χ²-tests

Chi-squared tests, statistical tests used to determine if there is a significant association between categorical variables.

10
New cards

t-test

A statistical test used to compare the means of two groups.

11
New cards

ANOVA

Analysis of Variance, a statistical method used to compare the means of three or more samples.

12
New cards

Contingency tables

A type of table in a matrix format that displays the frequency distribution of the variables.

13
New cards

Dichotomous dependent variables

Dependent variables that have two possible outcomes, such as pass/fail or alive/dead.

14
New cards

Research question

A question that a research project sets out to answer.

15
New cards

Sample size

The number of subjects included in a study, in this case, 75 students.

16
New cards

Raw Data

The initial data collected that has not been processed or analyzed.

17
New cards

Prediction errors

The difference between the predicted values and the actual values.

18
New cards

Linear Regression Equation

𝑌′ = 𝑏0 + 𝑏1𝑋, where 𝑌 is the predicted value, 𝑏0 is the intercept, and 𝑏1 is the slope.

19
New cards

Assumption of normality

The assumption that the errors in a regression analysis are normally distributed.

20
New cards

Trend analysis

The practice of collecting information and attempting to spot a pattern, or trend, in the data.

21
New cards

Prediction Errors

The errors e (prediction errors) are not normally distributed; so this assumption of linear regression analysis is violated.

22
New cards

Probability Range

Probabilities lie between 0 and 1.

23
New cards

Positive Relationship

The more hours someone studies, the higher the probability that he/she will obtain the bachelor's degree.

24
New cards

Negative Relationship

The more hours someone studies, the lower the probability that he/she will obtain the bachelor's degree.

25
New cards

Logistic Function

A mathematical function used to describe the relationship between hours studied and the probability of obtaining a bachelor's degree.

26
New cards

Logistic Function Formula

𝑃𝑌= 1 𝑋= 𝑒(𝑏0+𝑏1𝑋) / (1+𝑒(𝑏0+𝑏1𝑋))

27
New cards

Base of Natural Logarithm

𝑒 = 2.718282.

28
New cards

S-shaped Function

A non-linear function that takes values between 0 and 1.

29
New cards

Logistical Coefficients

𝑏0 and 𝑏1 are the logistical (regression) coefficients.

30
New cards

Example Logistic Function

Example with 𝑏0 = −1 and 𝑏1 = 1.5 → 𝑃𝑌= 1 𝑋= 𝑒(−1+1.5𝑋) / (1+𝑒(−1+1.5𝑋)).

31
New cards

Parameter b1

Determines the slope of the functions, and whether the function is increasing (𝑏1 > 0) or decreasing (𝑏1 < 0).

32
New cards

Using Logistic Function

Logistic function is a non-linear function; linear regression techniques cannot be applied.

33
New cards

Transforming Probabilities

Using some simple (mathematical) steps, we can turn a non-linear logistic regression function into a linear function.

34
New cards

Transforming to Odds

First, we transform probabilities to odds.

35
New cards

Proportion of Obtaining Bachelor's

We could calculate the proportion of obtained bachelor degrees for each score on the variable hours studying.

36
New cards

Logistic Function with b0 = 0

Logistic function: 𝑃𝑌= 1 𝑋= 𝑒(𝑏0+𝑏1𝑋) / (1+𝑒(𝑏0+𝑏1𝑋)) = 𝑒𝑏1𝑋 / (1+𝑒𝑏1𝑋).

37
New cards

Logistic Function Examples

Examples of logistic functions with different 𝑏1 and 𝑏0 = 0.

38
New cards

Logit

logit Y = 1 X = ln odds[Y = 1|X]

39
New cards

Natural logarithm

ln is the natural logarithm (logarithm with the base e)

40
New cards

Logistic model

logit Y = 1 X = b0 + b1X

41
New cards

From probability to odds

odds = P / (1 - P)

42
New cards

From odds to probability

P = odds / (1 + odds)

43
New cards

From odds to logit

logit = ln(odds)

44
New cards

From probability to logit

logit = ln P / (1 - P)

45
New cards

From logit to odds

odds = e^logit

46
New cards

From logit to probability

P = e^logit / (1 + e^logit)

47
New cards

Interpretation of Odds

How many times larger is the probability of Y=1 than of Y=0 given X?

48
New cards

Example of Odds Calculation

P(Y=1|X) = 0.10 → odds = 0.10/(1 − 0.10) = 0.10/0.90 = 0.111

49
New cards

Relationship between scales

If the probability increases, the odds and logit also increase, and vice versa.

50
New cards

Probability and Odds Table

Probability (P) | Odds | Logit

51
New cards

Probability 0.5

P = 0.5 → Odds = 1 → Logit = 0

52
New cards

Probability less than 0.5

0 ≤ P < 0.5 → 0 < odds < 1 → Negative Logit

53
New cards

Probability greater than 0.5

0.5 < P ≤ 1 → Odds > 1 → Positive Logit

54
New cards

Logit Interpretation

The logit is a linear function of X.

55
New cards

Graph of Odds

The corresponding graph for b0 = -1 and b1 = 1.5 shows Odds[Y=1|X].

56
New cards

Graph of Logit

The corresponding graph for b0 = -1 and b1 = 1.5 shows Logit[Y=1|X].

57
New cards

Exam Question Example

What are the odds of guessing the correct answer in an exam with four categories?

58
New cards

Odds

odds = 𝑃/(1 −𝑃)

59
New cards

Probability from Odds

𝑃= odds/(1 + odds)

60
New cards

Logit from Odds

logit = ln(odds)

61
New cards

Logit from Probability

logit = ln 𝑃/(1 −𝑃)

62
New cards

Odds from Logit

odds = 𝑒logit

63
New cards

Probability from Logit

𝑃=𝑒logit/(1+𝑒logit)

64
New cards

Logistic Regression Analysis

A statistical method to predict the outcome of a dependent variable based on one or more independent variables.

65
New cards

Independent Variable

Number of hours studied

66
New cards

Dependent Variable

Obtaining (𝑌= 1) or not obtaining (𝑌= 0) the Bachelor's degree in three years

67
New cards

Logit Equation

Logit[obtaining bachelor's degree] = −4.9 + 0.294Hours

68
New cards

Effect of Hours on Logit

If the amount of hours studied increases, the logit - and thus probability - of obtaining the Bachelor's degree also increases.

69
New cards

Numerical Example for Logit

Logit = −4.9 + 0.294 ∙Hours

70
New cards

Effect of Hours on Odds

odds = 𝑒logit = 𝑒−4.9+0.249∙Hours

71
New cards

Odds Increase Factor

When the number of hours studied increases by one, the odds increase by a factor 𝑒0.294 = 1.342.

72
New cards

Wald Test Hypotheses

H0: 𝐵= 0 against H1: 𝐵≠0

73
New cards

Significant Effect of Hours

There is a significant effect of Hours on the probability of obtaining the Bachelor's degree.

74
New cards

Logit Equation for Blood Pressure

Logit(BloodPressure) = -4.2 + 0.07*Age

75
New cards

Probability Calculation

What is the probability that a 40 year-old person has a high blood pressure?

76
New cards

Age Probability Threshold

At what age is the probability that someone has a high blood pressure 0.5?

77
New cards

SPSS Output Example

B: .294, S.E.: .067, Wald: 18.994, df: 1, Sig.: .000, Exp(B): 1.342

78
New cards

Constant in SPSS Output

Constant: -4.900, S.E.: 1.157, Wald: 17.923, df: 1, Sig.: .000, Exp(B): .007

79
New cards

Logit for 15 Hours

Logit = -0.490, Odds = .380, Probability = .380

80
New cards

Logit for 20 Hours

Logit = 0.980, Odds = 2.664, Probability = .727