Binary Logistics Regression in Correlational Research

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 79

flashcard set

Earn XP

Description and Tags

Lecture 11

80 Terms

1

Binary Logistic Regression

Predict a binary dependent variable based on one or multiple quantitative or categorical independent variables.

New cards
2

Dependent variable

A variable that is being tested and measured in an experiment.

New cards
3

Independent variable

A variable that is manipulated to observe its effect on the dependent variable.

New cards
4

Continuous variable

A variable that can take an infinite number of values within a given range.

New cards
5

Categorical variable

A variable that can take on one of a limited, fixed number of possible values.

New cards
6

Linear regression

A statistical method for modeling the relationship between a dependent variable and one or more independent variables using a linear equation.

New cards
7

Logistic regression

A statistical method for predicting the outcome of a binary dependent variable based on one or more independent variables.

New cards
8

ANCOVA

Analysis of Covariance, a blend of ANOVA and regression that evaluates whether population means of a dependent variable differ across levels of a categorical independent variable while statistically controlling for the effects of other continuous variables.

New cards
9

χ²-tests

Chi-squared tests, statistical tests used to determine if there is a significant association between categorical variables.

New cards
10

t-test

A statistical test used to compare the means of two groups.

New cards
11

ANOVA

Analysis of Variance, a statistical method used to compare the means of three or more samples.

New cards
12

Contingency tables

A type of table in a matrix format that displays the frequency distribution of the variables.

New cards
13

Dichotomous dependent variables

Dependent variables that have two possible outcomes, such as pass/fail or alive/dead.

New cards
14

Research question

A question that a research project sets out to answer.

New cards
15

Sample size

The number of subjects included in a study, in this case, 75 students.

New cards
16

Raw Data

The initial data collected that has not been processed or analyzed.

New cards
17

Prediction errors

The difference between the predicted values and the actual values.

New cards
18

Linear Regression Equation

𝑌′ = 𝑏0 + 𝑏1𝑋, where 𝑌 is the predicted value, 𝑏0 is the intercept, and 𝑏1 is the slope.

New cards
19

Assumption of normality

The assumption that the errors in a regression analysis are normally distributed.

New cards
20

Trend analysis

The practice of collecting information and attempting to spot a pattern, or trend, in the data.

New cards
21

Prediction Errors

The errors e (prediction errors) are not normally distributed; so this assumption of linear regression analysis is violated.

New cards
22

Probability Range

Probabilities lie between 0 and 1.

New cards
23

Positive Relationship

The more hours someone studies, the higher the probability that he/she will obtain the bachelor's degree.

New cards
24

Negative Relationship

The more hours someone studies, the lower the probability that he/she will obtain the bachelor's degree.

New cards
25

Logistic Function

A mathematical function used to describe the relationship between hours studied and the probability of obtaining a bachelor's degree.

New cards
26

Logistic Function Formula

𝑃𝑌= 1 𝑋= 𝑒(𝑏0+𝑏1𝑋) / (1+𝑒(𝑏0+𝑏1𝑋))

New cards
27

Base of Natural Logarithm

𝑒 = 2.718282.

New cards
28

S-shaped Function

A non-linear function that takes values between 0 and 1.

New cards
29

Logistical Coefficients

𝑏0 and 𝑏1 are the logistical (regression) coefficients.

New cards
30

Example Logistic Function

Example with 𝑏0 = −1 and 𝑏1 = 1.5 → 𝑃𝑌= 1 𝑋= 𝑒(−1+1.5𝑋) / (1+𝑒(−1+1.5𝑋)).

New cards
31

Parameter b1

Determines the slope of the functions, and whether the function is increasing (𝑏1 > 0) or decreasing (𝑏1 < 0).

New cards
32

Using Logistic Function

Logistic function is a non-linear function; linear regression techniques cannot be applied.

New cards
33

Transforming Probabilities

Using some simple (mathematical) steps, we can turn a non-linear logistic regression function into a linear function.

New cards
34

Transforming to Odds

First, we transform probabilities to odds.

New cards
35

Proportion of Obtaining Bachelor's

We could calculate the proportion of obtained bachelor degrees for each score on the variable hours studying.

New cards
36

Logistic Function with b0 = 0

Logistic function: 𝑃𝑌= 1 𝑋= 𝑒(𝑏0+𝑏1𝑋) / (1+𝑒(𝑏0+𝑏1𝑋)) = 𝑒𝑏1𝑋 / (1+𝑒𝑏1𝑋).

New cards
37

Logistic Function Examples

Examples of logistic functions with different 𝑏1 and 𝑏0 = 0.

New cards
38

Logit

logit Y = 1 X = ln odds[Y = 1|X]

New cards
39

Natural logarithm

ln is the natural logarithm (logarithm with the base e)

New cards
40

Logistic model

logit Y = 1 X = b0 + b1X

New cards
41

From probability to odds

odds = P / (1 - P)

New cards
42

From odds to probability

P = odds / (1 + odds)

New cards
43

From odds to logit

logit = ln(odds)

New cards
44

From probability to logit

logit = ln P / (1 - P)

New cards
45

From logit to odds

odds = e^logit

New cards
46

From logit to probability

P = e^logit / (1 + e^logit)

New cards
47

Interpretation of Odds

How many times larger is the probability of Y=1 than of Y=0 given X?

New cards
48

Example of Odds Calculation

P(Y=1|X) = 0.10 → odds = 0.10/(1 − 0.10) = 0.10/0.90 = 0.111

New cards
49

Relationship between scales

If the probability increases, the odds and logit also increase, and vice versa.

New cards
50

Probability and Odds Table

Probability (P) | Odds | Logit

New cards
51

Probability 0.5

P = 0.5 → Odds = 1 → Logit = 0

New cards
52

Probability less than 0.5

0 ≤ P < 0.5 → 0 < odds < 1 → Negative Logit

New cards
53

Probability greater than 0.5

0.5 < P ≤ 1 → Odds > 1 → Positive Logit

New cards
54

Logit Interpretation

The logit is a linear function of X.

New cards
55

Graph of Odds

The corresponding graph for b0 = -1 and b1 = 1.5 shows Odds[Y=1|X].

New cards
56

Graph of Logit

The corresponding graph for b0 = -1 and b1 = 1.5 shows Logit[Y=1|X].

New cards
57

Exam Question Example

What are the odds of guessing the correct answer in an exam with four categories?

New cards
58

Odds

odds = 𝑃/(1 −𝑃)

New cards
59

Probability from Odds

𝑃= odds/(1 + odds)

New cards
60

Logit from Odds

logit = ln(odds)

New cards
61

Logit from Probability

logit = ln 𝑃/(1 −𝑃)

New cards
62

Odds from Logit

odds = 𝑒logit

New cards
63

Probability from Logit

𝑃=𝑒logit/(1+𝑒logit)

New cards
64

Logistic Regression Analysis

A statistical method to predict the outcome of a dependent variable based on one or more independent variables.

New cards
65

Independent Variable

Number of hours studied

New cards
66

Dependent Variable

Obtaining (𝑌= 1) or not obtaining (𝑌= 0) the Bachelor's degree in three years

New cards
67

Logit Equation

Logit[obtaining bachelor's degree] = −4.9 + 0.294Hours

New cards
68

Effect of Hours on Logit

If the amount of hours studied increases, the logit - and thus probability - of obtaining the Bachelor's degree also increases.

New cards
69

Numerical Example for Logit

Logit = −4.9 + 0.294 ∙Hours

New cards
70

Effect of Hours on Odds

odds = 𝑒logit = 𝑒−4.9+0.249∙Hours

New cards
71

Odds Increase Factor

When the number of hours studied increases by one, the odds increase by a factor 𝑒0.294 = 1.342.

New cards
72

Wald Test Hypotheses

H0: 𝐵= 0 against H1: 𝐵≠0

New cards
73

Significant Effect of Hours

There is a significant effect of Hours on the probability of obtaining the Bachelor's degree.

New cards
74

Logit Equation for Blood Pressure

Logit(BloodPressure) = -4.2 + 0.07*Age

New cards
75

Probability Calculation

What is the probability that a 40 year-old person has a high blood pressure?

New cards
76

Age Probability Threshold

At what age is the probability that someone has a high blood pressure 0.5?

New cards
77

SPSS Output Example

B: .294, S.E.: .067, Wald: 18.994, df: 1, Sig.: .000, Exp(B): 1.342

New cards
78

Constant in SPSS Output

Constant: -4.900, S.E.: 1.157, Wald: 17.923, df: 1, Sig.: .000, Exp(B): .007

New cards
79

Logit for 15 Hours

Logit = -0.490, Odds = .380, Probability = .380

New cards
80

Logit for 20 Hours

Logit = 0.980, Odds = 2.664, Probability = .727

New cards

Explore top notes

note Note
studied byStudied by 55 people
873 days ago
5.0(1)
note Note
studied byStudied by 8 people
898 days ago
5.0(1)
note Note
studied byStudied by 25 people
805 days ago
5.0(1)
note Note
studied byStudied by 7 people
952 days ago
5.0(1)
note Note
studied byStudied by 26 people
839 days ago
5.0(1)
note Note
studied byStudied by 20 people
705 days ago
5.0(1)
note Note
studied byStudied by 72 people
828 days ago
5.0(1)
note Note
studied byStudied by 259 people
971 days ago
5.0(1)

Explore top flashcards

flashcards Flashcard (41)
studied byStudied by 8 people
138 days ago
5.0(1)
flashcards Flashcard (45)
studied byStudied by 6 people
722 days ago
5.0(2)
flashcards Flashcard (60)
studied byStudied by 15 people
785 days ago
5.0(1)
flashcards Flashcard (148)
studied byStudied by 3 people
819 days ago
5.0(1)
flashcards Flashcard (53)
studied byStudied by 17 people
556 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 2 people
95 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 7 people
740 days ago
4.0(1)
flashcards Flashcard (67)
studied byStudied by 16 people
46 days ago
5.0(1)
robot