Biostats - ROC curves and logistic regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/49

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

50 Terms

1
New cards

What does binary logistic regression predict?

The probability of a binary outcome (success=1 or failure=0).

2
New cards

How is the dependent variable different from linear regression?

It is binary, not continuous.

3
New cards

Examples of binary DV responses?

Yes/no, alive/dead, has/doesn't have, increased/decreased.

4
New cards

Why is logistic regression widely used?

Useful for research questions based on dichotomies (medicine, psych, sociology, etc.).

5
New cards

What does logistic regression try to predict mathematically?

p(success) = f(covariates).

6
New cards

What assumption must success/failure categories meet?

They must be mutually exclusive and collectively exhaustive.

7
New cards

How must success/failure be coded in logistic regression?

Failure=0, success=1.

8
New cards

Why can’t linear regression be used to predict probabilities?

Linear regression can produce predictions outside the 0–1 range.

9
New cards

What ensures logistic regression predictions stay between 0 and 1?

A link function.

10
New cards

What link function does binary logistic regression use?

The logit function.

11
New cards

What is a Generalized Linear Model (GLM)?

A regression model that uses a link function.

12
New cards

What does the logit function represent?

The natural log of the odds of success.

13
New cards

Instead of b-coefficients, what do we usually interpret in logistic regression?

Odds ratios (Exp(B)).

14
New cards

What does an odds ratio represent?

Change in odds of success for a 1-unit change in the predictor.

15
New cards

How do you compute an odds ratio from b?

Odds ratio = e^b.

16
New cards

What does H₀ say about the odds ratio?

H₀: Odds ratio = 1 (no effect).

17
New cards

If odds ratio > 1, what does it mean?

Positive relationship; higher covariate increases odds of success.

18
New cards

If odds ratio < 1, what does it mean?

Negative relationship; higher covariate decreases odds of success.

19
New cards

Example: If b = 2.69, what is the odds ratio?

14.73 (e^2.69).

20
New cards

Interpretation of odds ratio = 14.73?

The odds of success are 14.73 times higher for a 1-unit increase in predictor.

21
New cards

How do you compute odds ratio for a non-unit change?

Multiply b by the change size and exponentiate: e^(b * change).

22
New cards

If b = 2.69, what is OR for a 2-unit increase?

e^(5.38) = 217.02.

23
New cards

How do multiple predictors combine in logistic regression?

Their effects multiply on the odds.

24
New cards

How do effects combine in linear regression?

They add on the outcome.

25
New cards

Why is this difference important?

Logistic = multiplicative effects; Linear = additive effects.

26
New cards

What shape do predicted probability curves form?

An S-shaped logistic (sigmoid) curve.

27
New cards

Summary: What does logistic regression do?

Predicts probability of success/failure using a logit link and odds ratios.

28
New cards

What determines classification in logistic regression?

Predicted probability of success.

29
New cards

What is the default cut value for classifying success?

p(success) > .50.

30
New cards

What is a classification table?

A confusion matrix of predicted vs observed outcomes.

31
New cards

What does the classification table show?

Correct successes, correct failures, false positives, false negatives.

32
New cards

Examples of performance measures derived from it?

Sensitivity, specificity, PPV, NPV.

33
New cards

Why can’t you minimize all classification errors?

Reducing false positives usually increases false negatives (trade-off).

34
New cards

What is the Null model in logistic regression?

A model with no predictors.

35
New cards

What does the Null model predict for every case?

The most common outcome category.

36
New cards

Why does it always predict the largest category?

It minimizes overall classification error without predictors.

37
New cards

Why are rare events a problem in logistic regression?

The Null model predicts “does not occur” and is mostly correct, leaving little room for improvement.

38
New cards

How do you decide which covariates to include?

Check if success/fail groups differ on continuous variables (mean differences) or categorical variables (chi-square differences).

39
New cards

How do you expand a logistic model?

Add predictors and compare classification tables.

40
New cards

Why do we need ROC curves?

They show model performance for different cut values (not just .50).

41
New cards

What is an ROC curve?

A plot of sensitivity (TPR) vs 1–specificity (FPR) across cut values.

42
New cards

Where did ROC curves originate?

WWII radar detection.

43
New cards

What does sensitivity represent?

True positive rate.

44
New cards

What does specificity represent?

True negative rate.

45
New cards

What happens with a low cut value?

High sensitivity, more false positives.

46
New cards

What happens with a high cut value?

High specificity, more false negatives.

47
New cards

What is the purpose of comparing ROC curves?

To see which model achieves better sensitivity-specificity trade-offs.

48
New cards

What is the AUC (Area Under the Curve)?

An overall measure of diagnostic accuracy.

49
New cards

What does AUC close to 1 mean?

Model distinguishes successes from failures very well.

50
New cards

What does AUC = .50 mean?

Model is no better than chance.