data science

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/12

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:24 PM on 4/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

13 Terms

1
New cards

accuracy

total correct/total predictions

if 95% passengers died, predicting died every time gets 95% accuracy but catches zero survivors

TP+TN / TP + TN+ FP + FN

2
New cards

precision

of everything you predicted as positive/flagged how many actually were positive. use when false positives are costly, like flagging a trade as fraudulent

TP/ TP + FP

3
New cards

recall (sensitivity)

of all actual postivits, how many did you catch. high recall = fewer misses.

use when false negatives are costly, like missing a client or cancer

4
New cards

F1 Score

harmonic mean of precision and recall, use when you need to balance both.

5
New cards

AUC - ROC

measures hwo well the model seperates classes across possible thresholds. 0.5 = random guessing, and 1 = perfect

6
New cards

confusion matrix

the 2×2 table behind everything

7
New cards

RMSE (root mean squared error)

average prediction error in teh same units as the target

penalizes large errors heavily bc of the squaring

big misses are much worse than small ones

8
New cards

MAE mean absolute error

treats all error equally, more robust to outliers than rmse

if rmse is much higher than mae, then you have large outliers in predictions

9
New cards

proportion of vairance explained by the x variables.

1 = perfect, 0 = not good

10
New cards

roc curve

AUC = Area under the curve, 0.5 means the model is gussing randomly and 1 emans perfect

Interpretation of AUC:

  • 1.0: Perfect Classifier.

  • 0.9–0.99: Excellent discrimination.

  • 0.8–0.89: Good discrimination.

  • 0.7–0.79: Fair discrimination.

  • < 0.7: Poor/Weak discrimination.

  • 0.5: No better than random chance.

11
New cards

Bias-variance tradeoff

high bias (underfits), complex model = high variance (overfits)

12
New cards

Regularization

L1 (Lasso) drives coefficients to zero (feature selection), L2 (Ridge) shrinks them (prevents overfitting)

13
New cards

Explore top notes

note
Science Test
Updated 1280d ago
0.0(0)
note
Different Types of Rocks
Updated 1260d ago
0.0(0)
note
AP Human Geography
Updated 112d ago
0.0(0)
note
3.1 Intro to Culture
Updated 128d ago
0.0(0)
note
Chapter Eight: Group Processes
Updated 1140d ago
0.0(0)
note
Period 1, c.1200 to c.1450
Updated 1161d ago
0.0(0)
note
Science Test
Updated 1280d ago
0.0(0)
note
Different Types of Rocks
Updated 1260d ago
0.0(0)
note
AP Human Geography
Updated 112d ago
0.0(0)
note
3.1 Intro to Culture
Updated 128d ago
0.0(0)
note
Chapter Eight: Group Processes
Updated 1140d ago
0.0(0)
note
Period 1, c.1200 to c.1450
Updated 1161d ago
0.0(0)

Explore top flashcards

flashcards
Spanish Set 8
53
Updated 848d ago
0.0(0)
flashcards
Areas of focus - Business
28
Updated 1161d ago
0.0(0)
flashcards
Chemistry - Ions and molecules
57
Updated 409d ago
0.0(0)
flashcards
Earthquakes and Volcanoes
28
Updated 497d ago
0.0(0)
flashcards
E2: ortho practice questions
101
Updated 411d ago
0.0(0)
flashcards
Chapter 1 - Financial Literacy
35
Updated 930d ago
0.0(0)
flashcards
Spanish Set 8
53
Updated 848d ago
0.0(0)
flashcards
Areas of focus - Business
28
Updated 1161d ago
0.0(0)
flashcards
Chemistry - Ions and molecules
57
Updated 409d ago
0.0(0)
flashcards
Earthquakes and Volcanoes
28
Updated 497d ago
0.0(0)
flashcards
E2: ortho practice questions
101
Updated 411d ago
0.0(0)
flashcards
Chapter 1 - Financial Literacy
35
Updated 930d ago
0.0(0)