CPSC 392 E1

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/37

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

38 Terms

1
New cards

Positive

Observation is positive

2
New cards

negative

observation is negative

3
New cards

True Postives

observation +, prediction +

4
New cards

False Postive

observation -, preditction +

5
New cards

True Negative

observation -, prediction -

6
New cards

false negative

observation +, predicition -

7
New cards

Accuracy

How close is predicitoin to the actual value

8
New cards

Accuracy Formula

(TN+TP)/N

9
New cards

Precision

How many selected values are relevant

10
New cards

Precision Formula

TP/(TP+FP)

11
New cards

Recall (Sensitivity)

How many relevant items are selected

12
New cards

Recall/Sensitivity/TPR Formula

TP/(TP+FN)

13
New cards

F1 Score Formula

2* ((recall*precision)/(recall+precision))

14
New cards

F1 Score

evaluates both precision and recall

15
New cards

Normalization

x-min(x)/max(x)-min(x)

16
New cards

Standardization

x-mean/Standard deviation

17
New cards

Overfit

When the model is too complex for the data given, resulting in reaching for correct values, think kNN but the dividing line is too specific

18
New cards

Underfit

when the model is too simple for the data given, resulting in undershooting the values. EX: using a linear model for non-linear data

19
New cards

Two ways of removing outliers

Removal, iterative Removal

20
New cards

Removal (Outliers)

Simply removing outliers

21
New cards

Iterative Removal (Outliers)

Removing and replacing outliers with values found from a model

22
New cards

2 ways of handling Missing Data

Deletion, imputation

23
New cards

deletion

Listwise, Pairwise, Variable Dropping

24
New cards

Imputation

LOCF, NOCB, interpolation, Extrapolation

25
New cards

Category Encoding

Giving categorical variables numeric values (Binary, Target-Based)

26
New cards

Binary Enconding

giving categorical variables a corresponding number (male 0, female 1)

27
New cards

Target Based Enconding

Binary encoding but finding the proportions

28
New cards

Supervised

Model is given a data set with labels

29
New cards

Unsupervised

Working with data without labels

30
New cards

Classification

Supervised Machine learning dealing with categorical value

31
New cards

Regression

Supervised Machine learning dealing with numerical value

32
New cards

algortithm

a methodical, logical rule or procedure that guarantees solving a particular problem. Contrasts with speedier heuristics

33
New cards

hyperparameter

the parameters that changes how the model deals with the normal parameters

34
New cards

FPR - False Positive Rate

FP/(FP+TN)

35
New cards

ROC

Graphs the different threshold points to see which threshold gives the most accurate data. y-axis: TPR, x-axis: FPR

36
New cards

AUC

area under the curve, used to determine which model's are more accurate. Whichever has the higher AUC value is more accurate

37
New cards

KNN

distance based classification to see which neighbors are closest, then out of the neighbors whichever is a majority one class is the class of the value

38
New cards

linear regression

a statistical method used to fit a linear model to a given data set