Alg for Machine Learning Quiz 3 Prep

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/68

There's no tags or description

Looks like no tags are added yet.

Last updated 2:53 PM on 4/7/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

69 Terms

New cards

KNN Algorithm Type

Instance-based

New cards

Euclidean Distance Formula

d = sqr(sum(xi-yi)^2))

New cards

Manhattan Distance Formula

| x1 - x2 | + | y1 - y2 |

New cards

Minkowski Distance Formula

d = (sum(|xi-yi|^p)^(1/p)

New cards

KKN Steps

- Compute distance to all points

- Sort distances

- Select k nearest

- Majority vote based on k nearest

New cards

What is K in KNN?

Number of neighbors used for voting

New cards

Effect of small K in KKN

High Variance & Overfitting

New cards

Effect of large K in KKN

High bias & Underfitting

New cards

What is KNN's main flaw?

The curse of dimensionality

New cards

Curse of Dimensionality

High-dimensional data requires more samples

New cards

Normalization Formula

(x - min) / (max - min)

New cards

Cosine Similarity Formula

cos(theta) = (x * y)/||x||||y||

New cards

Jaccard Similarity Formula

J(A,B) = | A intersection B |/| A union B |

New cards

How to calculate Hamming Distance

Count the number of differing positions

New cards

In what type of sets will KNN perform poorly?

- High dimensional Data

- Large dataset

- Sets with unscaled features

New cards

What is the purpose of Support Vector Machines

Finding the best separating hyperplane / margin between classification classes in a model

New cards

SVM Decision Boundary Formula

(w^T)x + b = 0

New cards

What is the margin in SVM?

The distance between the boundary and closest points

New cards

SVM Margin Formula

2/||w||

New cards

What are the 3 SVM Optimization Formulas

- Hard Margin

- Soft Margin

- Hinge Loss

New cards

Hard Margin Formula

min(1/2)||w||^2

New cards

Soft Margin Formula

(min(1/2)∣∣w∣∣^2) + C∑ξi

New cards

Hinge Loss Formula

L = max(0,1 − y((w^T)x + b))

New cards

What does the parameter C control in SVM?

Margin vs misclassification tradeoff

New cards

What affects can you expect in SVM when parameter C is small?

- Largin margin

- Large amount of errors

New cards

What affects can you expect in SVM when parameter C is large?

- Small margin

- Few errors

New cards

SVM Kernel Trick Purpose

Transforms data to higher dimensions

New cards

SVM Kernel Trick Formula

K(x,z) = ϕ(x) ⋅ ϕ(z)

New cards

Guassian RBF Formula

K(x,l) = exp(−γ∣∣x−l∣∣^2)

New cards

What is the expected outcome of a large Gamma(Y) In Guassian RBF?

Overfitting & Wiggly Boundary

New cards

What is the expected outcome of a small Gamma(Y) In Guassian RBF?

Underfitting & Smooth boundaries

New cards

What kind of algorithm is Naive Bayes

Probabilistic Classifier

New cards

Steps of Naive Bayes

- Compute prior P(c)

- Compute likelihoods

- Multiply

- Choose max

New cards

Gaussian NB Formula

P(x∣c) = (1/sqr(2πσ^2))e^(-(x-y)^2/2σ^2

New cards

What are the assumptions when using Gaussian NB?

- Independent Features

- Gaussian for continuous

New cards

What type of algorithm is SOFTMAX Regression?

Multi-Class Classification

New cards

What algorithm is SOFTMAX Regression a version of?

Logistic Regression

New cards

What does SOFTMAX produce?

- Outputs from range sum to 1

- Probabilities

New cards

What is the assumption when using SOFTMAX?

Classes are mutually exclusive

New cards

SOFTMAX Net Input Z Formula

Z = XW + b

New cards

Cross-Entropy Formula

-sum(ylog(yhat))

New cards

How to calculate total parameters in SOFTMAX Regression?

(features * classes) + classes

New cards

What are odd K's used in KNN

To avoid ties

New cards

What is the most common distance formula in KNN?

Euclidean

New cards

In KNN what happens if one feature has much larger values than others?

The large value dominates the distance

New cards

What technique in KNN is used to reduce dimensionality?

PCA

New cards

What is the angle-based distance metric in KNN?

Cosine

New cards

What property is not required for Minkowski Distance metric?

Linearity

New cards

In the Minkowski Distance formula what value give Manhattan Distance?

New cards

In the Minkowski Distance formula what value give Euclidean Distance?

New cards

What are support vectors ins SVM?

Closest points to boundary that determine the hyperplane

New cards

What does hinge loss penalize?

Points that are inside margin or have been misclassified

New cards

What happens if Naive Bayes encounters a feature value not seen in training?

Probability becomes zero

New cards

Laplace Smoothing

Technique to handle zero probabilities in classification

New cards

What assumption is made in Naive Bayes

Features are independent

New cards

What happens if Naive Bayes assumption is violated?

Accuracy may decrease

New cards

In softmax, what happens if one logit is much larger?

That class gets probability ≈ 1

New cards

What models are most sensitive to unscaled features?

- KNN

- SVM

New cards

What happens when features are not scaled in SVM?

Margin becomes skewed

New cards

What kernel maps to infinite-dimensional space

RBF

New cards

What happens if a probability becomes zero in Naive Bayes?

Entire product becomes zero

New cards

What does Naive Bayes produce?

Multiple probabilities of classes

New cards

What is the distribution assumption of GNB?

Normal Distribution

New cards

What are the required parameters of GNB

- mean

- variance

New cards

What type of features is GNB used on?

Continuous Features

New cards

NB Normalized Term

P(X)

New cards

NB Prior Class

P(Y)

New cards

NB Likelihood

P(X|Y)

New cards

NB Posterior

P(Y|X)