LDA assumptions
multivariate normality and the same covariance matrix for all classes
QDA assumptions
multivariate normality; each class has its own variance/covariance matrix
Gaussian Naive Bayes assumptions
normality; features have different variances but zero covariance (features are independent within each class)
how to check assumptions graphically
QQ plot
side-by-side boxplots
covariance ellipse
perspective and contour plots
scatter plot
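As a minimal sketch (assuming scipy and matplotlib are available, with made-up toy data), two of the graphical checks above: a QQ plot for normality and side-by-side boxplots for comparing spread across groups.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script runs headless
import matplotlib.pyplot as plt
from scipy import stats

rng = np.random.default_rng(0)
group_a = rng.normal(loc=0.0, scale=1.0, size=100)  # toy data, two groups
group_b = rng.normal(loc=1.0, scale=2.0, size=100)  # with different spread

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 4))

# QQ plot: points close to the reference line suggest normality
stats.probplot(group_a, dist="norm", plot=ax1)
ax1.set_title("QQ plot (group A)")

# Side-by-side boxplots: compare location and spread across groups
ax2.boxplot([group_a, group_b])
ax2.set_xticklabels(["A", "B"])
ax2.set_title("Side-by-side boxplots")

fig.savefig("assumption_checks.png")
```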
how to check assumptions with tests
Box's M test
MVN (multivariate normality) test
Kolmogorov-Smirnov test
Shapiro-Wilk test
correlation test
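A minimal sketch (assuming scipy is available, with made-up toy data) of two of the tests above. Box's M and the MVN test are typically run in R (e.g. the biotools and MVN packages); Shapiro-Wilk and Kolmogorov-Smirnov are in scipy.stats.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.normal(size=200)  # toy sample

# Shapiro-Wilk: H0 = the data come from a normal distribution
w_stat, w_p = stats.shapiro(x)

# Kolmogorov-Smirnov against N(0, 1), applied to the standardized sample
z = (x - x.mean()) / x.std(ddof=1)
ks_stat, ks_p = stats.kstest(z, "norm")

print(f"Shapiro-Wilk p = {w_p:.3f}, KS p = {ks_p:.3f}")
```

A large p-value fails to reject normality; it does not prove it.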
LDA bias
can have high bias when the class covariance matrices are actually different
Naive Bayes bias
reduces variance but can have large bias
Naive Bayes posterior probability
a sum of functions of the individual input features, i.e. a GAM (generalized additive model): you essentially add together the effects of each variable
log odds posterior probability LDA
linear in X
log odds posterior probability QDA
quadratic
log odds posterior probability Naive Bayes
Generalized additive model
log odds posterior probability logistic
linear in X
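The log-odds cards above can be summarized in one standard formula; this is a sketch of the two-class Gaussian case, where \(\pi_k\), \(\mu_k\), and \(\Sigma\) denote the class priors, class means, and shared covariance matrix.

```latex
% LDA: with a shared covariance matrix the quadratic terms cancel,
% leaving log odds that are linear in x.
\log\frac{\Pr(Y=1\mid X=x)}{\Pr(Y=2\mid X=x)}
  = \log\frac{\pi_1}{\pi_2}
    - \tfrac{1}{2}(\mu_1+\mu_2)^{\top}\Sigma^{-1}(\mu_1-\mu_2)
    + x^{\top}\Sigma^{-1}(\mu_1-\mu_2)
% QDA: with class-specific covariances \Sigma_k, the term
% -\tfrac{1}{2}\, x^{\top}(\Sigma_1^{-1}-\Sigma_2^{-1})\, x
% does not cancel, so the log odds are quadratic in x.
```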
which two have linear decision boundaries
LDA and multinomial regression (logistic)
When all covariance matrices are the same (the quadratic term is 0), what is LDA a special case of?
QDA
special cases of Naive Bayes
LDA and multinomial
which for continuous predictors
LDA and QDA
categorical predictors
Multinomial Regression, Naive Bayes
high value for LD1 indicates
most group separation happens along that axis
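A minimal sketch (assuming scikit-learn is available, with made-up toy data): fit LDA and project onto the discriminant axes. The axes are ordered so that LD1 carries the largest share of between-group separation.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(2)
# toy data: three classes shifted apart in feature space
X = np.vstack([rng.normal(loc=m, size=(50, 3)) for m in (0.0, 2.0, 4.0)])
y = np.repeat([0, 1, 2], 50)

lda = LinearDiscriminantAnalysis(n_components=2)
scores = lda.fit_transform(X, y)      # columns are LD1 and LD2
print(lda.explained_variance_ratio_)  # LD1's share comes first and is largest
```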
Do you need to specify a k value for Hierarchical clustering
no
why is it called hierarchical clustering?
clusters obtained by cutting the dendrogram at a given height are nested within the clusters obtained by cutting at any greater height.
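The nesting property can be checked directly; a minimal sketch (assuming scipy is available, with made-up toy data): cut the same dendrogram at a finer and a coarser level and verify that every fine cluster sits inside exactly one coarse cluster.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(3)
X = rng.normal(size=(20, 2))  # toy data

Z = linkage(X, method="average")               # build the dendrogram
low = fcluster(Z, t=4, criterion="maxclust")   # finer cut: up to 4 clusters
high = fcluster(Z, t=2, criterion="maxclust")  # coarser cut: up to 2 clusters

# nesting: each fine cluster maps into exactly one coarse cluster
for c in np.unique(low):
    assert len(np.unique(high[low == c])) == 1
print("fine clusters are nested inside coarse clusters")
```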
linkage
dissimilarity between clusters with multiple observations
four types of linkage
average, complete, single, and centroid
preferred linkage
average and complete
what is hierarchical clustering based on
the distance (dissimilarity) matrix, typically for numerical data
large number of variables
k-methods
structure: k-methods vs hierarchical clustering
k-methods are unstructured; hierarchical clustering is more interpretable and informative
in which method is it easier to determine the number of clusters
hierarchical clustering, via the dendrogram
what distinguishes the choice between the two approaches based on prior beliefs
hierarchical clustering may be used to determine the number of clusters
a specific number of clusters is known but group membership is unknown
k-methods
is clustering robust?
no; it is not robust to perturbations of the data
complete linkage
looks at the distances between points in the two clusters and picks the largest one.
single linkage
looks at the distance between points in two clusters and picks the smallest one.
average linkage
calculates the average distance between all pairs of points in the two clusters
centroid linkage
compares the central points of two clusters, ignoring how spread out the individual points are
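The four linkage definitions above can be contrasted on one toy pair of clusters; a minimal sketch (assuming scipy is available): complete takes the largest pairwise distance, single the smallest, average the mean, and centroid the distance between the cluster means.

```python
import numpy as np
from scipy.spatial.distance import cdist

a = np.array([[0.0, 0.0], [1.0, 0.0]])  # toy cluster A
b = np.array([[4.0, 0.0], [6.0, 0.0]])  # toy cluster B

d = cdist(a, b)     # all pairwise distances: 4, 6, 3, 5
complete = d.max()  # largest pairwise distance -> 6.0
single = d.min()    # smallest pairwise distance -> 3.0
average = d.mean()  # mean of all pairs: (4+6+3+5)/4 -> 4.5
centroid = np.linalg.norm(a.mean(axis=0) - b.mean(axis=0))  # |0.5 - 5| -> 4.5
print(complete, single, average, centroid)
```

Note that here average and centroid linkage happen to coincide; in general they differ.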