ML Exam 2 : Module 15 - Cluster Validation

0.0(0)
Studied by 1 person
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/12

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 5:30 AM on 4/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

13 Terms

1
New cards

Clustering Tendency

Before running a clustering algorithm, evaluate whether a data set has a cluster-like structure, using statistical tests for spatial randomness for data in Euclidean space ( low dim )

2
New cards

Hopkins Statistic

A method used to measure clustering tendency by checking to see if the data is more clustered than random. M points are randomly generated in the data space, then m real points are chosen from the data. Them for each random/real point, find the distance to the nearest respective data point.

3
New cards

Similarity Matrix

A matrix used to cluster data and sort points according to their respective cluster assignments.

4
New cards

Internal Measures

A way to evaluate how effective a clustering is without using any external labels (unsupervised validation); the quality of the data is solely evaluated based on the data and the cluster assignments by checking how compact the clusters are and how separated individual clusters are from one another.

5
New cards

Cluster Cohesion

How closely related are objects within a cluster (WSS).

6
New cards

Cluster Separation

How distinct a cluster is from other clusters (BSS).

7
New cards

Total Sum of Squares

A measurement that evaluates the overall spread of data around the global centroid by summing WSS and BSS.

8
New cards

Silohouette Coefficient

An internal measure that combines cohesion and separation metrics to evaluate how well each point fits within its assigned cluster.

9
New cards

Density Based Cluster Validation (DBCV)

A cluster validity index used for DBSCAN; the sparsetest part inside a cluster should always be denser than the densest region between clusters. A high DBCV value indicates that clusters are properly separated by low-density regions.

10
New cards

Cophenetic Distance

The proximity at which the agglomerative clustering put them in the same cluster

11
New cards

Cophenetic Correlation Coefficient (CPCC)

Correlation between cophenetic distance matrix and the proximity matrix of the original data points.

12
New cards

External Validation

Methods to evaluate clustering when the class labels are available by comparing cluster assignments to the true class labels. Impurity, precision, recall, and F-measure are used as classification measures.

13
New cards

Relative Cluster Validation

Comparing clustering results collectively by using a validity measure to compare two or more clustering solutions to decide which is better. Examples include comparing different clustering algos, choosing the best number of clusters, comparing two specific clusters, and evaluating individual points.

Explore top notes

note
Ap Human Georgaphy
Updated 1064d ago
0.0(0)
note
Summary: Arctic and Antartic
Updated 1225d ago
0.0(0)
note
Chp 15: Delivery
Updated 1183d ago
0.0(0)
note
Unit 4 - Chapter 16
Updated 916d ago
0.0(0)
note
Microbiomes
Updated 1336d ago
0.0(0)
note
IB PHYSICS Option D: Astrophysics
Updated 598d ago
0.0(0)
note
Ap Human Georgaphy
Updated 1064d ago
0.0(0)
note
Summary: Arctic and Antartic
Updated 1225d ago
0.0(0)
note
Chp 15: Delivery
Updated 1183d ago
0.0(0)
note
Unit 4 - Chapter 16
Updated 916d ago
0.0(0)
note
Microbiomes
Updated 1336d ago
0.0(0)
note
IB PHYSICS Option D: Astrophysics
Updated 598d ago
0.0(0)

Explore top flashcards

flashcards
HP - Muscle groups
28
Updated 782d ago
0.0(0)
flashcards
Cells and Cell Functions
32
Updated 1298d ago
0.0(0)
flashcards
Circulatory System
37
Updated 1059d ago
0.0(0)
flashcards
Geography 2
91
Updated 386d ago
0.0(0)
flashcards
EM E2: Infectious Disease
87
Updated 342d ago
0.0(0)
flashcards
Geo5 Final
132
Updated 1219d ago
0.0(0)
flashcards
HP - Muscle groups
28
Updated 782d ago
0.0(0)
flashcards
Cells and Cell Functions
32
Updated 1298d ago
0.0(0)
flashcards
Circulatory System
37
Updated 1059d ago
0.0(0)
flashcards
Geography 2
91
Updated 386d ago
0.0(0)
flashcards
EM E2: Infectious Disease
87
Updated 342d ago
0.0(0)
flashcards
Geo5 Final
132
Updated 1219d ago
0.0(0)