Clustering Methods

Description and Tags

Flashcards covering the key terms and concepts related to clustering methods discussed in the lecture.

15 Terms

1

Dendrogram

A tree diagram that shows how items are related, often used to display the results of hierarchical clustering.

2

Clustering

A method of finding natural groupings in data to see how items relate to each other. Samples within a group are more similar to each other than to samples in other groups.

3

Supervised Method

A method in which the groups are predefined, such as discriminant function analysis.

4

Unsupervised Method

A method in which the groups are not predefined but emerge from the data, such as cluster analysis.

5

Agglomerative

A clustering approach that builds up from individual items, joining sites or organisms step by step to form a tree.

6

Divisive

A clustering approach that starts with one big group containing every item and repeatedly splits it into smaller groups.

7

Hierarchical Clustering

A method that proceeds step-by-step, where once an item is in a group, it stays in that group.

8

Non-Hierarchical Clustering

A method that allows items to switch groups during the process, because group membership is re-evaluated at each iteration.

9

UPGMA

Unweighted Pair Group Method with Arithmetic Mean; a commonly used method in hierarchical agglomerative cluster analysis.

10

Single Linkage

Defines the distance between two clusters as the smallest dissimilarity between any pair of their members; can produce elongated dendrograms and chained clusters.

11

Complete Linkage

Defines the distance between two clusters as the largest dissimilarity between any pair of their members; sensitive to outliers.
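
A minimal sketch of how single and complete linkage differ when measuring the dissimilarity between two clusters. The names `cluster_distance` and `abs_diff` and the sample values are illustrative assumptions, not part of the lecture.

```python
from itertools import product


def abs_diff(x, y):
    """Illustrative dissimilarity for one-dimensional values."""
    return abs(x - y)


def cluster_distance(a, b, dist, linkage="single"):
    """Dissimilarity between clusters a and b under a linkage rule."""
    # All dissimilarities between one member of a and one member of b.
    pairwise = [dist(x, y) for x, y in product(a, b)]
    if linkage == "single":
        return min(pairwise)  # closest pair decides
    if linkage == "complete":
        return max(pairwise)  # farthest pair decides
    raise ValueError(f"unknown linkage: {linkage}")


a = [1.0, 2.0]
b = [4.0, 9.0]  # 9.0 acts as an outlier in cluster b
print(cluster_distance(a, b, abs_diff, "single"))    # 2.0
print(cluster_distance(a, b, abs_diff, "complete"))  # 8.0
```

In this toy case the outlier 9.0 leaves the single-linkage value unchanged but sets the complete-linkage value, which illustrates the sensitivity to outliers noted on the card.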

12

Proportional Averaging

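A method used in unweighted pair-group clustering (such as UPGMA) to calculate the average distance between clusters: the summed distances are divided by the number of items in the clusters being compared, so larger clusters contribute proportionally more.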
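
One way to read this card, sketched under assumptions: when clusters A and B merge, the distance from the merged cluster to a third cluster C is the average of the two old distances, weighted by the number of items in A and B. The function name `upgma_update` and the numbers are hypothetical, chosen only to show the arithmetic.

```python
def upgma_update(d_ac, d_bc, n_a, n_b):
    """Distance from the merged cluster (A and B) to C.

    Each old distance is weighted by the size of its cluster, then the
    sum is divided by the total number of items in A and B.
    """
    return (n_a * d_ac + n_b * d_bc) / (n_a + n_b)


# A has 3 items and lies 2.0 from C; B has 1 item and lies 6.0 from C.
print(upgma_update(2.0, 6.0, n_a=3, n_b=1))  # 3.0
# A plain average of the two distances, ignoring cluster sizes, would be 4.0.
print((2.0 + 6.0) / 2)  # 4.0
```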

13

K-means Clustering

A non-hierarchical method in which K, the number of clusters, is defined beforehand.
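
A minimal sketch of K-means using the standard assign-then-update loop (Lloyd's algorithm), assuming Euclidean distance and starting centroids supplied by the caller. The function name `kmeans` and the sample points are illustrative, not taken from the lecture.

```python
def kmeans(points, initial_centroids, max_iter=100):
    """Cluster a list of equal-length tuples.

    K is fixed beforehand by the number of initial centroids.
    """
    def sq_dist(p, q):
        return sum((a - b) ** 2 for a, b in zip(p, q))

    centroids = list(initial_centroids)  # copy so the caller's list is untouched
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid,
        # so items can switch groups between iterations.
        new_labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:  # no point changed group: converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
labels, centroids = kmeans(points, [(0, 0), (10, 10)])  # K = 2
print(labels)  # [0, 0, 0, 1, 1, 1]
```

The result can depend on the starting centroids, so in practice the procedure is often repeated from several starting points.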

14

Elbow Plot

A plot used to determine the optimum number of clusters: the within-cluster sum of squares is plotted against the number of clusters, and the 'elbow' where the curve flattens marks the number to use.
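
A small sketch of the quantity on the vertical axis of an elbow plot, assuming it is the sum of squared distances from each point to its cluster mean (the within-cluster sum of squares). The function name `within_ss`, the data, and the hand-picked partitions are hypothetical.

```python
def within_ss(points, labels):
    """Sum of squared distances from each point to its cluster mean."""
    total = 0.0
    for k in set(labels):
        members = [p for p, lab in zip(points, labels) if lab == k]
        mean = [sum(c) / len(members) for c in zip(*members)]
        total += sum((x - m) ** 2 for p in members for x, m in zip(p, mean))
    return total


points = [(1.0,), (2.0,), (9.0,), (10.0,)]
# Hand-picked partitions for K = 1, 2, 3 (not the output of a clustering run).
partitions = {1: [0, 0, 0, 0], 2: [0, 0, 1, 1], 3: [0, 0, 1, 2]}
for k, labels in partitions.items():
    print(k, within_ss(points, labels))
# 1 65.0
# 2 1.0
# 3 0.5
```

The value drops sharply from K = 1 to K = 2 and barely changes afterwards, so the elbow for this toy data sits at K = 2.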

15

Calinski-Harabasz Criterion

A method that uses the ratio of between-cluster variance to within-cluster variance to determine the number of clusters; a higher ratio indicates better-separated clusters.
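
A hedged sketch of this ratio in its usual form: the between-cluster sum of squares divided by K - 1, over the within-cluster sum of squares divided by n - K, for n points in K clusters. The function name `calinski_harabasz`, the data, and the hand-picked partitions are illustrative assumptions.

```python
def calinski_harabasz(points, labels):
    """Ratio of between-cluster to within-cluster variance.

    Assumes more than one cluster, fewer clusters than points,
    and a non-zero within-cluster sum of squares.
    """
    n = len(points)
    clusters = sorted(set(labels))
    k = len(clusters)
    grand_mean = [sum(c) / n for c in zip(*points)]
    between = 0.0
    within = 0.0
    for c in clusters:
        members = [p for p, lab in zip(points, labels) if lab == c]
        mean = [sum(col) / len(members) for col in zip(*members)]
        # Between: squared distance of the cluster mean from the grand mean,
        # weighted by cluster size.
        between += len(members) * sum((m - g) ** 2 for m, g in zip(mean, grand_mean))
        # Within: squared distance of each member from its cluster mean.
        within += sum((x - m) ** 2 for p in members for x, m in zip(p, mean))
    return (between / (k - 1)) / (within / (n - k))


points = [(1.0,), (2.0,), (9.0,), (10.0,)]
print(calinski_harabasz(points, [0, 0, 1, 1]))  # 128.0
print(calinski_harabasz(points, [0, 0, 1, 2]))  # 64.5
```

For this toy data the two-cluster partition scores higher, so the criterion favors K = 2.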