Chapter9 ML

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

Unsupervised Learning

A type of machine learning that deals with data sets without labeled responses, aiming to find patterns and relationships in the data.

2
New cards

Clustering

The process of grouping similar instances together into clusters based on feature similarity.

3
New cards

Anomaly Detection

The task of identifying instances that deviate significantly from the normal pattern, often considered outliers.

4
New cards

Density Estimation

The estimation of the probability density function that represents the distribution of the dataset.

5
New cards

K-Means Algorithm

A popular unsupervised learning algorithm that partitions data into k clusters by assigning instances to the nearest cluster centroid.

6
New cards

DBSCAN

A density-based clustering algorithm that can identify clusters of arbitrary shapes by finding dense regions separated by low-density regions.

7
New cards

Gaussian Mixture Model (GMM)

A probabilistic model that assumes that all instances are generated from a mixture of several Gaussian distributions with unknown parameters.

8
New cards

Silhouette Score

A metric used to evaluate the quality of a clustering by measuring how similar an instance is to its own cluster compared to other clusters.

9
New cards

Mini-Batch K-Means

A variant of K-Means that processes small random batches of data rather than the entire dataset at once, improving speed and efficiency.

10
New cards

Label Propagation

A technique in semi-supervised learning where the labels from a small number of labeled instances are spread to the unlabeled instances within the same cluster.

11
New cards

Bayesian Gaussian Mixture Model

A variant of GMM that can automatically adjust the number of clusters based on the data, allowing for more flexible modeling of the data.

12
New cards

Expectations-Maximization Algorithm

An iterative method to find maximum likelihood estimates of parameters in statistical models, commonly used in GMM.

13
New cards

Inertia Metric

A measure of how well the clustering algorithm has performed, calculated as the average squared distance between instances and their closest centroid.

14
New cards

Hierarchical Clustering

A type of clustering that builds a hierarchy of clusters, typically represented as a dendrogram.

15
New cards

One-Class SVM

A variant of Support Vector Machines suited for novelty detection, where the model learns to recognize the normal instances and flags others as anomalies.

16
New cards

Active Learning

An iterative process where the model requests labels for instances from a human expert when it is uncertain about them, improving learning efficiency.

17
New cards

Dimensionality Reduction

The process of reducing the number of features in a dataset, which can improve computational efficiency and help in visualization.

18
New cards

Affinity Propagation

A clustering algorithm that uses message passing between data points to identify exemplars and clusters.

19
New cards

Mean-Shift Clustering

A clustering algorithm that iteratively shifts data points towards the mode (highest density point) of data points in their neighborhood.

20
New cards

Gaussian Radial Basis Function

A function used as a kernel in clustering algorithms, particularly measuring the distance between instances and centroids.

21
New cards

Elbow Method

A heuristic used to determine the optimal number of clusters in K-Means clustering by plotting the inertia against the number of clusters.

22
New cards

Cut-off Distance in Clustering

The threshold distance used to determine which instances in a cluster are labeled when performing semi-supervised learning.