CS 441: Applied Machine Learning - Final Review

1
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

True or false: With different sets of M test samples, we would probably get the same error measurement.

False. There would be some variance in the error measurement.

2
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

True or false: If we increase M, we should get a more accurate (lower variance) estimate of the error.

True. Increasing M decreases the variance of the estimate.
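
This is easy to simulate. The sketch below (illustrative, not from the lecture notes) treats each test prediction as a Bernoulli trial with a hypothetical true error rate of 0.2 and shows the spread of the error estimate shrinking as M grows.

```python
# A minimal sketch: larger test sets give lower-variance error estimates.
import numpy as np

rng = np.random.default_rng(0)
true_error = 0.2  # hypothetical true error rate of a fixed classifier

for M in (50, 500, 5000):
    # Each trial draws M test points; each point is misclassified w.p. true_error.
    estimates = rng.binomial(M, true_error, size=1000) / M
    print(f"M={M:5d}  mean={estimates.mean():.3f}  std={estimates.std():.4f}")
```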

3
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

True or false: If we increase N (training size) but do not change M, we'd expect the test error to be unchanged.

False. Test error should generally go down because more training samples help better fit the model.

4
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

True or false: The expected error does not depend on M, but it does depend on N.

True. M only affects the variance of the error estimate; the expected error is a property of the trained model, which depends on the training size N.

5
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

Which assumptions are implied by using Euclidean (L2) distance for K-NN?

(a, b) Each feature dimension is equally important, and feature dimensions have comparable scales.
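
A minimal sketch, assuming scikit-learn is available, of the usual fix when scales differ: standardize the features before applying L2-based K-NN. The toy numbers are made up for illustration.

```python
# Standardizing features so Euclidean distance is not dominated by large-scale dimensions.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier

X = np.array([[1.0, 2000.0], [2.0, 2100.0], [1.5, 100.0], [2.5, 150.0]])
y = np.array([0, 0, 1, 1])

scaler = StandardScaler().fit(X)           # per-feature mean/std
knn = KNeighborsClassifier(n_neighbors=1)  # plain L2 distance by default
knn.fit(scaler.transform(X), y)
print(knn.predict(scaler.transform([[1.8, 120.0]])))  # -> [1], both features now matter
```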

6
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

Classify the '+' with 1-NN. ('o' or 'x'?)

'x'

7
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

Classify the '+' with 3-NN. ('o' or 'x'?)

'o'

8
New cards

Lecture 2-3: K-NN, Classification, Regression, and Data

Which of these are true of nearest neighbor? (choose all that apply)

Options: (a) Fast inference, (b) Fast training, (c) Can be applied if only one sample per class is available, (d) Not commonly used in practice, (e) Most powerful with feature learning

(b, c, e) Fast training, Can be applied if only one sample per class, Most powerful with feature learning.
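
As a reminder of why training is fast, here is a minimal 1-NN sketch in plain NumPy: "training" is just storing the data, and prediction works even with a single labeled example per class. The toy points are made up.

```python
import numpy as np

def nn_predict(X_train, y_train, X_query):
    # Squared L2 distance from every query point to every training point.
    d2 = ((X_query[:, None, :] - X_train[None, :, :]) ** 2).sum(-1)
    return y_train[d2.argmin(axis=1)]

X_train = np.array([[0.0, 0.0], [5.0, 5.0]])   # one sample per class
y_train = np.array(['o', 'x'])
print(nn_predict(X_train, y_train, np.array([[4.0, 4.5]])))  # -> ['x']
```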

9
New cards

Lecture 4: Clustering and Retrieval

True or false: K-means assigns each point to the nearest of the established K centers.

True.

10
New cards

Lecture 4: Clustering and Retrieval

True or false: A very structured distribution of points can make K-means not converge.

False. K-means always converges.
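
A rough Lloyd's-algorithm sketch in NumPy (no empty-cluster handling): each assignment and update step can only decrease the within-cluster sum of squares, which is why K-means always converges to a local optimum.

```python
import numpy as np

def kmeans(X, k, iters=10, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), k, replace=False)]    # initialize from data points
    for _ in range(iters):
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d2.argmin(axis=1)                       # assignment step: nearest center
        centers = np.array([X[labels == j].mean(axis=0)  # update step: cluster means
                            for j in range(k)])          # (assumes no cluster goes empty)
        print("within-cluster SSE:", round(d2.min(axis=1).sum(), 2))  # non-increasing
    return centers, labels

X = np.vstack([np.random.default_rng(1).normal(m, 0.3, size=(50, 2)) for m in (0, 3)])
centers, labels = kmeans(X, k=2)
```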

11
New cards

Lecture 4: Clustering and Retrieval

True or false: High-dimensional data points cause K-means to iterate more times before a good clustering.

False. Number of iterations depends on #clusters and #points, not directly on dimension.

12
New cards

Lecture 4: Clustering and Retrieval

True or false: High-dimensional data increases computational cost and people often stop K-means early.

True.

13
New cards

Lecture 4: Clustering and Retrieval

True or false: K-means is deterministic but sensitive to initialization.

True. Given a fixed initialization the updates are deterministic, but different initializations can produce different clusterings.

14
New cards

Lecture 4: Clustering and Retrieval

True or false: If we don't know much about the data, people often choose K based on memory or computational limits.

True.

15
New cards

Lecture 4: Clustering and Retrieval

True or false: Clustering methods like K-means and hierarchical K-means are sensitive to local connectivity.

False. K-means and hierarchical K-means assign points by distance to cluster centers and ignore local connectivity (unlike, e.g., agglomerative clustering with single linkage).

16
New cards

Lecture 4: Clustering and Retrieval

True or false: If some attributes are more important, standard K-means can still yield a good clustering without adjustments.

False. K-means treats all features equally unless we use weighting.

17
New cards

Lecture 4: Clustering and Retrieval

True or false: One big advantage of hierarchical K-means is computational efficiency.

True.

18
New cards

Lecture 4: Clustering and Retrieval

True or false: Agglomerative clustering can be sensitive to local connectivity with a good choice of linkage.

True.

19
New cards

Lecture 4: Clustering and Retrieval

True or false: LSH idea is used where an approximate nearest neighbor is acceptable.

True.

20
New cards

Lecture 4: Clustering and Retrieval

If you have continuous-valued feature vectors and want to group them, how do clustering methods help?

Clustering maps each continuous feature vector to a discrete cluster ID based on similarity, so vectors with similar attributes end up in the same group.

21
New cards

Lecture 4: Clustering and Retrieval

For a group of pictures, what attributes could produce a good clustering?

Mixed attributes (discrete like "contains humans?" or "landscape type" and continuous like brightness or texture density) can be used.

22
New cards

Lecture 4: Clustering and Retrieval

If you used two clustering algorithms on unlabeled data, how do you compare results?

Label a subset and compute purity. Compare which clustering yields higher purity.
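
A minimal purity sketch, assuming we have hand-labeled a small subset: for each cluster, count its most common true label, then divide the total by the subset size. The toy labels are made up.

```python
import numpy as np

def purity(cluster_ids, true_labels):
    total = 0
    for c in np.unique(cluster_ids):
        labels_in_c = true_labels[cluster_ids == c]
        total += np.bincount(labels_in_c).max()   # size of the majority label in this cluster
    return total / len(true_labels)

cluster_ids = np.array([0, 0, 0, 1, 1, 1])
true_labels = np.array([0, 0, 1, 1, 1, 1])
print(purity(cluster_ids, true_labels))  # 5/6 ~= 0.83
```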

23
New cards

Lecture 4: Clustering and Retrieval

Which distance measure for "Each dimension same scale, dominated by large differences"? (L2, L1, Mahalanobis)

L2

24
New cards

Lecture 4: Clustering and Retrieval

Which distance measure for "Each dimension same scale, sensitive to sum of absolute differences"?

L1
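
A small NumPy sketch contrasting the three distance measures on one made-up pair of points; the covariance matrix used for Mahalanobis is purely illustrative.

```python
import numpy as np

x, y = np.array([1.0, 10.0]), np.array([2.0, 13.0])
diff = x - y
print("L2:", np.sqrt((diff ** 2).sum()))          # dominated by the largest differences
print("L1:", np.abs(diff).sum())                  # sum of absolute differences
cov = np.array([[1.0, 0.0], [0.0, 9.0]])          # dimension 2 has 9x the variance
print("Mahalanobis:", np.sqrt(diff @ np.linalg.inv(cov) @ diff))  # rescales per dimension
```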

25
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: In dimensionality reduction, points in lower dimension should preserve some relationship from original dimension.

True.

26
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: PCA eigenvectors can be imaginary, making PCA useless.

False. Eigenvectors of a real symmetric covariance matrix are real.

27
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: PCA eigenvectors capture discriminative features.

False. PCA captures directions of maximum variance, not necessarily discriminative features.

28
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: PCA components may have qualitative significance (e.g., eigenfaces).

True.

29
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: The largest PCA components are always most important.

False. The largest components capture the most variance, but whether they are most "important" depends on the application; discriminative information may lie in smaller components.

30
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: Non-linear embedding methods focus on relationships even if reconstruction is impossible.

True.

31
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: MDS preserves pairwise distances with a user-defined metric.

True.

32
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: MDS always works even if no proper distance metric is defined.

False. If pairwise relationships don't satisfy metric properties, non-metric MDS is needed.

33
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: ISOMAP defines a unique graph structure.

False. The graph construction depends on user choices.

34
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: t-SNE minimizes KL divergence to preserve local structure.

True.

35
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

True or false: UMAP is computationally less expensive and widely used.

True.

36
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

How to choose the number of components in PCA?

Consider cumulative explained variance and choose K where adding more components adds little variance.
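
A minimal sketch, assuming scikit-learn: fit PCA, look at the cumulative explained-variance curve, and pick the smallest K that reaches a chosen threshold (95% here, an arbitrary choice).

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10)) @ rng.normal(size=(10, 10))  # correlated features

pca = PCA().fit(X)
cumvar = np.cumsum(pca.explained_variance_ratio_)
k = int(np.searchsorted(cumvar, 0.95)) + 1   # smallest K with >= 95% cumulative variance
print(cumvar.round(3), "->", k, "components")
```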

37
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

Why use PCA before MDS in high dimension? (2 reasons)

(1) In high dimensions pairwise distances concentrate and become nearly uniform, so reducing dimension with PCA first makes the distances more informative for MDS; (2) it reduces computational cost.

38
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

Why does MDS have an S-shape similar to the original data's shape?

MDS preserves global structure, thus retaining the original S-shape.

39
New cards

Lecture 5: Dimensionality Reduction: PCA and Low-D Embeddings

Why might t-SNE not preserve an S-shape?

t-SNE focuses on local structure, not global shape.

40
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

How do L1 and L2 regularization complement each other?

L1 induces sparsity (feature selection); L2 keeps weights small. Combined (elastic net) gives both benefits.
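
A minimal sketch, assuming scikit-learn: on made-up data with only two informative features, Lasso (L1) zeroes out the irrelevant weights, Ridge (L2) shrinks all weights, and ElasticNet mixes the two via l1_ratio.

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge, ElasticNet

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(scale=0.1, size=100)  # only 2 useful features

for model in (Lasso(alpha=0.1), Ridge(alpha=1.0), ElasticNet(alpha=0.1, l1_ratio=0.5)):
    print(type(model).__name__, model.fit(X, y).coef_.round(2))
```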

41
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Predict likelihood a stock is overvalued (binary) → Logistic or Linear?

Logistic Regression.

42
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Predict future earnings (continuous) → Logistic or Linear?

Linear Regression.

43
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Predict category: drastic/mild/light decrease in price → Logistic or Linear?

Logistic Regression.

44
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Someone says we can't use linear regression if data isn't linearly related. Are they right?

No. We can use transformations (e.g., polynomial features) to linearize relationships.
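
A minimal sketch, assuming scikit-learn: fitting y ≈ x² with "linear" regression by first mapping x to polynomial features.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

x = np.linspace(-3, 3, 50).reshape(-1, 1)
y = x.ravel() ** 2 + np.random.default_rng(0).normal(scale=0.1, size=50)

model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
model.fit(x, y)
print(model.predict([[2.0]]))  # close to 4
```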

45
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

True or false: One hyperparameter tuning method is transforming variables so training data optimizes hyperparameters directly.

False. Hyperparameters are set outside of training (e.g., chosen with a validation set or cross-validation); they are not optimized directly by transforming the training data.

46
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

True or false: Cross-validation splits training data to measure hyperparameter performance.

True.
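
A minimal sketch, assuming scikit-learn: GridSearchCV splits the training data into folds and scores each candidate value of a hyperparameter (Ridge's alpha here, with an arbitrary grid).

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 8))
y = X @ rng.normal(size=8) + rng.normal(scale=0.5, size=120)

search = GridSearchCV(Ridge(), {"alpha": [0.01, 0.1, 1.0, 10.0]}, cv=5)
search.fit(X, y)
print(search.best_params_, round(search.best_score_, 3))
```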

47
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

True or false: With sufficient data, no need for regularization.

False. Regularization can still help avoid large weights and overfitting.

48
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Define outlier and how it affects linear regression.

An outlier is a point that deviates strongly from the pattern of the rest of the data. Because squared error penalizes large residuals heavily, an outlier can pull the regression line toward itself, increasing the error on most other points.

49
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

SVMs are more explainable than neural nets. True or false?

True.

50
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

The dual SVM representation shows optimal parameters as a non-linear combo of examples. True or false?

False. They are a linear combination of support vectors.

51
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Training SVM involves minimizing margin for better generalization. True or false?

False. We maximize the margin.

52
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Unlike SVM, logistic regression adds non-zero penalty for all points. True or false?

True.

53
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Hinge loss increases quadratically for misclassified points. True or false?

False. Hinge loss increases linearly beyond the margin.

54
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Representer theorem: it is impossible to represent the optimal parameters as a linear combination of the training data. True or false?

False. The representer theorem states that the optimal parameters can be written as a linear combination of the training examples.

55
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Kernels in SVM enable feature mapping without explicit transformations. True or false?

True.

56
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Soft margin SVM tolerates some misclassifications. True or false?

True.

57
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Soft margin always hinders generalization. True or false?

False. Allowing a soft margin can improve generalization.

58
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

RBF SVM good when one class forms ellipsoid cluster and other outside it. True or false?

True.
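
A minimal sketch, assuming scikit-learn: on a made-up ring-shaped dataset (one class inside, one outside), a linear SVM fails while an RBF-kernel SVM separates the classes.

```python
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=300, factor=0.4, noise=0.05, random_state=0)
for kernel in ("linear", "rbf"):
    clf = SVC(kernel=kernel, C=1.0).fit(X, y)
    print(kernel, "training accuracy:", round(clf.score(X, y), 2))
# the linear SVM cannot separate the inner circle; the RBF SVM gets ~1.0
```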

59
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Removing a support vector can affect margin and boundary. True or false?

True.

60
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Why don't SVMs depend on the whole dataset? Advantages?

Only the support vectors determine the decision boundary, so the solution is compact and insensitive to points far from the margin, which helps stability and generalization.

61
New cards

Lecture 6 & 7: Linear Regression, Regularization, Logistic Regression, SVM

Why is the logistic regression boundary often farther from dense clusters than SVM's?

Logistic regression uses all points in the loss, pushing the boundary to reduce errors even far away, unlike SVM focusing on support vectors.

62
New cards

Lecture 8: Probability and Naive Bayes

Naive Bayes assumption with two features x1, x2: which is true?

(a) P(y|x1,x2)=P(y|x1)*P(y|x2)

(b) P(x1,x2|y)=P(x1|y)*P(x2|y)

(b)

63
New cards

Lecture 8: Probability and Naive Bayes

Which are true for Naive Bayes?

(a) The likelihood must model many features jointly.

(b) A continuous feature can be modeled as a Gaussian or discretized.

(c) NB underperforms NN due to its strong independence assumptions.

(d) NB is relatively fast to train and predict.

(b, d) A continuous feature can be modeled as a Gaussian or discretized, and NB is fast to train and predict.
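
A minimal sketch, assuming scikit-learn: GaussianNB models each continuous feature with a per-class Gaussian, and both fitting and prediction are fast. The toy data are made up.

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

rng = np.random.default_rng(0)
X0 = rng.normal(loc=0.0, scale=1.0, size=(100, 2))   # class 0
X1 = rng.normal(loc=3.0, scale=1.0, size=(100, 2))   # class 1
X = np.vstack([X0, X1])
y = np.array([0] * 100 + [1] * 100)

nb = GaussianNB().fit(X, y)
print(nb.predict([[0.2, -0.1], [2.8, 3.1]]))  # -> [0 1]
```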

64
New cards

Lecture 8: Probability and Naive Bayes

True or false: P(a|b)=P(b|a)

False.

65
New cards

Lecture 8: Probability and Naive Bayes

True or false: P(a,b)=P(b,a)

True.

66
New cards

Lecture 8: Probability and Naive Bayes

Check if a and b are independent given:

P(a=0,b=0)=0.12, P(a=0,b=1)=0.08, P(a=1,b=0)=0.48, P(a=1,b=1)=0.32

They are independent: the marginals are P(a=0)=0.2, P(a=1)=0.8, P(b=0)=0.6, P(b=1)=0.4, and every joint entry equals the product of its marginals (e.g., 0.2 × 0.6 = 0.12).
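
The factorization can also be checked numerically (a small NumPy sketch using the numbers from the card):

```python
import numpy as np

joint = np.array([[0.12, 0.08],    # rows: a=0, a=1; columns: b=0, b=1
                  [0.48, 0.32]])
p_a = joint.sum(axis=1)            # [0.2, 0.8]
p_b = joint.sum(axis=0)            # [0.6, 0.4]
print(np.allclose(joint, np.outer(p_a, p_b)))  # True -> a and b are independent
```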

67
New cards

Lecture 8: Probability and Naive Bayes

True or false: If x1 is independent of x2, then they are conditionally independent given y.

False. Marginal independence does not imply conditional independence given y (nor the reverse).

68
New cards

Lecture 8: Probability and Naive Bayes

According to Bayes rule, P(y|x)=?

P(y|x)=P(x|y)*P(y)/P(x)

69
New cards

Lecture 8: Probability and Naive Bayes

Which transformations preserve argmax of f(x)?

(a) Add a constant, (b) take the log, (c) take the exp, or (d) take 1/f(x)?

(a, b, c) Adding a constant, taking the log (of a positive f), and taking the exp all preserve the argmax, since each is an increasing transformation; 1/f(x) reverses the ordering for positive f.
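
A quick numeric check of which transforms keep the argmax, using a made-up positive f:

```python
import numpy as np

f = np.array([0.1, 0.5, 0.3, 0.1])                 # e.g., unnormalized posteriors
print(np.argmax(f), np.argmax(f + 7),              # adding a constant: same argmax
      np.argmax(np.log(f)), np.argmax(np.exp(f)),  # log/exp are increasing: same argmax
      np.argmax(1 / f))                            # 1/f flips the order: different
```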

70
New cards

Lecture 8: Probability and Naive Bayes

True or false: Without a prior, it's possible P(x|y)*P(y)=0 for all y.

True.

71
New cards

Lecture 9: EM and Latent Variables

True or false: LSH random projection leads to sparse keys in high-dim data.

False.

72
New cards

Lecture 9: EM and Latent Variables

True or false: Longer hash keys in LSH can increase accuracy but slow queries.

True.

73
New cards

Lecture 9: EM and Latent Variables

True or false: Latent variables may be unobserved factors affecting data.

True.

74
New cards

Lecture 9: EM and Latent Variables

True or false: EM algorithm provides a recipe to model latent variables.

False. EM is a recipe for parameter estimation; how the latent variables are modeled is up to the user.

75
New cards

Lecture 9: EM and Latent Variables

True or false: Bad annotators can be modeled as uniform noise.

True.

76
New cards

Lecture 9: EM and Latent Variables

True or false: E-step in EM estimates likelihood of observed data.

False. E-step computes expected latent variables given parameters and data.

77
New cards

Lecture 9: EM and Latent Variables

True or false: M-step in EM finds parameters that maximize likelihood given latent variable estimates.

True.

78
New cards

Lecture 9: EM and Latent Variables

True or false: M-step estimation is often weighted by latent variable likelihoods.

True.

79
New cards

Lecture 9: EM and Latent Variables

True or false: EM always converges to global maximum.

False. It converges to a local maximum.

80
New cards

Lecture 9: EM and Latent Variables

True or false: In the bad annotator problem, EM's robustness depends on how we model them.

True.

81
New cards

Lecture 9: EM and Latent Variables

True or false: K-means is an example of hard EM.

True.

82
New cards

Lecture 9: EM and Latent Variables

True or false: EM is a method for MLE with missing data.

True.

83
New cards

Lecture 9: EM and Latent Variables

True or false: EM guarantees global maximum likelihood.

False. Only local maxima are guaranteed.

84
New cards

Lecture 9: EM and Latent Variables

True or false: The E-step computes MLE of parameters given data.

False. M-step does that.

85
New cards

Lecture 9: EM and Latent Variables

True or false: The observed data likelihood increases after each EM iteration.

True.

86
New cards

Lecture 9: EM and Latent Variables

Given binary data x~Bernoulli(p), what's MLE of p with x1..xN?

p* = (Σ xi)/N
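
A quick numeric check, using made-up samples: the sample mean recovers p.

```python
import numpy as np

p_true = 0.3
x = np.random.default_rng(0).binomial(1, p_true, size=10_000)
p_mle = x.sum() / len(x)     # equivalently x.mean()
print(p_mle)                 # close to 0.3
```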

87
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: A histogram (discretizing the data into bins) is a parametric density model.

False. Histograms are non-parametric.

88
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: A continuous variable has probability zero at any single value.

True.

89
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: A PDF is "smooth" if its values at points near x are well approximated by its value at x.

True.

90
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: Histograms work better in higher dimensions.

False. They suffer in high dimensions (curse of dimensionality).

91
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: Mixture of Gaussians is better when PDF is smooth.

True.

92
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: Beta distribution can only model unimodal distributions.

False. Beta can model various shapes.

93
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: Hyperparameters for PDF estimation (bandwidth, #components) can be chosen via cross-validation.

True.

94
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

True or false: A common practical assumption is independence across features.

True.

95
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

Which single-mode distribution is better approximated by a single Gaussian: left or right plot (one big mode vs two modes)?

The distribution with one main mode (the right plot).

96
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

Which method better approximates the two-mode PDF on the left plot?

Mixture of Gaussians.
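
A minimal sketch, assuming scikit-learn: a 2-component GaussianMixture fit to made-up bimodal 1-D data recovers both modes, which a single Gaussian cannot.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(-2, 0.5, 500), rng.normal(3, 1.0, 500)]).reshape(-1, 1)

gmm = GaussianMixture(n_components=2, random_state=0).fit(x)
print(gmm.means_.ravel().round(2), gmm.weights_.round(2))  # means near -2 and 3, weights near 0.5
```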

97
New cards

Lecture 10: Density Estimation (MoG, Hist, KDE)

Why might MoG be infeasible for very complex PDFs?

Complex PDFs may require many components, increasing computational cost.

98
New cards

Lecture 11: Outliers and Robust Estimation

True or false: Moving average can eliminate any additive noise.

False. Averaging reduces zero-mean noise but cannot eliminate arbitrary additive noise; the effect depends on the noise characteristics and the window size.

99
New cards

Lecture 11: Outliers and Robust Estimation

True or false: Moving average is not robust to outliers.

True.
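
A small NumPy sketch of the sensitivity: a single outlier in an otherwise flat signal pulls a 3-point moving average far off, while a 3-point moving median (a robust alternative) is barely affected.

```python
import numpy as np

x = np.ones(11)
x[5] = 100.0                                    # one outlier in a flat signal
window = 3
avg = np.convolve(x, np.ones(window) / window, mode="same")
med = np.array([np.median(x[max(0, i - 1):i + 2]) for i in range(len(x))])
print("moving average around the outlier:", avg[4:7].round(1))  # pulled to ~34
print("moving median around the outlier: ", med[4:7])           # stays at 1
```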

100
New cards

Lecture 11: Outliers and Robust Estimation

True or false: Outliers always represent incorrect values.

False. They might be correct but non-representative.