Machine Learning, Stats, Math

59 Terms

1
What is supervised learning?
Learning a mapping from inputs to outputs using labeled examples.
2
What is unsupervised learning?

Discovering structure in unlabeled data (clustering, dimensionality reduction).

3
Define overfitting and name two prevention methods.

The model fits noise in the training data rather than signal; prevent with regularization, early stopping, cross-validation, or more data.

4
Difference between classification and regression?
Classification predicts discrete labels; regression predicts continuous values.
5
How does k-NN predict class?
By majority vote among the k nearest neighbors in feature space.
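
A minimal NumPy sketch of the voting step (the four training points and k = 3 are illustrative; in practice scikit-learn's KNeighborsClassifier does this):

    import numpy as np
    from collections import Counter

    def knn_predict(X_train, y_train, x, k=3):
        dists = np.linalg.norm(X_train - x, axis=1)  # Euclidean distance to every training point
        nearest = np.argsort(dists)[:k]              # indices of the k closest points
        return Counter(y_train[nearest]).most_common(1)[0][0]  # majority vote

    X_train = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
    y_train = np.array([0, 0, 1, 1])
    print(knn_predict(X_train, y_train, np.array([0.95, 1.0])))  # -> 1
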
6
Why scale features?

To put features on comparable scales; important for k-NN, SVM, and gradient-descent-based training.
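
A short sketch with scikit-learn's StandardScaler (the arrays are illustrative); note the scaler is fit on training data only and reused on new data:

    import numpy as np
    from sklearn.preprocessing import StandardScaler

    X_train = np.array([[1.0, 100.0], [2.0, 300.0], [3.0, 500.0]])
    X_test = np.array([[2.5, 400.0]])

    scaler = StandardScaler()
    X_train_scaled = scaler.fit_transform(X_train)  # learn mean/std from training data
    X_test_scaled = scaler.transform(X_test)        # apply the same transform; never refit on test data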

7
What is cross-validation?
Method to estimate model generalization by splitting data into training and validation folds.
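
A sketch using scikit-learn's cross_val_score with 5 folds (the iris dataset and logistic regression are just a convenient toy setup):

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score

    X, y = load_iris(return_X_y=True)
    scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
    print(scores.mean(), scores.std())  # average validation accuracy across the 5 folds
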
8
What is the ROC curve?
Plot of True Positive Rate vs False Positive Rate; AUC summarizes discrimination ability.
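
A sketch computing the curve and its AUC with scikit-learn (the labels and scores are illustrative):

    from sklearn.metrics import roc_auc_score, roc_curve

    y_true = [0, 0, 1, 1, 1, 0]
    y_score = [0.1, 0.4, 0.35, 0.8, 0.9, 0.2]  # predicted probabilities
    fpr, tpr, thresholds = roc_curve(y_true, y_score)
    print(roc_auc_score(y_true, y_score))      # area under the TPR-vs-FPR curve
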
9
What does regularization do?

Adds a penalty on model weights to reduce overfitting; common forms are L1 (Lasso) and L2 (Ridge).
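
A sketch contrasting the two penalties in scikit-learn (data and alpha values are illustrative; alpha sets the penalty strength):

    import numpy as np
    from sklearn.linear_model import Lasso, Ridge

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 10))
    y = 3.0 * X[:, 0] + rng.normal(size=100)  # only the first feature matters

    ridge = Ridge(alpha=1.0).fit(X, y)  # L2: shrinks all weights toward zero
    lasso = Lasso(alpha=0.1).fit(X, y)  # L1: drives irrelevant weights exactly to zero
    print(ridge.coef_.round(2))
    print(lasso.coef_.round(2))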

10
Define bias and variance.
Bias = error from incorrect assumptions; Variance = error from model sensitivity to data.
11
What is PCA?
Linear dimensionality reduction projecting data onto principal components with max variance.
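
A sketch with scikit-learn's PCA, projecting the 4-dimensional iris data down to 2 components:

    from sklearn.datasets import load_iris
    from sklearn.decomposition import PCA

    X, _ = load_iris(return_X_y=True)
    pca = PCA(n_components=2)
    X_2d = pca.fit_transform(X)           # project onto the top-variance directions
    print(pca.explained_variance_ratio_)  # variance captured by each component
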
12
What is SMOTE?
Oversampling technique that synthesizes new minority class samples to fix class imbalance.
13
What is the softmax function used for?
To convert logits to normalized class probabilities.
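
A minimal NumPy implementation (subtracting the max logit is a standard numerical-stability trick):

    import numpy as np

    def softmax(logits):
        z = logits - np.max(logits)  # shift for numerical stability; result is unchanged
        e = np.exp(z)
        return e / e.sum()           # probabilities that sum to 1

    print(softmax(np.array([2.0, 1.0, 0.1])))  # approx [0.66, 0.24, 0.10]
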
14
What is dropout?
Randomly setting neurons to zero during training to prevent overfitting.
15
What is early stopping?
Halting training when validation error stops decreasing to prevent overfitting.
16
What is ensemble learning?

Combining multiple models to improve performance, e.g. bagging or boosting.

17
What is XGBoost known for?

Regularized gradient boosting that is effective and efficient on tabular data.

18
What is SHAP?
Shapley value-based feature attribution for interpretability.
19
What is feature drift vs concept drift?
Feature drift: change in X distribution; Concept drift: change in P(y|X).
20
Why calibrate models?
To make predicted probabilities match actual outcome frequencies (important in risk).
21
How does Random Forest reduce overfitting?
By averaging many decision trees trained on bootstrapped data.
22
Difference between bagging and boosting?
Bagging trains models in parallel to reduce variance; boosting trains sequentially to reduce bias.
23
What is t-SNE mainly used for?
Visualizing high-dimensional data in 2D while preserving local structure.
24
What is transfer learning?
Using pretrained models on new tasks to save time and data.
25
What problem does batch normalization solve?

Stabilizes layer activations, speeds convergence, and allows higher learning rates.

26
Name local and global model explanation methods.
Local: LIME or per-prediction SHAP; Global: aggregated SHAP or feature importance.
27
What is adversarial training?
Training with perturbed inputs to improve robustness.
28
What is CatBoost notable for?
Handles categorical variables natively without one-hot encoding.
29
What is out-of-bag (OOB) error?
Validation metric using unused samples per tree in a Random Forest.
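
A sketch: scikit-learn exposes this through the oob_score flag on RandomForestClassifier (dataset and hyperparameters are illustrative):

    from sklearn.datasets import load_iris
    from sklearn.ensemble import RandomForestClassifier

    X, y = load_iris(return_X_y=True)
    rf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0).fit(X, y)
    print(rf.oob_score_)  # accuracy on the samples each tree never saw in its bootstrap
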
30
What is model serialization?

Saving a trained model to disk for later loading and deployment, e.g. with pickle or joblib.
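
A sketch with joblib (pickle.dump/pickle.load work the same way; the file name is illustrative):

    import joblib
    from sklearn.datasets import load_iris
    from sklearn.ensemble import RandomForestClassifier

    X, y = load_iris(return_X_y=True)
    model = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

    joblib.dump(model, "model.joblib")      # serialize the fitted model to disk
    restored = joblib.load("model.joblib")  # load it back, e.g. in a serving process
    print(restored.predict(X[:3]))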

31
What is quantile regression?
Estimates conditional quantiles (like median or 95th percentile).
32
What is feature hashing?
Hashes categorical features into fixed-length numeric vectors.
33

What is cross-validation leakage?

Information from validation leaks into training via preprocessing before splitting.
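
A sketch of the standard fix: wrap preprocessing in a scikit-learn Pipeline so the scaler is refit inside each training fold instead of once on the full dataset:

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler

    X, y = load_iris(return_X_y=True)
    # Leaky: StandardScaler().fit_transform(X) before splitting uses validation rows' statistics.
    # Safe: the pipeline fits the scaler on each training fold only.
    pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    print(cross_val_score(pipe, X, y, cv=5).mean())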

34
What is bootstrapping used for?
Resampling with replacement to estimate uncertainty and confidence intervals.
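
A sketch estimating a 95% confidence interval for a mean by resampling with replacement (the sample and the 10,000 resamples are illustrative):

    import numpy as np

    rng = np.random.default_rng(0)
    data = rng.exponential(scale=2.0, size=200)  # stand-in for an observed sample

    boot_means = [rng.choice(data, size=data.size, replace=True).mean()
                  for _ in range(10_000)]
    print(np.percentile(boot_means, [2.5, 97.5]))  # 95% CI for the mean
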
35
When to use precision vs recall?
Precision when false positives are costly; recall when false negatives are costly.
36
Difference between calibration and discrimination?
Calibration = probability accuracy; discrimination = ranking quality (AUC).
37

What are Bayesian neural networks used for?

Quantifying uncertainty in predictions, useful for risk-sensitive tasks.

38
What is knowledge distillation?
Training a smaller model to mimic the outputs of a larger model.
39

Define concept drift detection best practice.

Monitor feature distributions and predictive performance; retrain when drift detected.

40
What is active learning?

The model selects the most informative samples to query for labels, reducing labeling cost.

41
Formula for sample variance?
s² = (1/(n-1)) Σ (xᵢ - x̄)².
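
A quick check of the n-1 (Bessel-corrected) denominator against NumPy, where ddof=1 selects the sample rather than population variance:

    import numpy as np

    x = np.array([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])
    manual = ((x - x.mean()) ** 2).sum() / (x.size - 1)
    print(manual, np.var(x, ddof=1))  # both ~4.571
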
42
Bayes theorem?
P(A|B) = [P(B|A) * P(A)] / P(B).
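
A worked illustration with hypothetical numbers: a 99%-sensitive, 95%-specific test for a condition with 1% prevalence:

    p_d = 0.01               # P(A): prevalence of the condition
    p_pos_given_d = 0.99     # P(B|A): sensitivity
    p_pos_given_not = 0.05   # P(B|not A): false positive rate (1 - specificity)

    p_pos = p_pos_given_d * p_d + p_pos_given_not * (1 - p_d)  # P(B), by total probability
    print(p_pos_given_d * p_d / p_pos)  # P(A|B) approx 0.167, despite the 99% sensitivity
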
43
Central Limit Theorem (CLT)?
The sample mean of i.i.d. variables (with finite variance) approaches a Normal distribution as n → ∞.
44
When to use t-test vs z-test?
Use t-test when σ unknown or n small; z-test when σ known or n large.
45
Use-case for Poisson distribution?

Modeling counts of rare events per time unit (e.g. fraud events).

46
Law of total probability?
P(B) = Σᵢ P(B|Aᵢ) P(Aᵢ) for a partition {Aᵢ} of the sample space.
47
Meaning of p-value?
Probability of observing data at least as extreme assuming H0 is true.
48
MLE vs Method of Moments?
MLE maximizes likelihood; MoM matches sample and population moments.
49
When to use Poisson vs Binomial?
Poisson for rare events; Binomial for fixed number of independent trials.
50
Complexity of BFS?
O(V + E).
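
A minimal BFS over an adjacency list (the graph is illustrative); each vertex is enqueued once and each edge examined once, giving O(V + E):

    from collections import deque

    graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}

    def bfs(graph, start):
        visited = {start}
        order = []
        queue = deque([start])
        while queue:
            node = queue.popleft()   # each vertex dequeued exactly once
            order.append(node)
            for nbr in graph[node]:  # each edge examined exactly once
                if nbr not in visited:
                    visited.add(nbr)
                    queue.append(nbr)
        return order

    print(bfs(graph, "A"))  # ['A', 'B', 'C', 'D']
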
51
Binomial coefficient formula?

C(n, k) = n! / [k!(n-k)!].
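
In Python, math.comb computes this directly; the factorial expression below checks it against the formula:

    import math

    print(math.comb(5, 2))  # 10
    print(math.factorial(5) // (math.factorial(2) * math.factorial(3)))  # 10, from n!/[k!(n-k)!]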

52
Use-case for Dijkstra?
Finding shortest path with non-negative edge weights.
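
A compact heap-based sketch (the graph and weights are illustrative; correctness relies on all weights being non-negative):

    import heapq

    graph = {"A": [("B", 1), ("C", 4)], "B": [("C", 2), ("D", 5)],
             "C": [("D", 1)], "D": []}

    def dijkstra(graph, start):
        dist = {start: 0}
        heap = [(0, start)]
        while heap:
            d, node = heapq.heappop(heap)
            if d > dist.get(node, float("inf")):
                continue              # stale queue entry; node already settled
            for nbr, w in graph[node]:
                if d + w < dist.get(nbr, float("inf")):
                    dist[nbr] = d + w  # relax the edge
                    heapq.heappush(heap, (dist[nbr], nbr))
        return dist

    print(dijkstra(graph, "A"))  # {'A': 0, 'B': 1, 'C': 3, 'D': 4}
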
53
What is dynamic programming?
Breaking problems into overlapping subproblems and caching results.
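
The classic instance is Fibonacci (card 58): naive recursion recomputes subproblems exponentially, while caching makes it linear. functools.lru_cache does the memoization:

    from functools import lru_cache

    @lru_cache(maxsize=None)
    def fib(n):
        # fib(n-1) and fib(n-2) overlap heavily; the cache stores each result once
        return n if n < 2 else fib(n - 1) + fib(n - 2)

    print(fib(50))  # 12586269025
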
54
Master theorem concept?
Compare f(n) vs n^(log_b a) to estimate asymptotic complexity.
55
Generating function use?
Solve recurrences and count combinatorial objects.
56
What is modular arithmetic used for?
Cryptography and hashing operations.
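
Python's built-in three-argument pow does fast modular exponentiation, the core primitive in RSA-style cryptography (the numbers are illustrative):

    print(pow(7, 128, 13))  # 7^128 mod 13 = 3, without forming the huge intermediate power
    print((17 * 22) % 5)    # modular multiplication: 374 mod 5 = 4
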
57
Definition of NP-hard?

At least as hard as the hardest problems in NP; not necessarily in NP itself, and not known to be solvable in polynomial time.

58
Recurrence relation for Fibonacci?

F(n) = F(n-1) + F(n-2), with F(0) = 0 and F(1) = 1.

59
Example of greedy optimal algorithm?
Kruskal's algorithm for Minimum Spanning Tree.