Foundations of Machine Learning – Vocabulary (Module 1 Introduction)

0.0(0)
studied byStudied by 1 person
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/42

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering key ML concepts, terminology, and evaluation ideas from the lecture notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

43 Terms

1
New cards

Machine Learning

Field where computer programs improve performance on tasks with experience from data; learning from E with respect to tasks T and measure P.

2
New cards

Task

The behavior or task being improved (e.g., classification, acting in an environment).

3
New cards

Data

Experiences used to improve performance in the task; training examples.

4
New cards

Performance Measure

Criterion to evaluate task performance (e.g., accuracy, error rate).

5
New cards

Example (x,y)

An instance x with label y; used to learn mappings from inputs to outputs.

6
New cards

Hypothesis Space

Set of all hypotheses (functions) that a learning algorithm may output.

7
New cards

Hypothesis

A function that approximates the target function mapping inputs to outputs.

8
New cards

Target Function

The true function f that maps each input x to its correct output y.

9
New cards

Training Data

Collection of observed examples used to learn a hypothesis.

10
New cards

Instance Space

Set of all possible objects describable by features X.

11
New cards

Concept

Subset of X representing objects with a property; unknown.

12
New cards

Feature

A measurable property describing an aspect of an instance.

13
New cards

Feature Vector

An n-dimensional vector of features representing an object.

14
New cards

Decision Tree

Tree-structured model that splits data on feature values to predict outputs.

15
New cards

Linear Function

Model where output is a weighted sum of inputs (linear relationship).

16
New cards

Perceptron

Single-layer neural unit computing a weighted sum with an activation function.

17
New cards

Multi-Layer Neural Network

Network with multiple hidden layers enabling complex representations.

18
New cards

Supervised Learning

Learning from labeled data (X,y) to predict labels for new inputs.

19
New cards

Unsupervised Learning

Learning from unlabeled data to cluster or summarize data.

20
New cards

Semi-Supervised Learning

Learning from a mix of labeled and unlabeled data.

21
New cards

Reinforcement Learning

Learning to act in an environment to maximize cumulative reward; agent, environment, state, action, reward.

22
New cards

Classification

Predicting a discrete label.

23
New cards

Regression

Predicting a continuous value.

24
New cards

Confusion Matrix

Table of true vs. predicted classes used to compute accuracy, precision, recall.

25
New cards

Accuracy

Proportion of correct predictions: (TP+TN)/(P+N).

26
New cards

Precision

TP/(TP+FP) – correctness of positive predictions.

27
New cards

Recall (TP rate)

TP/P – proportion of actual positives correctly predicted.

28
New cards

Bias

Systematic error from model assumptions; can cause underfitting.

29
New cards

Variance

Variability of model predictions due to different training data.

30
New cards

Generalization

Ability to perform well on unseen data; low generalization error.

31
New cards

Overfitting

Model fits noise in training data; high variance, poor generalization.

32
New cards

Underfitting

Model too simple; high bias, poor training and test performance.

33
New cards

Evaluation

Process of assessing algorithm performance using metrics and validation.

34
New cards

Cross-Validation

Estimating generalization by partitioning data into training/validation sets; common as k-fold.

35
New cards

Training Set

Data used to train the model.

36
New cards

Test Set

Independent data used to evaluate model performance.

37
New cards

Occam's Razor

Among consistent hypotheses, the simplest is preferred.

38
New cards

Minimum Description Length (MDL)

Inductive bias favoring shorter descriptions of hypotheses.

39
New cards

Maximum Margin

SVM principle: maximize the distance between class boundaries.

40
New cards

Inductive Bias

Assumptions guiding the learning process to prefer certain hypotheses.

41
New cards

Hypothesis Language

Formalism used to express hypotheses, influencing bias.

42
New cards

Inductive Learning

Inferring a general function from training examples.

43
New cards

Experimentation Error

Error due to finite sample size and bias-variance trade-offs.