Vocabulary flashcards covering key ML concepts, PAC theory, version spaces, hypothesis concepts, and common algorithms from the lecture notes.
Machine Learning (ML)
A field in which systems improve their performance through experience and data; formally, a program learns from experience E with respect to tasks T and performance measure P if its performance at T, as measured by P, improves with E.
Learning Paradigms
Categories of ML such as supervised learning, unsupervised learning, and reinforcement learning used to learn from data or interactions.
PAC Learning
Probably Approximately Correct learning; a theoretical framework that bounds how much training data is needed so that, with high probability, a learned hypothesis generalizes well to unseen data.
ε (epsilon)
The upper bound on the error rate of a hypothesis on unseen data in PAC learning.
δ (delta)
The probability of failing to achieve ε-accuracy; the confidence is 1 − δ.
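For a finite hypothesis space, a standard sample-complexity bound ties ε, δ, and |H| together. The sketch below (our own illustration, not stated on these cards) computes m ≥ (1/ε)(ln|H| + ln(1/δ)), a number of examples sufficient for any consistent learner; the example values assume Mitchell's EnjoySport count of 973 semantically distinct hypotheses.

```python
import math

def pac_sample_bound(h_size: int, epsilon: float, delta: float) -> int:
    """Examples sufficient for a consistent learner over a finite
    hypothesis space H to be probably (1 - delta) approximately
    (error <= epsilon) correct: m >= (1/eps)(ln|H| + ln(1/delta))."""
    return math.ceil((math.log(h_size) + math.log(1.0 / delta)) / epsilon)

# |H| = 973 (semantically distinct EnjoySport conjunctions),
# epsilon = 0.1, delta = 0.05
print(pac_sample_bound(973, 0.1, 0.05))  # -> 99
```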
Version Space
The set of all hypotheses in the hypothesis space that are consistent with the observed training examples.
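A minimal sketch of computing a version space by brute-force filtering, assuming the conjunctive tuple representation used on these cards ('?' for don't-care); the function names are our own.

```python
def covers(h, x):
    """A conjunctive hypothesis covers x if every constraint
    either matches the attribute value or is the '?' don't-care."""
    return all(hc == '?' or hc == xc for hc, xc in zip(h, x))

def consistent(h, examples):
    """h is consistent if it labels every training example correctly."""
    return all(covers(h, x) == label for x, label in examples)

def version_space(H, examples):
    """All hypotheses in H consistent with the observed examples."""
    return [h for h in H if consistent(h, examples)]
```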
Target Concept
The true boolean-valued function to be learned, denoted as c, defined over the instance space.
Hypothesis
A candidate function h in the hypothesis space that maps instances to labels (0/1) to approximate the target concept.
Hypothesis Space (H)
The set of all hypotheses the learner may consider, such as conjunctions of literals in some tasks.
EnjoySport
A toy concept-learning task where the goal is to learn when EnjoySport is true based on attributes like Sky, AirTemp, Humidity, etc.
Positive Example
An instance x for which the target concept c(x) equals 1 (belongs to the concept).
Negative Example
An instance x for which the target concept c(x) equals 0 (does not belong to the concept).
Inductive Learning Hypothesis
The assumption that any hypothesis that approximates the target concept well over a sufficiently large set of training examples will also approximate it well over unobserved instances.
Inductive Learning
The process of deriving general rules or hypotheses from specific training examples.
FIND-S Algorithm
Starts with the most specific hypothesis and minimally generalizes it, just enough to cover each positive example; negative examples are ignored.
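A sketch of FIND-S under the same tuple representation; the attribute names follow the EnjoySport task, but the specific training rows and the find_s name are illustrative assumptions.

```python
def find_s(examples):
    """FIND-S: start with the most specific hypothesis and minimally
    generalize it to cover each positive example; negatives are ignored."""
    n = len(examples[0][0])
    h = ['0'] * n  # '0' plays the role of the phi (no value) constraint
    for x, label in examples:
        if label != 1:
            continue  # FIND-S ignores negative examples
        for i, value in enumerate(x):
            if h[i] == '0':
                h[i] = value   # first positive example: copy its values
            elif h[i] != value:
                h[i] = '?'     # conflicting value: generalize to don't-care
    return tuple(h)

# Illustrative EnjoySport-style data: (Sky, AirTemp, Humidity, Wind), label
train = [
    (('Sunny', 'Warm', 'Normal', 'Strong'), 1),
    (('Sunny', 'Warm', 'High',   'Strong'), 1),
    (('Rainy', 'Cold', 'High',   'Strong'), 0),
]
print(find_s(train))  # -> ('Sunny', 'Warm', '?', 'Strong')
```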
Most General Hypothesis
The broadest hypothesis, which classifies every instance as positive; in the conjunctive representation, every constraint is the don't-care value ? (e.g., ⟨?, ?, ?, ?, ?, ?⟩).
Most Specific Hypothesis
The narrowest hypothesis, which classifies every instance as negative; in the conjunctive representation, every constraint is ɸ (e.g., ⟨ɸ, ɸ, ɸ, ɸ, ɸ, ɸ⟩).
Hypothesis Representation
The way hypotheses are encoded, such as conjunctions of attribute constraints.
Conjunctions of Literals
A common hypothesis form where multiple attribute constraints are combined using AND.
Don’t Care (?)
A placeholder value indicating that an attribute can take any value in a constraint.
ɸ (phi)
The constraint symbol meaning that no value is acceptable for an attribute; a hypothesis containing ɸ covers no instances and classifies every example as negative.
Target Function c
The true function mapping each instance to 0 or 1 in a learning task.
Positive Training Example
A training pair ⟨x, c(x)⟩ in which c(x) = 1.
Negative Training Example
A training pair ⟨x, c(x)⟩ in which c(x) = 0.
Training Sample
The set of examples used to train a learning algorithm.
Validation Sample
The set of examples used to tune the free parameters (hyperparameters) of a learning algorithm.
Test Sample
The set of examples used to evaluate the performance of a learned model on unseen data.
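A minimal sketch of the three-way split the three cards above describe; the function name and the 60/20/20 proportions are illustrative assumptions.

```python
import random

def three_way_split(data, train_frac=0.6, val_frac=0.2, seed=0):
    """Shuffle once, then carve the data into training, validation,
    and test samples; the remainder after train+val is the test set."""
    data = data[:]  # copy so the caller's list is untouched
    random.Random(seed).shuffle(data)
    n_train = int(len(data) * train_frac)
    n_val = int(len(data) * val_frac)
    return (data[:n_train],                  # fit the model
            data[n_train:n_train + n_val],   # tune hyperparameters
            data[n_train + n_val:])          # final, untouched evaluation
```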
Loss Function
A function that measures the difference between predicted labels and true labels.
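Two common instantiations of this idea, as a sketch (the function names are our own): the 0-1 loss used for classification and the squared loss used for regression.

```python
def zero_one_loss(y_true, y_pred):
    """Fraction of examples labeled incorrectly (0-1 loss)."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)

def squared_loss(y_true, y_pred):
    """Mean squared difference between predictions and true values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
```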
Features
The set of attributes (or attribute vectors) associated with each example.
Labels
The values or categories assigned to examples (the target outputs).