Intro to AI - Machine Learning

399 Terms

New cards

Machine Learning

Agents building models and systems based on observed data.

New cards

Artificial Intelligence

AI ≠ Machine Learning (ML); Machine Learning is a subset of AI.

New cards

Deep Learning

A part of ML involving artificial neural networks.

New cards

Learning Agent

An agent is learning if it improves performance after making observations about the world.

New cards

Model

A hypothesis about the world, realized as software that can solve problems.

New cards

Supervised Learning

Learn a function from labeled data.

New cards

Unsupervised Learning

Learn patterns from unlabeled data.

New cards

Reinforcement Learning

Learn best actions from experience of rewards and punishments.

New cards

Labeled Data

Input-output pairs where the label is the output.

New cards

Classification

Output: finite set of values called classes or labels (e.g. true/false, sunny/rainy/cloudy).

New cards

Regression

Output: A number (e.g. temperature, which can be an integer or a real number).

New cards

Clustering

Output: Sets of similar data (based on a defined criterion).

New cards

Association Rule Mining

Output: Correlations and associations; Example: which items shoppers tend to purchase together.

New cards

Nearest Neighbors, Decision Trees, Neural Networks, Support Vector Machines, Linear Regression

Supervised learning algorithms

New cards

K Means Clustering, Hierarchical Clustering, Gaussian Mixture Models, Apriori Algorithm (association rule mining)

Unsupervised learning algorithms

New cards

Q-Learning

A reinforcement learning algorithm.

New cards

SARSA

State-Action-Reward-State-Action; a reinforcement learning algorithm.

New cards

Deep Q Network

A reinforcement learning algorithm.

New cards

Exploration

Try other options to get additional information.

New cards

Exploitation

Stay with what has given most reward.
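One common way to balance the two cards above is an epsilon-greedy rule. The sketch below is only an illustration: the name `epsilon_greedy`, the `epsilon` parameter, and the reward numbers are assumptions, not something taken from the slides.

```python
import random

def epsilon_greedy(avg_rewards, epsilon, rng=random):
    """Pick an action index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        # Exploration: try any option to get additional information.
        return rng.randrange(len(avg_rewards))
    # Exploitation: stay with the option that has given the most reward so far.
    return max(range(len(avg_rewards)), key=lambda a: avg_rewards[a])

avg_rewards = [0.2, 0.8, 0.5]  # hypothetical average reward seen per action
greedy_choice = epsilon_greedy(avg_rewards, epsilon=0.0)  # epsilon 0: never explores
```

With `epsilon=0.0` the agent always exploits; raising `epsilon` makes it explore more often.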

New cards

Restaurant Waiting

Will we wait for a table in a restaurant?

New cards

Labeled Dataset

A dataset where each instance has a corresponding label.

New cards

Instances

Examples in a dataset, which can be rows in a table.

New cards

Features

Attributes or characteristics of an instance, typically represented as columns in a dataset.

New cards

Labels

The output or target variable in a supervised learning task, usually found in the last column of a dataset.

New cards

Decision Tree

A model used in machine learning for classification and regression tasks.

New cards

Attributes

The features that describe instances in a dataset.

New cards

Goal of Classification

To accurately predict the label of instances based on their features.

New cards

Rows in a Dataset

Represent individual instances in a dataset.

New cards

Columns in a Dataset

Represent features or attributes of instances in a dataset.

New cards

Last Column in Dataset

Typically contains the labels in a labeled dataset.

New cards

Example of a Labeled Dataset

A dataset containing instances with both features and corresponding labels.
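As a rough illustration of the cards above, a labeled dataset can be held as rows (instances) whose last column is the label. The column names follow the restaurant example; the row values are made up.

```python
# Each row is one instance; the last column holds the label.
# The values below are invented for illustration only.
columns = ["Patrons", "Hungry", "Type", "WillWait"]
rows = [
    ["Some", "Yes", "French", "Yes"],
    ["Full", "No", "Thai", "No"],
    ["None", "No", "Burger", "No"],
]

# Split every row into its features and its label.
features = [row[:-1] for row in rows]
labels = [row[-1] for row in rows]
```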

New cards

Russell & Norvig

Authors of published material referenced in the slides.

New cards

Past Examples

Twelve examples with decisions made, each having ten attributes/features.

New cards

Decision

The outcome of a classification task, either 'Will Wait' or 'Will Not'.

New cards

Function (Model)

A function derived based on a dataset to predict the label of an instance with an unknown label.

New cards

Model Training

The process of training a model using features and labels from a dataset.

New cards

Model Testing

The process of testing a trained model using test features to predict labels.

New cards

Untrained Model

A model that has not yet been trained on any dataset.

New cards

Trained Model

A model that has been trained using a dataset and can make predictions.

New cards

Prediction

The output label predicted by a trained model for an instance.

New cards

Classification Models

Models used to categorize data into predefined classes, including Nearest Neighbors, Decision Trees, Random Forest, Support Vector Machines, and Neural Networks.

New cards

K in K Nearest Neighbors

The number of nearest neighbors to consider when classifying an unlabeled instance.

New cards

Instance as Datapoint

Each instance in a dataset represented as a point in a graph based on its features.

New cards

Label Representation

In K Nearest Neighbors classification, the predicted label is the one held by the majority of the K nearest points.
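A minimal sketch of K Nearest Neighbors, assuming Euclidean distance and a simple majority vote. The function name `knn_predict` and the sample points are hypothetical.

```python
from collections import Counter
import math

def knn_predict(train_points, train_labels, query, k):
    """Return the majority label among the k training points nearest to query."""
    # Order the training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_points)),
                   key=lambda i: math.dist(train_points[i], query))
    nearest_labels = [train_labels[i] for i in order[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Made-up datapoints: "blue" for waited, "red" for did not wait.
points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6)]
labels = ["blue", "blue", "blue", "red", "red"]
prediction = knn_predict(points, labels, query=(2, 2), k=3)
```

An odd K is often chosen for two-class problems so the vote cannot tie.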

New cards

Feature Table

A table where each row represents a person with various features and their corresponding labels.

New cards

Patrons

A feature indicating how many people are in the restaurant (None, Some, or Full).

New cards

Hungry

A feature indicating whether a patron is hungry or not.

New cards

Type

A feature indicating the kind of restaurant, i.e. its cuisine.

New cards

Will Wait

The label indicating whether we will wait for a table.

New cards

Blue Label

Indicates a patron who waited for food.

New cards

Red Label

Indicates a patron who did not wait for food.

New cards

Test

A node in a decision tree; each test checks the value of a single feature.

New cards

Predicted Label

The label reached at the end of a path of tests through the decision tree.

New cards

Goal of Decision Tree

A tree that most consistently leads to the correct labels (of the dataset).

New cards

Feature Selection

Choosing, at each split, the feature that can best distinguish examples by their labels.

New cards

Branch via new feature

The process of splitting the decision tree based on a new feature.

New cards

Same number of 'Yes' and 'No'

Indicates a bad split in the decision tree: the branch tells us nothing about the label.
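The cards do not say how "best distinguish" is measured. One common choice, assumed here, is entropy-based information gain; the names `entropy` and `information_gain` and the toy data are illustrative only.

```python
from collections import Counter
import math

def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(feature_values, labels):
    """Entropy reduction obtained by splitting the examples on one feature."""
    total = len(labels)
    remainder = 0.0
    for value in set(feature_values):
        subset = [lab for v, lab in zip(feature_values, labels) if v == value]
        remainder += len(subset) / total * entropy(subset)
    return entropy(labels) - remainder

labels = ["Yes", "Yes", "No", "No"]
useful = ["a", "a", "b", "b"]   # each branch ends up with a single label
useless = ["a", "b", "a", "b"]  # each branch has the same number of Yes and No
```

Under this measure the `useful` feature gains a full bit, while the `useless` one, whose branches each keep an even Yes/No mix, gains nothing.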

New cards

Examples

Instances that are used to build the decision tree.

New cards

Dataset

A collection of examples used for training the decision tree.

New cards

Full

A value of the Patrons feature: the restaurant is full.

New cards

Some

A value of the Patrons feature: there are some people in the restaurant.

New cards

None

A value of the Patrons feature: there is no one in the restaurant.

New cards

Types of Cuisine

Examples include Thai, French, Italian, and Burger.

New cards

Decision Outcomes

The results of the tests leading to labels.

New cards

Feature Distinction

The ability of a feature to separate examples effectively.

New cards

Random Forest

An ensemble method that predicts labels based on multiple decision trees, each from a random sample of the main dataset.

New cards

Overfitting

A problem with Decision Trees where the model fits well with the training dataset but does not perform well with new instances.

New cards

Support Vector Machines (SVM)

A method where instances are treated as datapoints and features as dimensions of a space, aiming to find a hyperplane that linearly divides the labeled datapoints.

New cards

Support Vectors

Points closest to the boundary in Support Vector Machines.
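To make "boundary" and "closest points" concrete, here is a small sketch that assumes a linear boundary w·x + b = 0 has already been found. The weights, bias, and points are hypothetical, and no training is shown.

```python
import math

def signed_score(point, weights, bias):
    """Value of w.x + b; its sign tells which side of the boundary a point is on."""
    return sum(w * x for w, x in zip(weights, point)) + bias

def side(point, weights, bias):
    """Predicted class: +1 on one side of the boundary, -1 on the other."""
    return 1 if signed_score(point, weights, bias) >= 0 else -1

def distance_to_boundary(point, weights, bias):
    """Perpendicular distance from a point to the boundary."""
    return abs(signed_score(point, weights, bias)) / math.hypot(*weights)

weights, bias = (1.0, 1.0), -6.0  # assumed boundary: x + y = 6
points = [(1, 1), (2, 3), (4, 4), (6, 6)]
closest = min(points, key=lambda p: distance_to_boundary(p, weights, bias))
```

In a trained SVM, the points at the smallest distance from the boundary are the support vectors.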

New cards

Artificial Neural Networks (ANN)

A model inspired by neurons and synapses in the human brain, consisting of layers of neurons connected to each other.

New cards

Input Layer

The layer in an ANN that takes in input signals, such as features.

New cards

Output Layer

The layer in an ANN that provides the output, such as labels.

New cards

Hidden Layers

Layers in an ANN that facilitate computations between the input and output layers.

New cards

Ensemble Method

A technique that combines multiple models to improve prediction accuracy.

New cards

Dimensions

Features in a dataset represented as axes of the space in which the datapoints lie.

New cards

Training Dataset

The dataset used to train a model.

New cards

New Instances

Data points that the model has not seen during training.

New cards

Popular in the early 2000s

A description of the widespread use of Support Vector Machines during that time.

New cards

Layers of Neurons

The structure of an ANN where neurons are organized in layers.

New cards

Computations

The processes carried out by hidden layers in an ANN to transform inputs into outputs.

New cards

Good in Practice

A phrase describing the effectiveness of Support Vector Machines in real-world applications.

New cards

Random Sample

A subset of data selected randomly from the main dataset for training individual decision trees.

New cards

Goal of Random Forest

To predict labels based on the aggregation of predictions from multiple decision trees.
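A rough sketch of the two ingredients described above: a random sample per tree and aggregation of the trees' predictions. Sampling with replacement and majority voting are assumptions here, and the tree outputs are hypothetical stand-ins for trained trees.

```python
from collections import Counter
import random

def bootstrap_sample(dataset, rng):
    """Draw a random sample (with replacement) the same size as the dataset."""
    return [rng.choice(dataset) for _ in dataset]

def majority_vote(predictions):
    """Aggregate the trees' predicted labels by taking the most common one."""
    return Counter(predictions).most_common(1)[0][0]

rng = random.Random(0)
dataset = [("Some", "Yes"), ("Full", "No"), ("None", "No"), ("Some", "Yes")]
sample = bootstrap_sample(dataset, rng)  # would be used to train one tree

# Hypothetical labels predicted by three trees for the same instance.
forest_prediction = majority_vote(["Yes", "No", "Yes"])
```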

New cards

Neuron activation

A neuron is activated based on input signals, weights, thresholds, and activation function.
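The four ingredients named above can be put together as follows. This is a sketch assuming a step activation function; the weights and threshold are made-up values.

```python
def step(x):
    """Step activation function: fire (1) when the input is non-negative."""
    return 1 if x >= 0 else 0

def neuron_output(inputs, weights, threshold, activation=step):
    """Weigh the input signals, subtract the threshold, apply the activation."""
    weighted_sum = sum(w * x for w, x in zip(weights, inputs))
    return activation(weighted_sum - threshold)

# With these hypothetical weights the neuron fires only when both inputs are on.
fires = neuron_output([1, 1], weights=[0.6, 0.6], threshold=1.0)
stays_off = neuron_output([1, 0], weights=[0.6, 0.6], threshold=1.0)
```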

New cards

Back propagation

The method an ANN uses to learn its weights and thresholds.

New cards

Model Evaluation

To evaluate a classification model, we split our dataset into a training set and a test set.

New cards

Training set

The part of the dataset used to train the model.

New cards

Test set

The part of the dataset used to evaluate the model.
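One simple way to produce the two sets is to shuffle the dataset and cut it. The function name `train_test_split`, the 25% test fraction, and the fixed seed below are assumptions made for illustration.

```python
import random

def train_test_split(dataset, test_fraction, seed=0):
    """Shuffle a copy of the dataset, then cut it into training and test sets."""
    shuffled = list(dataset)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

dataset = list(range(12))  # stand-in for 12 labeled instances
train_set, test_set = train_test_split(dataset, test_fraction=0.25)
```

Every instance lands in exactly one of the two sets, so the model is evaluated only on instances it did not see during training.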

New cards

Accuracy

# correct predictions / # total predictions

New cards

Confusion Matrix

A table showing the correct labels against the predicted labels for each class (i.e. each possible value of the label).

New cards

True Positive (TP)

Correctly predicted positive instances

New cards

False Positive (FP)

Negative instances incorrectly predicted as positive

New cards

True Negative (TN)

Correctly predicted negative instances

New cards

False Negative (FN)

Positive instances incorrectly predicted as negative
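The four counts and the accuracy can be computed from a list of correct labels and a list of predicted labels. The sketch below uses made-up labels and treats "Yes" as the positive class; `confusion_counts` is a hypothetical helper name.

```python
def confusion_counts(actual, predicted, positive):
    """Return (TP, FP, TN, FN), treating one class as the positive class."""
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        elif a == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, fp, tn, fn

actual = ["Yes", "Yes", "No", "No", "Yes", "No"]
predicted = ["Yes", "No", "No", "Yes", "Yes", "No"]
tp, fp, tn, fn = confusion_counts(actual, predicted, positive="Yes")

# Accuracy counts both kinds of correct prediction over all predictions.
accuracy = (tp + tn) / len(actual)
```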

New cards

Number of test instances

New cards

Number of correct predictions

New cards

Accuracy example

9 correct predictions out of 12 test instances: 9/12 = 0.75 or 75%

New cards

Correct predictions

are along the diagonal in the confusion matrix.

New cards

Example of Unsupervised Learning

Input: Images of animals; Output: Groups of similar images