Chapter 1-10: Machine Learning — Key Vocabulary for Lecture Review

0.0(0)

Studied by 0 people

View linked note

Call with Kai

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/43

Earn XP

Description and Tags

These flashcards cover the core vocabulary and concepts introduced across Chapters 1–10 of the lecture notes, including AI/ML/DL distinctions, data concepts, Python/Pandas/NumPy essentials, and ML modeling ideas. Use them to reinforce key terms and their definitions ahead of the exam.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

44 Terms

New cards

Artificial Intelligence (AI)

A broad field of computer science focused on creating systems that exhibit intelligent behavior; includes rule-based systems, machine learning, and deep learning.

New cards

Machine Learning (ML)

A subset of AI that learns patterns from data to make predictions or decisions, using algorithms like linear regression, SVMs, and decision trees.

New cards

Deep Learning (DL)

A subset of ML based on artificial neural networks with many layers that learn hierarchical representations from data.

New cards

Generative AI

A form of AI focused on generating new data (e.g., text, images); often used colloquially to refer to models like GPT-based systems.

New cards

Turing Test

A test proposed by Alan Turing to determine if a machine can exhibit indistinguishable behavior from a human in conversation.

New cards

Dartmouth Conference (1956)

The conference that helped establish AI as a formal field of study.

New cards

Perceptron (1957)

One of the earliest neural network models; a fundamental building block for later neural networks.

New cards

Neural Network

A computing system inspired by the brain, consisting of interconnected units (neurons) that learn from data.

New cards

Ground Truth

The actual, correct label or outcome used to supervise or evaluate a model’s predictions.

New cards

Feature

A measurable property or input clue used by a model to make predictions.

New cards

Label

The target value or category that a model tries to predict (the ground truth).

New cards

Weights

Parameters that quantify the importance of each feature in predicting the output; higher weight means greater influence.

New cards

Bias (Intercept)

An offset term in a linear model that allows shifting the decision boundary; enables better fitting beyond passing through the origin.

New cards

Loss Function

A function that quantifies prediction error; used to guide the optimization process to improve model accuracy.

New cards

Gradient Descent

An optimization method that updates weights to minimize loss, using iterative steps toward smaller error (often described with a hot/cold analogy in class).

New cards

Garbage In, Garbage Out (GIGO)

The principle that poor quality input data or labels lead to poor model performance.

New cards

Overfitting / Overtraining

When a model learns noise and specifics of the training data too well, resulting in poor generalization to new data.

New cards

Python

The programming language used in the course for implementing ML pipelines.

New cards

Pandas

A Python library for data manipulation and analysis, centered on DataFrame and Series structures.

New cards

DataFrame

A Pandas data structure for storing tabular data with labeled axes (rows and columns).

New cards

DataFrame.head()

A Pandas function that displays the first few rows of a DataFrame (default 5).

New cards

DataFrame.info()

A Pandas function that shows data types, non-null counts, and basic info about a DataFrame.

New cards

DataFrame.describe()

A Pandas function that provides summary statistics (mean, std, min, max, percentiles) for numeric columns.

New cards

Display (Pandas)

A Pandas function that renders DataFrames in a readable, interactive format (especially in Jupyter).

New cards

read_csv

A Pandas function to read data from a CSV (Comma-Separated Values) file into a DataFrame.

New cards

read_excel

A Pandas function to read data from an Excel file (XLSX) into a DataFrame.

New cards

NumPy

A Python library for numerical computing, providing multi-dimensional arrays and operations on them.

New cards

Array

A core data structure in NumPy representing a grid of values (1D, 2D, etc.).

New cards

Matrix Multiplication

The operation of multiplying two matrices when their dimensions are compatible; in NumPy via dot or @ operator.

New cards

Zeros / Ones / Identity (NumPy)

Functions to create arrays of zeros, ones, or an identity matrix used for initializing computations.

New cards

Shape

The dimensions of a NumPy array (rows, columns); accessed via the shape attribute.

New cards

Indexing / Slicing

Accessing elements of an array or DataFrame by position (index) or by label; negative indices access from the end.

New cards

Random Seed

A value used to initialize a pseudo-random number generator to ensure reproducible results.

New cards

Jupyter Notebook / JupyterLab

Interactive environments for writing and executing Python, especially useful for data analysis and visualization.

New cards

Conda / Virtual Environments

Tools to create isolated Python environments with specific package versions to avoid conflicts.

New cards

Matplotlib Visualizations

A plotting library (often used with Pandas) for creating graphs like scatter plots, line plots, histograms, and more.

New cards

GroupBy / Apply (Pandas)

DataFrame operations for aggregating data by groups and applying custom functions to groups.

New cards

Supervised Learning

A type of machine learning where the model learns from a dataset with labeled examples (features and corresponding ground truth labels) to make predictions on new, unlabeled data.

New cards

Unsupervised Learning

A type of machine learning that works with unlabeled data to find hidden patterns, structures, or relationships within the data, such as clustering or dimensionality reduction.

New cards

Activation Function

A function that determines the output of a neuron in a neural network; it introduces non-linearity, allowing the network to learn complex patterns.

New cards

Backpropagation

An algorithm used to train neural networks by iteratively adjusting the weights of the network based on the gradient of the loss function with respect to those weights.

New cards

Training Set

The portion of the dataset used to train the machine learning model, where the model learns patterns and relationships.

New cards

Validation Set

A portion of the dataset used to tune hyperparameters and evaluate the model's performance during training, helping to prevent overfitting.

New cards

Test Set

A completely independent portion of the dataset used to evaluate the final performance of a trained model on unseen data, assessing its generalization ability.