Chapter 1-8: Deep Learning & Imitation Learning - Vocabulary Flashcards

Description and Tags

Vocabulary flashcards covering core concepts from the lecture notes on supervised/unsupervised/self-supervised learning, reinforcement learning basics, neural networks, CNNs, imitation learning, and related techniques.

30 Terms

Supervised learning

Training with input-output pairs (x, y) to learn a function f that maps inputs to labels.

Unsupervised learning

Learning from data without labeled outputs; the model discovers structure or representations.

Self-supervised learning

Learning in which labels are derived from the data itself through pretext tasks (e.g., predicting the next token); widely used to pretrain language models.

Reinforcement learning

An agent learns by interacting with an environment, choosing actions to maximize cumulative reward.

Classification

A supervised learning task where inputs are mapped to discrete labels, e.g., iris species.

Iris dataset

Classic dataset of 150 iris flowers described by four features (sepal length, sepal width, petal length, petal width), used to classify three species.

Generalization

Model performance on unseen data beyond the training set.

IID (independent and identically distributed)

Assumption that all samples, training and test alike, are drawn independently from the same distribution.

Training/testing split

Dividing data into training and testing sets (e.g., 80/20) to assess generalization.
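
A minimal sketch of an 80/20 split using only the standard library. The helper name `train_test_split`, the `seed` parameter, and the toy data are illustrative assumptions, not part of the lecture notes.

```python
import random

def train_test_split(samples, test_fraction=0.2, seed=0):
    """Shuffle a copy of `samples` and split it into (train, test) lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(samples)                 # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)    # shuffle so the split is not order-dependent
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

# Toy (x, y) pairs; the test set is held out and never used for training.
data = [(x, 2 * x) for x in range(10)]
train, test = train_test_split(data, test_fraction=0.2)
```

Fixing the seed makes the split reproducible, so two evaluation runs compare models on the same held-out samples.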

Gradient descent

Iterative optimization method that updates parameters by stepping along the negative gradient of the loss (the direction of steepest descent), scaled by a learning rate.
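
As a small illustration, the sketch below minimizes a one-parameter quadratic loss. The function name `gradient_descent`, the learning rate, and the step count are assumptions chosen for the example.

```python
def gradient_descent(grad, theta0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from theta0."""
    theta = theta0
    for _ in range(steps):
        theta = theta - learning_rate * grad(theta)
    return theta

# Example loss: L(theta) = (theta - 3)^2, whose gradient is 2 * (theta - 3).
# The minimum is at theta = 3, so the result should end up very close to 3.
theta_star = gradient_descent(lambda t: 2.0 * (t - 3.0), theta0=0.0)
```

With a learning rate that is too large (here, above 1.0) the same loop would overshoot and diverge, which is why the step size is treated as a tunable hyperparameter.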

Backpropagation

Algorithm that computes the gradient of the loss with respect to every network parameter by applying the chain rule backward, layer by layer.
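
The chain rule can be traced by hand on a tiny network. The sketch below assumes a hypothetical two-weight model, y = w2 * tanh(w1 * x) with a squared-error loss, and checks the hand-derived gradient against a finite-difference estimate.

```python
import math

def forward_backward(x, target, w1, w2):
    """Return the loss and its gradients with respect to w1 and w2."""
    # Forward pass: keep intermediate values for reuse in the backward pass.
    z = w1 * x
    h = math.tanh(z)
    y = w2 * h
    loss = 0.5 * (y - target) ** 2
    # Backward pass: apply the chain rule from the loss back to each weight.
    dloss_dy = y - target
    dloss_dw2 = dloss_dy * h
    dloss_dh = dloss_dy * w2
    dloss_dz = dloss_dh * (1.0 - h ** 2)   # derivative of tanh(z) is 1 - tanh(z)^2
    dloss_dw1 = dloss_dz * x
    return loss, dloss_dw1, dloss_dw2

def numerical_grad_w1(x, target, w1, w2, eps=1e-6):
    """Central finite-difference estimate of dloss/dw1, used only as a check."""
    loss_plus, _, _ = forward_backward(x, target, w1 + eps, w2)
    loss_minus, _, _ = forward_backward(x, target, w1 - eps, w2)
    return (loss_plus - loss_minus) / (2 * eps)

loss, grad_w1, grad_w2 = forward_backward(x=0.5, target=1.0, w1=0.8, w2=-1.2)
```

Comparing `grad_w1` with `numerical_grad_w1(0.5, 1.0, 0.8, -1.2)` is a common sanity check for hand-written gradients.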

Activation function

Nonlinear function applied to a neuron's weighted sum of inputs; without it, stacked layers collapse into a single linear map.

ReLU (Rectified Linear Unit)

Activation f(x) = max(0, x); fast to compute and has simple gradients.
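
A minimal scalar sketch of ReLU and its gradient. Returning 0 for the gradient at x = 0 is a convention assumed here, since the function is not differentiable at that point.

```python
def relu(x):
    """ReLU activation: pass positive inputs through, clamp the rest to zero."""
    return max(0.0, x)

def relu_grad(x):
    """Gradient of ReLU: 1 for positive inputs, 0 otherwise (0 at x == 0 by convention)."""
    return 1.0 if x > 0 else 0.0
```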

Sigmoid

S-shaped activation mapping inputs to (0, 1); commonly used for probabilities in binary tasks.
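
The sigmoid is sigma(x) = 1 / (1 + e^(-x)). The sketch below is one common way to evaluate it without overflow for large negative inputs; the branch on the sign of x is an implementation choice, not something the notes prescribe.

```python
import math

def sigmoid(x):
    """Numerically stable sigmoid: never exponentiates a large positive number."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)          # x < 0, so e is in (0, 1) and cannot overflow
    return e / (1.0 + e)
```

In floating point the output saturates to exactly 0.0 or 1.0 for extreme inputs, even though the mathematical range is the open interval (0, 1).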

Softmax

Activation that converts logits to a probability distribution over multiple classes.
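
A short sketch of softmax over a list of logits. Subtracting the maximum logit before exponentiating is a standard stability trick that does not change the result; the example logits are arbitrary.

```python
import math

def softmax(logits):
    """Convert a list of logits into probabilities that sum to 1."""
    m = max(logits)                               # shift for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])   # larger logits receive larger probabilities
```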

Neural network

Computational model built from layers of interconnected neurons; it learns by adjusting its weights and biases.

Deep neural network

Neural network with multiple hidden layers; depth lets it compose simple features into increasingly abstract ones, giving greater representational power.

Convolutional neural network (CNN)

Neural network using convolutional layers to capture spatial structure in images.

Spatial locality

CNNs capture local spatial relationships via small receptive fields.

Translation invariance / equivariance

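Convolution is translation-equivariant: because the same filter slides over the input, shifting a pattern shifts its response by the same amount. Pooling over positions then yields approximate translation invariance, so the pattern is detected regardless of where it appears.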
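
The difference between the two properties can be seen on a 1D signal. In this sketch the helper `correlate1d`, the kernel, and the signal are made-up examples: shifting the input shifts the filter response (equivariance), while the maximum over positions stays the same (invariance after global max pooling).

```python
def correlate1d(signal, kernel):
    """Valid-mode sliding dot product (cross-correlation, as used in CNN layers)."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]

kernel = [1, 0, -1]
signal = [0, 0, 1, 2, 1, 0, 0, 0]
shifted = [0] + signal[:-1]          # the same bump, moved one step to the right

out = correlate1d(signal, kernel)
out_shifted = correlate1d(shifted, kernel)

# Equivariance: the response moves one step to the right along with the input.
equivariant = out_shifted[1:] == out[:-1]
# Invariance: global max pooling gives the same value for both inputs.
invariant = max(out) == max(out_shifted)
```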

Parameter sharing

The same filter parameters are used at every image location, which greatly reduces the number of parameters compared with a fully connected layer.
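
To make the saving concrete, the sketch below counts parameters for a hypothetical layer that maps a 32x32 RGB image to 16 output channels of the same spatial size. The layer sizes and helper names are assumptions chosen for illustration.

```python
def conv_params(kernel_h, kernel_w, in_channels, out_channels):
    """Weights plus one bias per filter; independent of the image size."""
    return (kernel_h * kernel_w * in_channels + 1) * out_channels

def dense_params(in_h, in_w, in_channels, out_units):
    """Fully connected layer: every output unit has a weight per input value plus a bias."""
    return (in_h * in_w * in_channels + 1) * out_units

conv = conv_params(3, 3, in_channels=3, out_channels=16)
dense = dense_params(32, 32, in_channels=3, out_units=16 * 32 * 32)
```

Here `conv` is 448 while `dense` is 50,348,032, because the convolution reuses the same sixteen 3x3x3 filters at every position.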

Behavior cloning

Imitation-learning approach that trains a policy to reproduce the teacher's actions by supervised learning on demonstrated state-action pairs.
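
A minimal sketch of behavior cloning as ordinary regression. The 1D linear policy, the `expert` function, and the demonstration states are all hypothetical; a real system would typically fit a neural network instead.

```python
def fit_linear_policy(demonstrations):
    """Least-squares fit of action = gain * state + bias to (state, action) pairs."""
    n = len(demonstrations)
    mean_s = sum(s for s, _ in demonstrations) / n
    mean_a = sum(a for _, a in demonstrations) / n
    cov = sum((s - mean_s) * (a - mean_a) for s, a in demonstrations)
    var = sum((s - mean_s) ** 2 for s, _ in demonstrations)
    gain = cov / var
    bias = mean_a - gain * mean_s
    return lambda state: gain * state + bias

def expert(state):
    """Hypothetical teacher: steer back toward state 0."""
    return -0.5 * state

# Demonstrations are just labeled examples: states paired with the expert's actions.
demos = [(s, expert(s)) for s in [-2.0, -1.0, 0.0, 1.0, 2.0]]
policy = fit_linear_policy(demos)
```

The cloned `policy` only sees states the expert visited, which is the root of the covariate-shift problem that iterative methods try to address.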

Imitation learning

Learning a policy from expert demonstrations (state-action pairs) to imitate behavior.

Demonstrations

Recorded state-action sequences provided by a teacher or expert.

Dataset Aggregation (DAgger)

Iterative imitation-learning method that runs the student policy (optionally mixed with the teacher's), has the teacher label the states the student visits, and retrains on the aggregated data to reduce covariate shift.
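
The loop can be sketched on a toy problem. Everything here is an assumption made for illustration: a 1D integer world, an expert that steps toward 0, a lookup-table learner that does nothing in unseen states, and rollouts driven purely by the student (no mixing with the teacher).

```python
def expert_action(state):
    """Hypothetical expert: take one step toward the goal state 0."""
    if state > 0:
        return -1
    if state < 0:
        return 1
    return 0

def train(dataset):
    """Supervised learning by memorization: look up the labeled action, else do nothing."""
    table = dict(dataset)
    return lambda state: table.get(state, 0)

def rollout(policy, start, horizon=8):
    """Run `policy` from `start` and return the list of states it visits."""
    state, visited = start, []
    for _ in range(horizon):
        visited.append(state)
        state = state + policy(state)
    return visited

def dagger(start_states, iterations=5):
    # Start from plain behavior cloning on one expert demonstration from state 2.
    dataset = [(s, expert_action(s)) for s in rollout(expert_action, start=2)]
    policy = train(dataset)
    for _ in range(iterations):
        for start in start_states:
            # The student drives; the expert only labels the states the student reached.
            for s in rollout(policy, start):
                dataset.append((s, expert_action(s)))
        policy = train(dataset)          # retrain on the aggregated dataset
    return policy

cloned_only = train([(s, expert_action(s)) for s in rollout(expert_action, start=2)])
student = dagger(start_states=[-4, 4])
```

In this toy setting `cloned_only` stays stuck when started at -4, a state the demonstration never covered, while `student` has collected expert labels along its own trajectories and reaches 0.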

Multimodality

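Situations where several distinct actions are equally appropriate; a policy that averages them can output an action that matches none, so the policy must represent multiple modes (e.g., by being stochastic).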
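
A small sketch of why averaging fails. The steering scenario is hypothetical: half of the demonstrations swerve left (-1) around an obstacle and half swerve right (+1). A squared-error regressor predicts the mean, which drives straight ahead, whereas sampling from the demonstrated distribution always returns one of the valid modes.

```python
import random

# Hypothetical steering demonstrations at the same state: left (-1.0) or right (+1.0).
demo_actions = [-1.0] * 50 + [1.0] * 50

# A deterministic policy trained with squared error converges to the mean action.
mean_action = sum(demo_actions) / len(demo_actions)

def sample_action(rng):
    """Stochastic policy sketch: draw an action from the demonstrated distribution."""
    return rng.choice(demo_actions)

rng = random.Random(0)
samples = [sample_action(rng) for _ in range(10)]
```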

Diffusion policy

A method that uses a diffusion model to generate actions by iteratively denoising random noise, which lets it sample from a multimodal action distribution.

Kinesthetic teaching

Physically guiding a robot by hand to demonstrate movements.

Teleoperation

Remote control of a robot to collect demonstrations via a human operator.

Feature maps

Intermediate representations produced by CNN layers that reveal detected features (e.g., edges).
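
As a rough illustration, the sketch below slides a hand-picked vertical-edge filter over a tiny image. The image, the filter values, and the helper `correlate2d` are assumptions; in a trained CNN the filters are learned rather than chosen by hand.

```python
def correlate2d(image, kernel):
    """Valid-mode 2D sliding dot product over a list-of-lists image."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]

# 4x4 image: dark left half (0), bright right half (1).
image = [[0, 0, 1, 1] for _ in range(4)]

# Responds where brightness increases from left to right.
vertical_edge = [[-1, 1],
                 [-1, 1]]

feature_map = correlate2d(image, vertical_edge)
```

The resulting feature map is nonzero only in the middle column, which is where the dark-to-bright edge sits in the image.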