Reinforcement Learning Vocabulary Flashcards

0.0(0)

Studied by 0 people

View linked note

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/19

Earn XP

Description and Tags

20 vocabulary flashcards focused on reinforcement learning, MDPs, Q-learning, and related concepts from the lecture notes.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

20 Terms

New cards

Machine Learning

A subset of artificial intelligence that allows machines to learn automatically and improve from experience without explicit programming.

New cards

Supervised Learning

A category of machine learning where models are trained using labeled data.

New cards

Unsupervised Learning

A category of machine learning where models infer patterns from unlabeled data.

New cards

Reinforcement Learning

A type of machine learning where an agent learns to behave in an environment by taking actions and observing results.

New cards

Agent

The RL component that learns from trial and error.

New cards

Environment

The world through which the agent moves and interacts.

New cards

Action

Any permissible move the agent can take in a given state.

New cards

State

The current condition or situation returned by the environment.

New cards

Reward

The instantaneous feedback from the environment evaluating the last action.

New cards

Policy

The strategy the agent uses to decide the next action based on the state.

New cards

Value

The expected long-term return with discount applied, contrasting with immediate reward.

New cards

Action-value (Q)

A value function that also accounts for the current action, Q(s,a).

New cards

Markov Decision Process (MDP)

The mathematical framework for modeling decision making in RL with states, actions, and rewards.

New cards

Graph

A network of nodes connected by edges used to model relationships, such as rooms and doors.

New cards

Node

A state in a graph, e.g., a room.

New cards

Edge

A connection between two nodes, e.g., a door linking rooms.

New cards

Door

A two-way link between rooms that enables movement.

New cards

Instant Reward

The reward value attached to a single transition (arrow) between states.

New cards

Q-Learning

A reinforcement learning algorithm that learns Q-values for state-action pairs from experience.

New cards

Gamma (Γ)

The discount factor in Q-learning (0 to 1) that weighs future rewards versus immediate rewards.