AI Agents, Planning, Logic, and Search - Lecture Notes


Description and Tags

Vocabulary flashcards covering AI agents, planning, search, logic, and planning formalisms from the notes.


114 Terms

1. Standard Model (AI)

The idea that AI research aims to build agents that act rationally to achieve a given objective; the model is complicated by limited rationality (computational limits on finding optimal solutions) and by the value alignment problem.

2. Limited rationality

The optimal solution may be computationally intractable in complex environments, so agents settle for good-enough options.

3. Value Alignment Problem

The challenge of aligning an agent's objective with human values so that it acts in humans' best interests.

4. Agent

An entity that perceives the environment through sensors and acts on it through actuators; capable of autonomy over time.

5. Rational/Intelligent Agent

An agent whose actions aim to maximize its expected performance, given the evidence from its percepts and its built-in knowledge.

6. Agent function

The abstract mapping from percept sequences to actions that realizes the agent's objective.

7. Agent program

The concrete implementation of the agent function in software.

8. Simple reflex agent

An agent that acts only on the current percept, ignoring the rest of the percept history.

9. Model-based reflex agent

An agent that maintains internal state and uses a transition model and a sensor model to decide how to act.

10. Internal state

Percept history, and information derived from it, kept to infer the current state of the environment.

11. Transition model

A model of how the agent's actions change the world and how the world evolves independently of the agent.

12. Sensor model

A model of how the state of the world is reflected in the agent's percepts.

13. Goal-based agent

An agent that reasons about the future outcomes of actions with respect to a goal.

14. Utility-based agent

An agent that chooses actions to maximize a measure of how desirable a state is.

15. Utility function

The agent's internal performance measure, assigning a value to states to reflect their desirability.

16. Learning Agent

An agent that improves its performance over time by learning from experience.

17. Learning element

The component responsible for making improvements based on experience.

18. Performance Measure

A numeric measure of how desirable the outcome of the agent's actions is.

19. Performance element

The part of a learning agent that selects actions based on its percepts in order to achieve its goals.

20. Critic

The module that tells the learning element how well the agent is doing with respect to a fixed performance standard.

21. Problem generator

The module that suggests exploratory actions to gain informative experiences.

22. Problem-solving agent

An agent that plans ahead by considering a sequence of actions that leads to a goal.

23. Planning agents

Agents that use structured representations of states and actions to plan.

24. Agent architecture

The computing device on which the agent program runs.

25. Autonomy

The ability to act based on one's own percepts and learning rather than relying only on built-in prior knowledge.

26. PEAS framework

Performance measure, Environment, Actuators, Sensors: the four elements used to define a task environment (e.g., for an automated taxi: safe and fast driving; roads and traffic; steering, accelerator, and brakes; cameras and GPS).

27. Observable environment

A fully observable environment: the agent's sensors give access to the complete state of the environment.

28. Partially observable environment

The agent cannot observe the entire state of the environment at once.

29. Multiagent

An environment containing multiple agents with competitive or cooperative goals.

30. Deterministic

The next state is completely predictable based on the current state and the agent's action.

31. Sequential

Current decisions affect future states and decisions (opposite of episodic).

32. Static

The environment does not change while the agent is deliberating (opposite of dynamic).

33. Semi-dynamic

The environment itself does not change with time, but the agent's performance score does (e.g., a chess clock running down).

34. Discrete

A finite set of distinct states, percepts, and actions (opposite of continuous).

35. Known

The outcomes of all actions are known to the agent.

36. Informed search

Search strategies that use domain knowledge to guide exploration.

37. Problem formulation

The process of deciding which states and actions to consider, given a goal.

38. State space

The set of all possible states the environment can be in.

39. State

A representation of the environment at a point in time.

40. Initial state

The starting point of the search.

41. Transition model

A specification of how actions change states.

42. Goal test

The criterion for determining whether a state satisfies the goal.

43. Goal state

A state that satisfies the goal condition.

44. Action cost function

The cost of applying an action in a state, used to evaluate paths.

45. Touring problems

Problems in which a set of locations must be visited, rather than a single goal reached.

46. Search tree

A structure representing the possible action sequences from the initial state toward the goal.

47. Search Node

A node containing a state, its parent, the generating action, the path cost, and the depth.

48. Frontier

The set of nodes that have been generated but not yet expanded; the candidates for expansion in the next step.

49. Queue types

Frontier queue types include the priority queue, the FIFO queue, and the LIFO queue (stack).

50. Reached

The set of states (or nodes) already generated during the search, kept to avoid revisiting them.

51. Best-first search

Expands the frontier node with the best (lowest) evaluation function value f(n).
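A minimal sketch of best-first graph search, assuming a hypothetical problem object with initial_state, is_goal, actions, result, and action_cost (these names are illustrative, not from the notes):

import heapq
import itertools

def best_first_search(problem, f):
    """Best-first graph search; f(state, g) ranks frontier entries."""
    counter = itertools.count()              # tie-breaker so the heap never compares states
    frontier = [(f(problem.initial_state, 0), next(counter),
                 problem.initial_state, 0)]  # entries: (priority, tie, state, path cost g)
    reached = {problem.initial_state: 0}     # cheapest g found so far per state
    while frontier:
        _, _, state, g = heapq.heappop(frontier)
        if problem.is_goal(state):
            return state, g                  # goal state and its path cost
        for action in problem.actions(state):
            s2 = problem.result(state, action)
            g2 = g + problem.action_cost(state, action, s2)
            if s2 not in reached or g2 < reached[s2]:
                reached[s2] = g2
                heapq.heappush(frontier, (f(s2, g2), next(counter), s2, g2))
    return None                              # frontier exhausted: no solution

Different choices of f recover several of the strategies named in the following cards.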

52. Graph search

Search that avoids revisiting states by tracking which states have already been reached (in contrast to tree search).

53. Uninformed search

Blind search strategies that use no domain knowledge beyond the problem definition.

54. Breadth-first search

Expands the shallowest nodes first, using a FIFO frontier.

55. Complete

The algorithm is guaranteed to find a solution whenever one exists.

56. Time/space complexity

Measures of the computational resources required by a search algorithm.

57. Uniform-cost search / Dijkstra's

Best-first search with f equal to the path cost g(n); complete and cost-optimal when action costs are nonnegative (completeness requires costs bounded above zero).
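In terms of the sketch above, uniform-cost search is best-first search ranked by accumulated path cost alone:

# Uniform-cost search: rank frontier nodes purely by path cost g.
result = best_first_search(problem, lambda state, g: g)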

58. Depth-first search

Expands the deepest node first, using a LIFO frontier (stack); not cost-optimal, and not complete in infinite state spaces.

59. Bidirectional search

Searches from both the start and the goal, aiming to meet in the middle.

60. Heuristic

An estimate of the cost of reaching a goal from a state, used to guide informed search.

61. Greedy Best-First Search

Expands the node with the lowest heuristic value h(n); not guaranteed to be optimal.

62. A* search

Uses f(n) = g(n) + h(n); cost-optimal if h is admissible, and optimal on graph search if h is consistent.
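With the earlier best_first_search sketch, A* and greedy search differ only in the choice of f; here h is a hypothetical, problem-specific heuristic function:

# A* search: rank frontier nodes by g + h (h assumed admissible).
result = best_first_search(problem, lambda state, g: g + h(state))

# Greedy best-first search, for comparison, ignores g entirely.
result = best_first_search(problem, lambda state, g: h(state))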

63. Admissible heuristic

Never overestimates the true cost of reaching the goal: h(n) ≤ h*(n), where h* is the optimal cost from n.

64. Consistent heuristic

Satisfies h(n) ≤ c(n, a, n′) + h(n′) for every successor n′ of n reached by action a; consistency implies admissibility.

65. Weighted A*

A* with the heuristic weighted, f(n) = g(n) + W·h(n) with W > 1, favoring faster solutions at the cost of optimality.

66. Effective branching factor (b*)

The branching factor a perfectly balanced tree of the solution depth would need in order to contain the same number of nodes as the search generated.
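Concretely, following the standard textbook definition: if a search generates N nodes and finds a solution at depth d, b* is the solution of

N + 1 = 1 + b* + (b*)^2 + ... + (b*)^d

For example, if A* generates 52 nodes and finds a solution at depth 5, then b* ≈ 1.92.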

67. Relaxed problem

An easier version of a problem, used to derive an admissible heuristic.

68. Adversarial search

Search in competitive environments where opponents have conflicting goals.

69. Deterministic games

Games with no chance element, typically studied with perfect information and zero-sum payoffs.

70. MAX and MIN

The two opponents in adversarial search: MAX seeks to maximize the game value and MIN seeks to minimize it.

71. Minimax

A recursive strategy that backs up values from the leaves of the game tree to decide the best move at the root.
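A compact sketch of the minimax recursion, assuming a hypothetical game object with is_terminal, utility, actions, and result (illustrative names, not from the notes):

def minimax_value(game, state, to_move):
    """Back up minimax values; to_move is 'MAX' or 'MIN'."""
    if game.is_terminal(state):
        return game.utility(state)
    successor_values = [
        minimax_value(game, game.result(state, a),
                      'MIN' if to_move == 'MAX' else 'MAX')
        for a in game.actions(state)
    ]
    return max(successor_values) if to_move == 'MAX' else min(successor_values)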

72. Alpha-beta pruning

Prunes branches that cannot yield a better outcome than paths already explored, using bounds on the best values found so far for MAX (α) and MIN (β).
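The same recursion with α-β bounds added, a sketch under the same assumed game interface:

def alphabeta(game, state, to_move, alpha=float('-inf'), beta=float('inf')):
    """Minimax with alpha-beta pruning; alpha/beta bound the achievable value."""
    if game.is_terminal(state):
        return game.utility(state)
    if to_move == 'MAX':
        v = float('-inf')
        for a in game.actions(state):
            v = max(v, alphabeta(game, game.result(state, a), 'MIN', alpha, beta))
            alpha = max(alpha, v)
            if alpha >= beta:
                break                    # MIN would never allow this branch
        return v
    else:
        v = float('inf')
        for a in game.actions(state):
            v = min(v, alphabeta(game, game.result(state, a), 'MAX', alpha, beta))
            beta = min(beta, v)
            if alpha >= beta:
                break                    # MAX would never choose this branch
        return v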

73. Transposition table

A cache that stores evaluated game states to avoid re-searching identical positions reached by different move orders.

74. Type A vs Type B strategies

Type A searches wide but shallow; Type B searches deep but narrow.

75. Heuristic evaluation function EVAL(s, p)

Estimates the expected utility of position s for player p.

76. Expectimax search

Models the average case by introducing chance (EXPECT) nodes in place of MIN nodes.

77. Expectiminimax

Extends expectimax to stochastic two-player games, with MIN, MAX, and EXPECT nodes.
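At a chance node, the backed-up value is the probability-weighted average of its outcomes:

value(s) = Σ P(r) · value(result(s, r))

where r ranges over the possible random outcomes (e.g., dice rolls) and P(r) is the probability of each.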

78. Monte Carlo tree search

Estimates state value by averaging the results of many simulated playouts.

79. Playout policy

The policy used to simulate moves in Monte Carlo search.

80. Selection policy

The strategy for focusing computation on important parts of the game tree.
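A standard selection policy from the MCTS literature (not spelled out in the notes) is UCB1, which selects the child n maximizing

UCB1(n) = U(n)/N(n) + C · sqrt(ln N(Parent(n)) / N(n))

where U(n) is the total utility of playouts through n, N(n) is the number of playouts through n, and C is an exploration constant (often around √2).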

81. Knowledge-based agents

Agents that use reasoning over a knowledge base to make decisions.

82. Knowledge Base (KB)

A set of sentences in a knowledge representation language.

83. Knowledge Representation Language

A formal language for encoding knowledge, such as propositional or first-order logic.

84. Axiom

A sentence accepted as true without proof within the KB.

85. TELL / ASK operations

TELL adds sentences to the KB; ASK queries what the KB knows.

86. Syntax and Semantics

Syntax specifies the structure of well-formed sentences; semantics specifies their truth conditions.

87. Inference

Deriving new sentences from existing ones in the KB.

88. Declarative vs Procedural

A declarative representation states what is true; a procedural one specifies how to act.

89. Wumpus World

A grid world with a wumpus, pits, and gold; sensor cues and available actions shape the agent's planning problem.

90. Propositional Logic

Logic without variables, in which sentences are built from propositional symbols.

91. Model, satisfiability, entailment

A model assigns truth values to symbols; a formula is satisfiable if some model makes it true; entailment means one sentence follows from another.

92. Entailment α |= β

Sentence α semantically entails β if every model of α is also a model of β.
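A small worked example: P ∧ Q |= P, because every model that makes P ∧ Q true also makes P true; the converse fails, since a model with P true and Q false satisfies P but not P ∧ Q.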

93. Grounding

The link between logical reasoning and real-world perception and action.

94. Propositional atoms, literals, clauses

Atoms are the basic symbols; literals are atoms or their negations; clauses are disjunctions of literals.

95. CNF (Conjunctive Normal Form)

A formula expressed as a conjunction of disjunctions of literals.
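For example, (A ∨ ¬B) ∧ (B ∨ C) ∧ ¬D is in CNF: each parenthesized disjunction, and the unit clause ¬D, is a clause.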

96. Resolution

Inference rule that derives new clauses by eliminating a complementary literal pair.
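A worked instance: from the clauses (A ∨ B) and (¬B ∨ C), resolving on the complementary pair B and ¬B yields the resolvent (A ∨ C).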

97. Definite clause

A disjunction with exactly one positive literal, used in forward chaining.

98. Horn clause

A disjunction with at most one positive literal; supports efficient forward chaining.

99. Forward chaining

Data-driven inference that adds conclusions as rules fire; runs in time linear in the size of the KB.
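A naive sketch of forward chaining over definite clauses, each represented here as a (premises, conclusion) pair; this fixed-point version is quadratic in the worst case, while the linear-time algorithm adds agenda-and-counter bookkeeping:

def forward_chaining(clauses, facts, query):
    """Forward chaining for definite clauses.

    clauses: list of (premises, conclusion) pairs, e.g. (["P", "Q"], "R")
    facts:   set of atoms known to be true
    query:   atom to prove
    """
    inferred = set(facts)
    changed = True
    while changed:                       # fire rules until nothing new is added
        changed = False
        for premises, conclusion in clauses:
            if conclusion not in inferred and all(p in inferred for p in premises):
                inferred.add(conclusion)     # rule fires: add its conclusion
                if conclusion == query:
                    return True
                changed = True
    return query in inferred

# Example: from P, P => Q, and (P and Q) => R, conclude R.
print(forward_chaining([(["P"], "Q"), (["P", "Q"], "R")], {"P"}, "R"))  # True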

100. Backward chaining

Goal-driven inference that works backward from a query to find supporting facts.