1/76
Final Exam KSU
Name | Mastery | Learn | Test | Matching | Spaced |
|---|
No study sessions yet.
How do knowledge-based agents differ from the problem-solving agents discussed in earlier chapters?
They reason over an internal knowledge base to decide actions, rather than having limited, inflexible knowledge of actions and outcomes.
What is the central component of a knowledge-based agent?
Its knowledge base, or KB, which is a set of sentences representing assertions about the world.
In the context of a knowledge base, what does the 'TELL' operation do?
It adds new knowledge (sentences) to the knowledge base.
What is the purpose of the 'ASK' operation in a knowledge-based agent?
It queries the knowledge base to determine what is known, often to decide on an action.
The process of deriving new knowledge from existing knowledge in a KB is called _.
inference
What are the three main steps a knowledge-based agent program performs each time it is called?
What is the 'declarative approach' to building a knowledge-based agent?
It involves starting with an empty knowledge base and TELLing it sentences one by one until it knows how to operate.
In the Wumpus World, what perception is associated with squares adjacent to a pit?
The agent perceives a Breeze.
What perception indicates that gold is in the same square as the agent in Wumpus World?
The agent perceives a Glitter.
What are the properties of the Wumpus World environment in terms of observability, determinism, and static/dynamic nature?
It is partially observable, deterministic, sequential (not episodic), static, discrete, and single-agent.
In logic, what is the 'semantics' of a language?
It defines the 'meaning' of sentences by specifying their truth with respect to each possible world, or model.
What does it mean for a sentence $α$ to logically entail a sentence $β$ (written as $α |= β$)?
It means that in every model in which $α$ is true, $β$ is also true.
An inference algorithm that derives only entailed sentences is called _.
sound or truth-preserving
What property does a 'complete' inference algorithm have?
It can derive any sentence that is logically entailed by the knowledge base.
In propositional logic, what is an 'atomic sentence'?
It consists of a single proposition symbol that can be either true or false.
List the five common logical connectives used in propositional logic.
Negation ($¬$), conjunction ($∧$), disjunction ($∨$), implication ($⇒$), and biconditional ($⇔$).
In propositional logic, if P is 'Today is Wednesday', what does $P ⇒ Q$ mean when P is false?
The implication is true, regardless of the truth value of Q.
What is the primary limitation of propositional logic that first-order logic addresses?
Propositional logic has limited expressive power and cannot generalize facts, such as 'pits cause breezes in adjacent squares', without creating a sentence for each square.
What are the two standard quantifiers used in first-order logic?
The universal quantifier ($∀$, 'for all') and the existential quantifier ($∃$, 'for some').
In first-order logic, what is the purpose of a Predicate?
A Predicate stands for a relation or property of objects, such as Student(x).
How can the entailment $α |= β$ be tested using model checking?
By testing the unsatisfiability of the sentence $α ∧ ¬β$.
What is the core idea behind theorem proving as an inference method?
It applies rules of inference directly to sentences in the KB to construct a proof without consulting models.
What is the rule of inference known as Modus Ponens?
If the knowledge base contains $α$ and $α ⇒ β$, then $β$ can be inferred.
What is the And-Elimination rule of inference?
If the knowledge base contains $α ∧ β$, then both $α$ and $β$ can be inferred individually.
Describe the process of forward chaining for inference.
It starts with known facts and adds conclusions of implications to the KB whenever all premises are known, continuing until the query is derived.
Forward chaining is an example of what general concept of reasoning?
Data-driven reasoning.
Describe the process of backward chaining for inference.
It works backward from the query, finding implications that conclude the query and then trying to prove their premises.
Backward chaining is a form of _-directed reasoning.
goal
The Davis–Putnam algorithm (DPLL) is an efficient backtracking search for model checking that embodies what three improvements?
Early termination, the pure symbol heuristic, and the unit clause heuristic.
In the DPLL algorithm, what is a 'pure symbol'?
A symbol that appears with only one sign (either always positive or always negative) in all clauses of a sentence.
What is the definition of a 'unit clause' in the context of the DPLL algorithm?
A clause with just one literal, or a disjunction with one literal and many falses, which forces a specific value assignment.
What is the task of 'classical planning'?
Finding a sequence of actions to accomplish a goal in a discrete, deterministic, static, and fully observable environment.
What is PDDL?
Planning Domain Definition Language, a standard language used to describe automated planning problems.
In PDDL, how is a state represented?
As a conjunction of ground atomic fluents.
In PDDL, what is an 'action schema' composed of?
The action name, a list of variables, a precondition, and an effect.
What two components are added to a PDDL domain definition to specify a particular problem?
An initial state and a goal.
In the context of planning algorithms, what is 'forward search'?
A search that starts from the initial state and moves through the state space by applying applicable actions, looking for a goal state.
How does 'backward search' for planning work?
It starts at the goal and applies actions in reverse, searching for a sequence of steps that reaches the initial state.
The ignore-delete-lists heuristic simplifies a planning problem by removing all _ literals from action effects.
negative
What does the 'ignore-preconditions' heuristic do to simplify a planning problem?
It drops all preconditions from actions, making every action applicable in every state.
What is reinforcement learning (RL)?
A type of learning where an agent interacts with an environment, receiving rewards or punishments, and learns to maximize cumulative rewards.
In a Markov decision process (MDP), what does the transition model $P(s' | a, s)$ represent?
The probability of reaching state s' if action a is taken in state s.
What is a 'policy' ($π$) in reinforcement learning?
A function that specifies what action an agent should take for any given state.
What is an 'optimal policy' ($π*$)?
A policy that yields the highest expected utility.
In reinforcement learning, what is the purpose of the discount factor $γ$?
It describes an agent's preference for current rewards over future rewards, decaying the value of rewards based on how far in the future they are.
What is the main difference between model-based and model-free reinforcement learning?
Model-based RL uses a transition model of the environment to make decisions, while model-free RL learns how to behave directly without building a model.
Contrast passive and active reinforcement learning.
In passive RL, the agent executes a fixed policy to evaluate it, while in active RL, the agent decides which actions to take to learn an optimal policy.
What is 'direct utility estimation' in passive reinforcement learning?
A method where the utility of a state is estimated by averaging the observed reward-to-go from that state over all trials.
What is the core idea of temporal-difference (TD) learning?
It uses observed transitions to incrementally adjust the utility estimates of observed states based on the utility of the successor state.
The TD update rule uses the difference between the utility and the utility.
expected, observed
What is the fundamental tradeoff an active reinforcement learning agent must manage?
The tradeoff between exploitation (using known information to maximize reward) and exploration (trying new actions to gain information).
What does the action-utility function $Q(s, a)$ in Q-learning represent?
The expected total discounted reward if the agent takes action 'a' in state 's' and acts optimally thereafter.
How does a Q-learning agent select an action in a given state 's' once it has learned the Q-function?
It chooses the action 'a' that maximizes $Q(s, a)$.
What is supervised learning?
A learning paradigm where an agent learns a function that maps from input to output by observing a training set of input-output pairs.
What is the most common task in unsupervised learning?
Clustering, which involves detecting potentially useful groups of input examples without explicit labels.
In supervised learning, the function $h$ that a learning algorithm discovers is called a , and it is drawn from a space H.
hypothesis, hypothesis
Define 'bias' in the context of machine learning models.
Bias is the systematic error caused by a model making incorrect assumptions about the data.
Define 'variance' in the context of machine learning models.
Variance is the amount a model's prediction changes due to fluctuations in the training data.
What is the bias-variance tradeoff?
A choice between more complex, low-bias models that fit training data well and simpler, low-variance models that may generalize better.
What is the main difference between a regression problem and a classification problem?
Regression predicts a continuous valued output, while classification predicts a discrete valued output.
What is the objective of linear regression?
To find the weights of a linear function that best fits the data, typically by minimizing the sum of squared errors.
What is the purpose of the gradient descent algorithm?
To find the optimal parameters (weights) of a model by iteratively moving in the direction of the steepest descent of the loss function.
What type of function does logistic regression use to model the probability of a discrete outcome?
A logistic function, also known as a sigmoid function.
What is a decision tree in machine learning?
A model that uses a tree-like structure of if-else conditions to map a vector of attribute values to an output value.
The LEARN-DECISION-TREE algorithm uses a greedy strategy that always tests the most _ attribute first.
important
What distinguishes a nonparametric model from a parametric model?
A nonparametric model cannot be characterized by a bounded set of parameters; its complexity can grow with the amount of training data.
How does a K-nearest-neighbors (KNN) model make a prediction for a classification task?
It finds the 'k' nearest examples to the query point in the training data and predicts the most common output value among them.
What is the core principle of a Support Vector Machine (SVM)?
It constructs a maximum margin separator, which is a decision boundary with the largest possible distance to the nearest example points of any class.
What is the basic idea of deep learning that allows it to be more expressive than shallow models?
It uses models with long computation paths (deep circuits), allowing input variables to interact in complex ways.
In a neural network, what is a 'unit'?
A node that calculates the weighted sum of its inputs and then applies a nonlinear activation function to produce an output.
What are the three main types of layers in a typical feedforward neural network?
The input layer, hidden layer(s), and the output layer.
What is the purpose of the back-propagation algorithm in training neural networks?
It computes the gradient of the loss function with respect to each weight in the network, allowing weights to be updated via gradient descent.
What type of data are Convolutional Neural Networks (CNNs) primarily designed to process?
Images, by exploiting properties like local spatial invariance.
In a CNN, what is a 'kernel'?
A pattern of weights (a filter) that is replicated and applied across multiple local regions of an image to extract features.
What is the function of a 'pooling layer' in a CNN?
It summarizes a set of adjacent units from a preceding layer with a single value, performing down-sampling.
What distinguishes Recurrent Neural Networks (RNNs) from feedforward networks?
RNNs allow cycles in their computation graph, which gives them internal state or memory to process sequential data.
What type of data are Recurrent Neural Networks (RNNs) especially well-suited for?
Language and other sequential data, where information from previous time steps is relevant to the current one.