Theory of Computation – Automata & Computability Comprehensive Notes

Computation and Computability

Computation (in Theory of Computation – TOC) is any task that can be executed by a calculator or computer. In TOC we build an abstract machine and ask three meta-questions:

Can the machine perform the task?
For which problems does the machine succeed?
Where are its limitations?

Why Study Computability Theory?

• Limits of computation – some problems are inherently unsolvable by any algorithm.
• Classifying problems – we distinguish:
– Decidable (algorithm exists).
– Undecidable (no algorithm can ever exist).
• Programming-language foundations – grammars, compilers, interpreters rely on formal models from computability.
• Algorithmic guidance – we only optimise after we confirm a problem is solvable.
• Comparing computational models – finite automata, pushdown automata, Turing machines.
• Bridge to complexity theory – once we know a problem is computable we ask how much time/space it costs.
• Ethical & practical implication – prevents wasting resources on the impossible and underpins reliability proofs in safety-critical systems.

Mathematical Modelling and Automata

Natural language is ambiguous; mathematics gives precision, abstraction and formal reasoning.
Automata are the simplest such models. Each automaton is a triple of:

States – finite memory.
Alphabet $\Sigma$ – finite set of input symbols.
Transitions $\delta$ – rules mapping (state,input) → state.
They model sequential devices (switches, vending machines), software (lexers, parsers) and hardware (sequential circuits).

Basics of Automata Theory

• Study of abstract machines (automata) and the languages they recognise.
• Central notions:

Symbol – single character, digit, icon, etc.
Alphabet $\Sigma$ – finite, non-empty set of symbols, e.g. $\Sigma = {0,1}$ or ASCII.
String / word – finite sequence of symbols from $\Sigma$ . Example: $01110$ .
Empty string $\epsilon$ – length $0$ .
Length $|w|$ – number of symbols in string $w$ .
Powers of an alphabet:
$\Sigma^k$ = set of strings of length $k$ .
$\Sigma^0 = {\epsilon}$ , $\Sigma^1 = \Sigma$ .
Kleene star $\Sigma^* = \bigcup_{k\ge0} \Sigma^k$ – all finite strings.
$\Sigma^+ = \Sigma^* \setminus {\epsilon}$ .
Concatenation: if $x=a1\dots am$ and $y=b1\dots bn$ then $xy=a1\dots am b1\dots bn$ .

Languages and Problems

A language $L$ over $\Sigma$ is any subset $L \subseteq \Sigma^*$ . Examples:

${\epsilon,01,0011,000111,\dots}$ – $n$ zeros followed by $n$ ones.
Equal numbers of $0$ ’s and $1$ ’s.
Binary representations of prime numbers.
Languages can be finite (e.g., all binary strings of length $2$ ) or infinite (e.g., all strings starting with $a$ ).

Problem-view: Given $w \in \Sigma^*$ , decide whether $w \in L$ . A machine that solves this decision problem *recognises* $L$ .

Illustrative Finite State Machines

ON/OFF Switch

States ${ON,OFF}$ , alphabet ${push}$ , start $OFF$ , toggling transition. Demonstrates the minimal concept of state memory.

Coffee Vending Machine

States: $S0$ (idle), $S1$ (coin), $S_2$ (dispense). Alphabet ${insert_coin, select_coffee}$ . Transition table included in transcript. Shows how automata encode interaction order.

Finite Automata (FA)

Formal 5-tuple ${Q,\Sigma,q0,F,\delta}$ • $Q$ – finite state set. • $\Sigma$ – input alphabet. • $q0 \in Q$ – start state.
• $F \subseteq Q$ – accepting states.
• $\delta : Q \times \Sigma \to Q$ – transition function.

Types:

With output – Moore & Mealy machines.
Without output – DFA, NFA, $\epsilon$ -NFA.

Deterministic Finite Automata (DFA)

• Exactly one next state for every $q$ and $a \in \Sigma$ ; no $\epsilon$ moves.

Processing a String

Given $w = a1a2\dots an$ , start in $q0$ . Recursively compute $qi = \delta(q{i-1},ai)$ . If $qn \in F$ → accept, else reject. Language of the DFA = all accepted strings.

Example: Substring $01$

Language $L = {x01y \mid x,y \in {0,1}^*}$ . DFA states represent "how close" we are to having seen 01:
• $q0$ – none seen. • $q2$ – last symbol was 0.
• $q1$ – have already seen 01 (accepting). Transitions built accordingly; full tuple $A = ({q0,q1,q2},{0,1},q0,{q1},\delta)$ .

Common DFA Exercises

Strings ending with $a$ (2 states).
$|w| \ge 2$ over ${a,b}$ (accept all of length ≥2).
$|w| \le 2$ .
$|w| = 2$ .
Minimal DFA for even-length binary strings (two-state modulo-2 counter).

Nondeterministic Finite Automata (NFA)

• $\delta : Q \times \Sigma \to 2^Q$ – may return set of states (incl. empty set).
• Machine branches; conceptually explores all paths in parallel.

Guess & Verify Paradigm

– Guess (branch) at nondeterministic choice points.
– Verify deterministically along each branch.
Input is accepted if any branch ends in $F$ after whole input.

Formal Definition

Tuple ${Q,\Sigma,q_0,F,\delta}$ with set-valued $\delta$ .

Example NFA for Strings Ending in $01$

Adds nondeterministic "try now" arc so machine can jump into a two-state checker when it guesses the final 01 has started. Threads that get stuck simply die; acceptance needs one live accepting thread.

$\epsilon$ -NFA (NFA with Free Moves)

Special transitions labelled $\epsilon$ allow state changes without consuming input.

Acceptance: There exists a path consisting of input symbols plus any number of $\epsilon$ moves from $q_0$ to some $f \in F$ .

Processing Example “ba”

Start $q0 \xrightarrow{\epsilon} q1$ , read $b$ to stay in $q1$ , read $a$ to reach ${q1,q2}$ → $q2$ is final → accept.

Eliminating $\epsilon$ Moves

Compute $\epsilon$ -closure( $q$ ) – all states reachable from $q$ via only $\epsilon$ .
For each state $q$ and symbol $a$ :
– take $\epsilon$ -closure( $q$ );
– follow $a$ transitions;
– take $\epsilon$ -closure$ again → new transition.
A state becomes final in the new NFA if its \epsilon $-closure contains an old final. Worked example with states$ {q0,q1,q_2} $and alphabet$ {a} $given in transcript.</li></ol><h3 id="be5b703e-e4e8-419b-a4d5-32388f928f11" data-toc-id="be5b703e-e4e8-419b-a4d5-32388f928f11" collapsed="false" seolevelmigrated="true">Converting NFA to DFA – Subset (Powerset) Construction</h3>Every NFA has an equivalent DFA (same language) even though NFA may appear more powerful. Algorithm:<ol><li>DFA states = subsets of NFA states ($ 2^{|Q|} $possible).</li><li>Start state =$ \epsilon $-closure($ q_0^{NFA} $).</li><li>For each DFA state$ S $and input$ a $: $ \delta{DFA}(S,a) = \epsilon $-closure$ \Big(\bigcup{q\in S} \delta_{NFA}(q,a)\Big) $.</li><li>DFA accepting states = those subsets containing at least one NFA final state.</li></ol>Tiny sample: NFA with$ q0 \xrightarrow{a} q0, q1; $q1$ accepting. DFA has two states A={q0} $(non-final) and$ B={q0,q_1} $(final) with transition$ A\xrightarrow{a}B, \; B\xrightarrow{a}B $.<h3 id="a6f906e9-f84b-4131-a1e8-c500f76bffb8" data-toc-id="a6f906e9-f84b-4131-a1e8-c500f76bffb8" collapsed="false" seolevelmigrated="true">DFA Minimization</h3>Goal: smallest DFA recognising a language.<ol><li>Remove unreachable states (graph reachability from$ q_0 $).</li><li>Table-filling method: mark distinguishable pairs (final vs non-final). Iteratively mark pairs whose transitions lead to already-distinguished pairs.</li><li>Merge unmarked (equivalent) states.</li><li>Rebuild DFA. Example in transcript shows 4-state DFA already minimal because every pair became distinguishable.</li></ol><h3 id="577e008d-281f-4d8c-8a27-294fa56faa41" data-toc-id="577e008d-281f-4d8c-8a27-294fa56faa41" collapsed="false" seolevelmigrated="true">Practical Applications</h3>• Text search / pattern matching – NFAs encode the pattern, DFAs perform fast linear scanning (e.g., grep). • Keyword recognition in lexical analysis – compiler’s lexer uses a DFA whose states represent longest-prefix matches. Example keyword set$ {if,int,while} $produced states$ q0\dots q9 $with deterministic edges; accepting states$ q2,q4,q_9 $. • Hardware control (on/off, vending machine). • Network protocol validation, UI workflows, regular expression engines.<h3 id="dcdb2583-8460-4522-9dba-766b14ee9afc" data-toc-id="dcdb2583-8460-4522-9dba-766b14ee9afc" collapsed="false" seolevelmigrated="true">Connections to Further Topics</h3>• Once a language is beyond DFA power (e.g., balanced parentheses) we move to pushdown automata. • Undecidable properties appear at the Turing-machine level (e.g., Halting Problem). • Complexity classes ($ P $,$ NP $,$ PSPACE$$) build on computability: first ensure solvability, then measure resources.