Self-Balancing Binary Trees

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/18

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

19 Terms

New cards

Searching Problem/Index System

Set of entries: {𝑒1, … , 𝑒𝑛}
Each entry 𝑒𝑖 is a pair (𝑘𝑖, 𝑣𝑖), where:
- 𝑘𝑖 is a key value from some totally ordered domain, say integers or strings,
- 𝑣𝑖 is an associated data value
We assume keys are unique
We do not use data values to solve the searching problem, but they will be important to the applications that use our data structure

New cards

Searching Problem

Given an arbitrary search key 𝑘, determine if there exists an entry matching this key value and if so, what the associated data value is

New cards

Dictionary (Map) ADT

An ADT with operations insert, delete and find:

void insert(Key 𝒌, Value 𝒗): Stores an entry with the key-value pair (𝑘, 𝑣). (If 𝑘 is already present in the dictionary, error)

void delete(Key k): Deletes the entry associated with key 𝑘 from the dictionary. (If it does not exist, error)

Value find(Key k): Determine whether there is an entry in the dictionary associated with key 𝑘. If there is, return the associated value. Otherwise, return null reference

You can also support iteration over the entries, range queries (enumerate or count all entries between some minimum and maximum value), returning 𝒊th smallest value as well as union and intersection

New cards

Sequential Allocation for Dictionary ADT

Most naïve implementation: Store the entries in a linear array without sorting
- To find key value, linear search -> 𝑂(𝑛) time
- Insertion only takes 𝑂(1) time, if we still have enough space in the array. But checking for duplicates -> 𝑂(𝑛) time
Otherwise, use a sorted (by key value) array
- To find key value, binary search -> 𝑂(log 𝑛) time
  - For 𝑛 = 10^9 entries, log2 𝑛 ≤ 30
- But updates (insertion and deletion) take 𝑶(𝒏) time as elements in array must be moved around to make space

New cards

Binary Search Tree

at most two children per node - left child and right child

each node has a key-value pair, and are sorted by their keys

for each node its key is >= all keys in its left subtree, and < all keys in its right subtree

New cards

Are these Binary Trees Equal?

New cards

Setting Up Binary Search Tree ADT

public class BST<Key extends Comparable<Key>, Value> {
   private Node root;

   private class Node {
      private Key key;
      private Value value;
      private Node leftChild, rightChild;

      public Node(Key key, Value value) {
         this.key = key;
         this.value = value;
      }
   }
   ...
}

New cards

Implementing Find

Leverage the sortedness property:

If root has key, return root,
Else if key smaller, go left,
Else, go right

public Value find(Key key) {
   Node x = root;
   while (x != null) {
      int cmp = key.compareTO(x.key);
      if (cmp < 0) x = x.leftChild;
      //cmp < 0 == key < x.key
      else if (cmp > 0) x = x.rightChild;
      else return x.value; //cmp == 0
   }
   return null;
}

<p>Leverage the sortedness property:</p><ol type="1"><li><p><span>If root has key, return root,</span></p></li><li><p><span>Else if key smaller, go left,</span></p></li><li><p><span>Else, go right</span></p></li></ol><pre><code class="language-Java">public Value find(Key key) {
Node x = root;
while (x != null) {
int cmp = key.compareTO(x.key);
if (cmp < 0) x = x.leftChild;
//cmp < 0 == key < x.key
else if (cmp > 0) x = x.rightChild;
else return x.value; //cmp == 0
}
return null;
}</code></pre><p></p>

New cards

Time Complexity of Find

For a BST of height ℎ, the worst-case complexity of find() is 𝑂(ℎ)
Height of BST = Ω(log 𝑛) and 𝑂(𝑛)
BST is very efficient on balanced trees, but inefficient on degenerate trees

<ul><li><p><span>For a BST of height </span><span style="font-family: "Cambria Math"">ℎ</span><span>, the worst-case complexity of find() is </span><span style="font-family: "Cambria Math"">𝑂</span><span>(</span><span style="font-family: "Cambria Math"">ℎ</span><span>)</span></p></li><li><p><span>Height of BST = Ω(log </span><span style="font-family: "Cambria Math"">𝑛</span><span>) and </span><span style="font-family: "Cambria Math"">𝑂</span><span>(</span><span style="font-family: "Cambria Math"">𝑛</span><span>)</span></p></li><li><p><span>BST is very efficient on balanced trees, but inefficient on degenerate trees</span></p></li></ul><p></p>

New cards

Implementing Insertion

Need to maintain sortedness:

If x is null, insert new node
If x has key, give error,
Else if key < x’s key, recurse left
Else recurse right

public void insert(Key key, Value value) {
   root = inset(root, key, value);
}

private Node insert(Node x, Key key, Value value) {
   if (x == null) return new Node(key, value);
   int cmp = key.compareTo(x.key);
   if (cmp < 0) {
      x.leftChild = insert(x.leftChild, key, value);
   }
   else if (cmp > 0) {
      x.rightChild = insert(x.rightChild, key, value);
   }
   else System.out.println("Insert Exception");
      //x.value = value;
   return x;
}

New cards

Insertion Order Matters

New cards

Time Complexity of Insertion

For a BST of height ℎ, worst-case time complexity = 𝑂(ℎ)
Depends on the insertion order

New cards

Implementing Deletion

New cards

Code for delete()

successor of a node with value = node in tree with value just above that

private Node successor(Node x) {
   if (x == null || x.rightChild == null) return null;
   else {
      x = x.rightChild;
      while (x.leftChild != null) x = x.leftChild;
      return x;
   }
}

public void deleteMin() {
   root = deleteMin(root);
}

private Node deleteMin(Node x) {
   if (x.leftChild == null) return x.rightChild;
   else {
      x.leftChild = deleteMin(x.leftChild);
      return x;
   }
}

public void delete(Key key) {
   root = delete(root, key);
}

private Node delete(Node x, Key key) {
   if (x == null) return null;
   int cmp = key.compareTo(x.key);
   if (cmp < 0) x.leftChild = delete(X.leftChild, key);
   else if (cmp > 0) x.rightChild = delete(x.rightChild, key);
   else { //if we need to delete node x
      if (x.rightChild == null) return x.leftChild;
      else if (x.leftChild == null) return x.rightChild;
      else {
         Node t = x;
         x = successor(t);
         x.rightChild = deleteMin(t.rightChild);
         x. leftChild = t.leftChild;
      }
   }
   return x;
}

New cards

Self-Balancing Search Trees

There are many options, with diverse trade-offs

2-3 trees: tertiary self-balancing search trees
AVL trees: self-balancing BST
Red-black trees are a (BST) generalization of 2-3 trees
B-trees: generalization of 2-3 trees to more keys per node

Some are not height-balanced:

Splay Trees
Treaps

For example, you may want your search tree to self-balance for common letter (key) patterns rather than for any letter (key) sequence

New cards

2-3 Trees

2-nodes: 1 key, 2 children
3-nodes: 2 keys, 3 children
(Natural variant of) Inorder traversal gives keys in ascending order
Perfectly balanced tree: Every path from the root to a leaf has the same length
How do we maintain this property despite insertions and deletions?

<ul><li><p><span>2-nodes: 1 key, 2 children</span></p></li><li><p><span>3-nodes: 2 keys, 3 children</span></p></li><li><p><span>(Natural variant of) Inorder traversal gives keys in ascending order</span></p></li><li><p><span>Perfectly balanced tree: Every path from the root to a leaf has the same length</span></p></li><li><p><span>How do we maintain this property despite insertions and deletions?</span></p></li></ul><p></p>

New cards

Find (2-3 Trees)

Works (almost) just as in BSTs,
Example: find(8)

<ul><li><p><span>Works (almost) just as in BSTs,</span></p></li><li><p><span>Example: find(8)</span></p></li></ul><p></p>

New cards

Insertion (2-3 Trees)

New cards

Cons of 2-3 Trees

Quite inconvenient to implement
- Different types of nodes (2-nodes and 3-nodes)
- We need to do multiple compares at each node,
- We need to move back up the tree to split 4-nodes 4. Large number of cases for that splitting
Red-black trees are BSTs that build upon 2-3 trees, representing 3-node as a binary tree node with a red left link

<ul><li><p>Quite inconvenient to implement</p><ul><li><p>Different types of nodes (2-nodes and 3-nodes)</p></li><li><p>We need to do multiple compares at each node,</p></li><li><p>We need to move back up the tree to split 4-nodes 4. Large number of cases for that splitting</p></li></ul></li><li><p>Red-black trees are BSTs that build upon 2-3 trees, representing 3-node as a binary tree node with a red left link</p></li></ul><p></p>