CS483 9/22

Sorting Algorithms Overview

Comparison-Based Sorts and Lower Bounds
  • Comparison-based sorting algorithms, regardless of how they partition elements (e.g., into two or three parts), fundamentally rely on comparisons between elements.
  • This characteristic limits how fast they can possibly run: the theoretical lower bound for comparison-based sorting is Ω(N log N) comparisons in the worst case.
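One standard way to see this bound is the decision-tree argument: a comparison sort must distinguish all N! input orderings, and a binary tree with 2^h leaves needs height h ≥ log₂(N!) = Θ(N log N). A small numerical sketch (the value of `n` here is just an illustration):

```python
import math

# A comparison sort is a binary decision tree whose leaves are the N!
# possible input orderings, so its height h satisfies 2^h >= N!,
# i.e. h >= log2(N!) comparisons in the worst case.
n = 16
min_comparisons = math.ceil(math.log2(math.factorial(n)))
print(min_comparisons)              # 45 for n = 16
print(math.ceil(n * math.log2(n)))  # 64: same order of growth
```

Note that log₂(N!) and N log₂ N agree up to constant factors (Stirling's approximation), which is why the bound is stated as Ω(N log N).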
Quicksort Analysis: Uneven Partitions
  • Impact of Uneven Splits: When Quicksort doesn't pick a perfect median, it produces uneven partitions. For example, a pivot might split a list of N elements into pieces of size N/4 and 3N/4.
  • Recursive Calls: Each piece is then split again in the same ratio: the N/4 piece into blocks of N/16 and 3N/16, and the 3N/4 piece into blocks of 3N/16 and 9N/16.
  • Tree Structure Consequences: This uneven partitioning affects the structure of the recursion tree:
    • Some branches will be shorter (ending sooner) and some will be longer (ending later).
    • The height of the tree will vary across different branches.
    • The shortest path to a base case (e.g., a single element) follows the larger reduction factor at every level (dividing by 4 if one side is 1/4 of the list). Its height is log₄ N.
    • The longest path to a base case follows the smaller reduction factor at every level (dividing by 4/3 if the other side is 3/4 of the list). Its height is log₄⁄₃ N.
  • Asymptotic Equivalence of Logarithms: Despite the different bases, both heights are Θ(log N). The base of a logarithm does not change asymptotic growth, since log_a N = k · log_b N for the constant k = log_a b.
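The two path lengths above can be checked numerically. A small sketch (the helper name `depth` is illustrative, not from the lecture):

```python
import math

def depth(n, shrink):
    """Count how many times a size-n problem shrinks by factor `shrink`
    before reaching size <= 1 (i.e., the length of that recursion branch)."""
    steps = 0
    size = float(n)
    while size > 1:
        size *= shrink
        steps += 1
    return steps

N = 10**6
short = depth(N, 1/4)  # branch that always keeps the 1/4 side
long_ = depth(N, 3/4)  # branch that always keeps the 3/4 side

print(short, math.ceil(math.log(N, 4)))      # both 10
print(long_, math.ceil(math.log(N, 4/3)))    # both 49
```

Both branch lengths match their closed forms, and their ratio is the constant log 4 / log(4/3) ≈ 4.8, so both are Θ(log N).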
Randomized vs. Deterministic Algorithms
  • Randomized Algorithms (e.g., Randomized Quicksort):
    • May have a poor worst-case runtime (e.g., N²) but an excellent expected-case runtime (e.g., N log N).
    • The worst-case scenario is typically rare and requires a specific, often pathological, input sequence or consistently 'unlucky' random choices.
    • The randomness usually means that no single input consistently triggers the worst case; the worst case depends on the random choices made by the algorithm, not solely on the input data.
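A minimal sketch of randomized Quicksort, assuming the simple out-of-place partitioning style (real implementations usually partition in place):

```python
import random

def randomized_quicksort(a):
    """Quicksort with a uniformly random pivot: because the pivot is random,
    no fixed input order can consistently trigger the N^2 worst case."""
    if len(a) <= 1:
        return a
    pivot = random.choice(a)
    less    = [x for x in a if x < pivot]
    equal   = [x for x in a if x == pivot]
    greater = [x for x in a if x > pivot]
    return randomized_quicksort(less) + equal + randomized_quicksort(greater)
```

For example, `randomized_quicksort([3, 1, 4, 1, 5])` returns `[1, 1, 3, 4, 5]` regardless of which pivots are drawn; only the running time, not the result, depends on the random choices.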
  • Deterministic Algorithms (e.g., Heapsort, Mergesort):
    • Have a consistent runtime performance for a given input size, meaning their best, average, and worst-case runtimes are often the same (e.g., N log N for both Heapsort and Mergesort).
    • The runtime is entirely predictable for any given input.
    • Lack the unpredictability of randomized choices, which can be an advantage in scenarios where consistent performance is critical.
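As an illustration of such guaranteed behavior, here is a minimal Heapsort sketch built on Python's `heapq` module (a textbook heapsort would instead sift elements within the array in place):

```python
import heapq

def heapsort(a):
    """Guaranteed O(N log N) on every input: heapify in O(N),
    then pop the minimum N times at O(log N) per pop."""
    heap = list(a)
    heapq.heapify(heap)
    return [heapq.heappop(heap) for _ in range(len(heap))]
```

No matter how the input is ordered, the number of comparisons is bounded by the same N log N shape; there is no "unlucky" case, which is the predictability advantage described above.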