Heaps and Priority Queues: Internals, Trade-offs, and When Theory Breaks Down
Heaps provide the fundamental abstraction for “give me the most important thing next” in O(log n) time. Priority queues — the abstract interface — power task schedulers, shortest-path algorithms, and event-driven simulations. Binary heaps dominate in practice not because they’re theoretically optimal, but because array storage exploits cache locality. Understanding the gap between textbook complexity and real-world performance reveals when to use standard libraries, when to roll your own, and when the “better” algorithm is actually worse.
Abstract
A heap is a complete binary tree stored in an array where each parent dominates its children (min-heap: parent ≤ children; max-heap: parent ≥ children). Array indices encode parent-child relationships: for a node at index i, the parent is at ⌊(i-1)/2⌋ and the children at 2i+1 and 2i+2. This implicit structure eliminates pointer overhead and enables contiguous memory access.
The core operations:
| Operation | Binary Heap | What Happens |
|---|---|---|
| `peek` | O(1) | Root is always min/max |
| `insert` | O(log n) | Add at end, bubble up |
| `extractMin`/`extractMax` | O(log n) | Swap root with last, bubble down |
| `buildHeap` | O(n) | Heapify bottom-up (not n × log n) |
| `decreaseKey` | O(log n) | Update value, bubble up |
The O(n) buildHeap complexity is counterintuitive — it works because most nodes are near the leaves where heapify is cheap2. Fibonacci heaps offer O(1) amortized decreaseKey3, but their constant factors make them slower than binary heaps in practice for all but very large, dense graphs4.
Important
Theoretical complexity is dominated by cache effects once the heap exceeds L1/L2. A 4-ary heap is consistently faster than a binary heap on large workloads — LaMarca and Ladner measured improvements of roughly 10-30% for aligned 4-ary heaps on out-of-cache workloads5.
Warning
Binary-heap decreaseKey is only O(log n) if you keep a value → index map maintained on every swap. Without one, you must scan the array to find the node first — turning the operation into O(n).
The Heap Property and Array Representation
A binary heap satisfies two constraints1:

- Shape property — a complete binary tree: all levels filled except possibly the last, which fills left-to-right.
- Heap property — each node dominates its children (min-heap: `A[parent] ≤ A[child]`; max-heap: `A[parent] ≥ A[child]`).
The complete-tree constraint is what makes the array representation work. Without it, gaps would appear in the array as nodes are removed.
Why Array Storage Works
The complete binary tree constraint enables an elegant array representation without explicit pointers:
```typescript
// Fundamental relationships for array-based heaps
function parent(i: number): number {
  return Math.floor((i - 1) / 2) // Equivalent: (i - 1) >> 1
}

function leftChild(i: number): number {
  return 2 * i + 1 // Equivalent: (i << 1) + 1
}

function rightChild(i: number): number {
  return 2 * i + 2 // Equivalent: (i << 1) + 2
}
```

CLRS uses 1-indexed arrays where parent(i) = ⌊i/2⌋, left(i) = 2i, right(i) = 2i + 1. Production implementations almost always use 0-indexed arrays — the arithmetic is slightly messier but avoids wasting index 0.
Memory layout advantage: with 64-byte cache lines and 8-byte elements, one cache line holds eight heap nodes. Sequential access during heapify loads multiple nodes for free. This is the structural reason array-based heaps outperform pointer-based structures despite identical asymptotic complexity5.
The Shape Guarantee
The complete tree property means:
- A heap with n nodes has height ⌊log₂ n⌋.
- Level k contains at most 2^k nodes.
- The last level may be incomplete, but fills left-to-right.
This guarantees balanced structure without rotations or rebalancing — unlike binary search trees, where adversarial insertion order creates O(n) height.
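The height bound is easy to sanity-check numerically. A tiny sketch (`heapHeight` is an illustrative helper, not part of any library):

```typescript
// Height of a complete binary tree with n nodes (n >= 1):
// floor(log2 n). 7 nodes fill 3 full levels (height 2);
// an 8th node starts a new level (height 3).
function heapHeight(n: number): number {
  return Math.floor(Math.log2(n))
}
```

A million-element heap therefore has height 19 — only 19 swaps bound any bubble-up or bubble-down.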
Core Operations: How Bubbling Works
Insert (Bubble Up / Sift Up)
Insert places the new element at the end (maintaining completeness), then restores the heap property by repeatedly swapping with the parent if violated:
```typescript
// Binary min-heap implementation
// heap is an array; its length is the current element count
class MinHeap<T> {
  private heap: T[] = []
  private compare: (a: T, b: T) => number

  constructor(compareFn: (a: T, b: T) => number = (a, b) => (a as any) - (b as any)) {
    this.compare = compareFn
  }

  insert(value: T): void {
    this.heap.push(value)               // Add at end (O(1) amortized)
    this.bubbleUp(this.heap.length - 1) // Restore heap property
  }

  private bubbleUp(index: number): void {
    while (index > 0) {
      const parentIdx = Math.floor((index - 1) / 2)
      if (this.compare(this.heap[index], this.heap[parentIdx]) >= 0) {
        break // Heap property satisfied
      }
      // Swap with parent
      ;[this.heap[index], this.heap[parentIdx]] = [this.heap[parentIdx], this.heap[index]]
      index = parentIdx
    }
  }
}
```

Worst case: the new element is smaller than the root, requiring log n swaps up the entire height. Best case: the element belongs at the bottom, O(1). Average case for random insertions: about half the path to the root, still O(log n).
Extract Min/Max (Bubble Down / Sift Down)
Extraction swaps the root with the last element, removes the last, then restores the heap property by bubbling down:
```typescript
// Continuing the MinHeap class
extractMin(): T | undefined {
  if (this.heap.length === 0) return undefined
  if (this.heap.length === 1) return this.heap.pop()

  const min = this.heap[0]
  this.heap[0] = this.heap.pop()! // Move last to root
  this.bubbleDown(0)              // Restore heap property
  return min
}

private bubbleDown(index: number): void {
  const size = this.heap.length
  while (true) {
    const left = 2 * index + 1
    const right = 2 * index + 2
    let smallest = index

    // Find smallest among node and its children
    if (left < size && this.compare(this.heap[left], this.heap[smallest]) < 0) {
      smallest = left
    }
    if (right < size && this.compare(this.heap[right], this.heap[smallest]) < 0) {
      smallest = right
    }

    if (smallest === index) break // Heap property satisfied
    ;[this.heap[index], this.heap[smallest]] = [this.heap[smallest], this.heap[index]]
    index = smallest
  }
}
```

Key detail: we compare with both children and swap with the smaller one (min-heap). Swapping with the larger child would violate the heap property for the other subtree.
Why Swap-with-Last Works
Removing the root directly would leave a hole requiring expensive restructuring. Swapping with the last element:
- Maintains the complete tree shape (last element gone, root replaced).
- Only potentially violates the heap property at the root.
- Bubbling down is bounded by tree height.
Tip
CPython’s heapq deliberately deviates from the textbook bubble-down at this point. After moving the tail to the root, it repeatedly promotes the smaller child all the way down to a leaf without comparing against the moved element, then bubbles that element back up to its final position — a bottom-up technique attributed to Knuth (TAOCP Vol. 3, §5.2.3) that measurably reduces comparisons because the moved element usually does belong near the bottom7.
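A minimal sketch of that bottom-up variant on a plain `number[]` min-heap (`siftUpBottomUp` is an illustrative name; CPython’s real implementation is the `_siftup`/`_siftdown` pair in `Lib/heapq.py`):

```typescript
// Bottom-up sift-down: restore the min-heap property at `pos`,
// assuming both subtrees below it are already valid heaps.
function siftUpBottomUp(heap: number[], pos: number): void {
  const end = heap.length
  const start = pos
  const item = heap[pos]

  // Phase 1: walk to a leaf, always promoting the smaller child,
  // without comparing `item` at each level (this is the saving).
  let child = 2 * pos + 1
  while (child < end) {
    const right = child + 1
    if (right < end && heap[right] < heap[child]) child = right
    heap[pos] = heap[child]
    pos = child
    child = 2 * pos + 1
  }

  // Phase 2: place `item` at the leaf position reached, then
  // bubble it up to where it actually belongs (usually few steps).
  heap[pos] = item
  while (pos > start) {
    const parent = (pos - 1) >> 1
    if (heap[parent] <= heap[pos]) break
    ;[heap[pos], heap[parent]] = [heap[parent], heap[pos]]
    pos = parent
  }
}
```

The textbook version does two comparisons per level on the way down; this version does one, at the cost of a short bubble-up at the end.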
Build Heap: The O(n) Surprise
The naive approach — insert n elements one by one — costs O(n log n). But building a heap by calling heapify (bubble down) from the middle of the array upward runs in O(n)2.
The Mathematical Insight
At most ⌈n/2^(h+1)⌉ nodes exist at height h, and heapify at a node of height h costs O(h). The total work:

Σ (h = 0 → ⌊log₂ n⌋) ⌈n/2^(h+1)⌉ · O(h) = O(n · Σ (h ≥ 0) h/2^h)

The infinite series Σ h/2^h converges to 2, so the total work is O(n)2.
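The convergence is easy to check numerically: summing the per-height work for a concrete n stays close to n rather than n log n (`heapifyWorkBound` is an illustrative helper):

```typescript
// Sum over heights h of (max #nodes at height h) * h, the
// buildHeap cost model. For n = 1,000,000 this is ~1,000,000,
// not the ~20,000,000 an n*log(n) bound would suggest.
function heapifyWorkBound(n: number): number {
  let total = 0
  for (let h = 0; (1 << h) <= n; h++) {
    const nodesAtHeight = Math.ceil(n / 2 ** (h + 1))
    total += nodesAtHeight * h
  }
  return total
}
```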
```typescript
// Convert arbitrary array to valid heap
function buildHeap<T>(arr: T[], compare: (a: T, b: T) => number): void {
  const n = arr.length
  // Start from last non-leaf node, heapify each
  for (let i = Math.floor(n / 2) - 1; i >= 0; i--) {
    heapify(arr, n, i, compare)
  }
}

function heapify<T>(arr: T[], size: number, index: number, compare: (a: T, b: T) => number): void {
  let smallest = index
  const left = 2 * index + 1
  const right = 2 * index + 2

  if (left < size && compare(arr[left], arr[smallest]) < 0) smallest = left
  if (right < size && compare(arr[right], arr[smallest]) < 0) smallest = right

  if (smallest !== index) {
    ;[arr[index], arr[smallest]] = [arr[smallest], arr[index]]
    heapify(arr, size, smallest, compare)
  }
}
```

Practical implication: when you have all elements upfront, use buildHeap. It’s roughly 2× faster than repeated insert for the same result, and standard libraries expose it directly (heapq.heapify in Python, std::make_heap in C++, heap.Init in Go).
Heap Sort: O(n log n) Guaranteed, But Slower in Practice
Heap sort works by building a max-heap, then repeatedly extracting the maximum:
```typescript
// In-place, unstable, O(n log n) guaranteed
function heapSort<T>(arr: T[], compare: (a: T, b: T) => number): void {
  const n = arr.length

  // Build max-heap (reverse comparison)
  for (let i = Math.floor(n / 2) - 1; i >= 0; i--) {
    maxHeapify(arr, n, i, compare)
  }

  // Extract elements one by one
  for (let i = n - 1; i > 0; i--) {
    ;[arr[0], arr[i]] = [arr[i], arr[0]] // Move max to end
    maxHeapify(arr, i, 0, compare)       // Restore heap on reduced array
  }
}

function maxHeapify<T>(arr: T[], size: number, index: number, compare: (a: T, b: T) => number): void {
  let largest = index
  const left = 2 * index + 1
  const right = 2 * index + 2

  if (left < size && compare(arr[left], arr[largest]) > 0) largest = left
  if (right < size && compare(arr[right], arr[largest]) > 0) largest = right

  if (largest !== index) {
    ;[arr[index], arr[largest]] = [arr[largest], arr[index]]
    maxHeapify(arr, size, largest, compare)
  }
}
```

Why Heap Sort Loses to Quicksort
| Factor | Heap Sort | Quicksort |
|---|---|---|
| Worst case | O(n log n) always | O(n²) without pivot optimization |
| Cache locality | Poor — jumps across the array | Excellent — linear partitioning |
| Branch prediction | Poor — unpredictable swap paths | Better — partition scans are predictable |
| Practical speed | 1× baseline | Typically 2-3× faster |
Quicksort scans memory linearly during partitioning, loading cache lines efficiently. Heap sort jumps between parent and children nodes, causing cache misses8. Modern implementations like introsort use heap sort as a fallback when quicksort recursion exceeds 2 ⌊log₂ n⌋ — getting quicksort’s average speed with heap sort’s worst-case guarantee. C++ std::sort is required to be O(n log n), and libstdc++ implements it as introsort precisely because of this trade-off.
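A compact sketch of the introsort idea — plain Lomuto partitioning with a depth counter, falling back to an in-place heap sort on the offending subrange once the limit is hit. Illustrative code only, not libstdc++'s actual implementation:

```typescript
// Introsort sketch: quicksort until recursion depth exceeds
// 2*floor(log2 n), then heap-sort the remaining subrange.
function introsort(arr: number[]): void {
  const depthLimit = 2 * Math.floor(Math.log2(arr.length || 1))
  sortRange(arr, 0, arr.length - 1, depthLimit)
}

function sortRange(arr: number[], lo: number, hi: number, depth: number): void {
  if (lo >= hi) return
  if (depth === 0) {
    heapSortRange(arr, lo, hi) // worst-case O(n log n) safety net
    return
  }
  const p = partition(arr, lo, hi)
  sortRange(arr, lo, p - 1, depth - 1)
  sortRange(arr, p + 1, hi, depth - 1)
}

// Lomuto partition with the last element as pivot
function partition(arr: number[], lo: number, hi: number): number {
  const pivot = arr[hi]
  let i = lo
  for (let j = lo; j < hi; j++) {
    if (arr[j] < pivot) {
      ;[arr[i], arr[j]] = [arr[j], arr[i]]
      i++
    }
  }
  ;[arr[i], arr[hi]] = [arr[hi], arr[i]]
  return i
}

// Max-heap sort restricted to arr[lo..hi], indices relative to lo
function heapSortRange(arr: number[], lo: number, hi: number): void {
  const n = hi - lo + 1
  const sift = (start: number, size: number) => {
    let root = start
    while (2 * root + 1 < size) {
      let child = 2 * root + 1
      if (child + 1 < size && arr[lo + child + 1] > arr[lo + child]) child++
      if (arr[lo + root] >= arr[lo + child]) return
      ;[arr[lo + root], arr[lo + child]] = [arr[lo + child], arr[lo + root]]
      root = child
    }
  }
  for (let i = Math.floor(n / 2) - 1; i >= 0; i--) sift(i, n)
  for (let end = n - 1; end > 0; end--) {
    ;[arr[lo], arr[lo + end]] = [arr[lo + end], arr[lo]]
    sift(0, end)
  }
}
```

An already-sorted or reversed input drives the naive Lomuto pivot into its O(n²) pattern, hits the depth limit, and hands the rest to the heap-sort fallback — which is exactly the scenario the hybrid exists for.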
Heap Sort Is Not Stable
Stability means equal elements maintain their original relative order. Heap sort swaps elements across large distances (root with last position), breaking stability9. For stability use merge sort, Timsort (Python’s list.sort), or pdqsort (Rust’s slice sort).
D-ary Heaps: When More Children Means Faster
A d-ary heap generalizes binary heaps to d children per node6:
- Index calculations: parent(i) = ⌊(i-1)/d⌋, children at d·i + 1 through d·i + d.
- Tree height: log_d n instead of log₂ n.
- insert: fewer levels to bubble up (faster).
- extractMin: more comparisons per level to find the smallest child (slower).
Why 4-ary Heaps Win in Practice
LaMarca and Ladner’s foundational study on caches and heaps showed that aligning the fanout with the cache line size dramatically reduces cache misses for extractMin-heavy workloads: aligned 4-ary heaps consistently beat binary heaps once the data no longer fits in cache, by roughly 10-30% in their measurements5.
The reasons:
- Reduced height — a 4-ary heap with 1M elements has height ~10 vs ~20 for binary.
- Cache line utilization — four 8-byte children fit in a 64-byte cache line, so all candidate children are usually loaded by a single line fetch.
- Fewer cache misses — despite more comparisons per level, the dominant cost on modern hardware is memory traffic, not ALU work.
```typescript
// D-ary heap with d=4
const D = 4

function parent(i: number): number {
  return Math.floor((i - 1) / D)
}

function firstChild(i: number): number {
  return D * i + 1
}

// During bubble-down, compare all D children
function findSmallestChild(heap: number[], parentIdx: number, size: number): number {
  let smallest = parentIdx
  const start = D * parentIdx + 1
  const end = Math.min(start + D, size)
  for (let i = start; i < end; i++) {
    if (heap[i] < heap[smallest]) {
      smallest = i
    }
  }
  return smallest
}
```

When to use d-ary: large heaps that exceed L2 cache and where memory access dominates CPU cycles. For small heaps (< ~1000 elements), binary heaps are simpler and fast enough — the tree fits in L1 and the wider fanout buys nothing.
Fibonacci Heaps: Theoretically Optimal, Practically Slow
Fibonacci heaps achieve3:
- insert: O(1) amortized
- findMin: O(1)
- decreaseKey: O(1) amortized
- extractMin: O(log n) amortized
- merge: O(1)
This makes Dijkstra’s algorithm O(E + V log V) instead of O((V + E) log V) with a binary heap11 — a meaningful improvement when E approaches V².
Why Fibonacci Heaps Lose in Practice
- High constant factors — the lazy consolidation machinery and per-node bookkeeping (parent, child, sibling pointers, mark bit, degree) add substantial overhead4.
- Pointer chasing — the node-based structure defeats cache prefetching.
- Complex implementation — more opportunities for bugs, harder to optimize.
- Amortized ≠ consistent — individual operations can spike to O(n).
Experimental studies — Stasko & Vitter (1987)12, Moret & Shapiro (1992), and Larkin, Sen & Tarjan (2014)13 — consistently show pairing heaps outperform Fibonacci heaps in decreaseKey-heavy graph workloads despite weaker theoretical guarantees14. For most applications, a binary heap with the “duplicate node” strategy beats both.
The Duplicate Node Strategy
Instead of decreaseKey, insert a new node with the updated priority. When extracting, skip stale entries11:
```typescript
// Used in Dijkstra's algorithm implementations
// Trade memory for simplicity — no need to track node positions
interface PQEntry<T> {
  priority: number
  value: T
  valid: boolean // Or use a Set to track processed nodes
}

class SimplePriorityQueue<T> {
  private heap: PQEntry<T>[] = []

  insert(priority: number, value: T): void {
    this.heap.push({ priority, value, valid: true })
    this.bubbleUp(this.heap.length - 1)
  }

  // Instead of decrease-key, insert again and mark old as invalid
  update(oldEntry: PQEntry<T>, newPriority: number): void {
    oldEntry.valid = false
    this.insert(newPriority, oldEntry.value)
  }

  extractMin(): T | undefined {
    while (this.heap.length > 0) {
      const entry = this.extractRoot()
      if (entry?.valid) return entry.value
      // Skip invalid entries from previous decrease-key operations
    }
    return undefined
  }
}
```

This wastes memory (duplicate entries) but avoids tracking node positions in the heap — a significant implementation simplification. The heap can grow to O(E) in the worst case for Dijkstra, which is rarely an issue in practice.
Other Variants Worth Knowing
Most production code never reaches for these, but they fill specific gaps in the design space and show up in algorithms papers.
| Variant | Headline property | When to reach for it |
|---|---|---|
| Leftist heap | O(log n) merge via right-spine bound on rank15 | You need a mergeable heap and want a simple pointer-based structure with proven bounds. |
| Skew heap | Self-adjusting cousin of leftist; same amortized merge, no rank stored | Same as leftist when amortized bounds are acceptable; less code. |
| Binomial heap | O(log n) merge via binomial-tree linking | Educational; rarely shipped except as a Fibonacci-heap building block. |
| Brodal queue | Worst-case O(1) insert, findMin, meld, decreaseKey16 | Real-time systems where a single amortized spike is unacceptable. |
Note
CLRS treats binary, binomial, and Fibonacci heaps end-to-end; Sedgewick adds leftist and pairing variants. The Tarjan / Sleator pairing-heap paper (1986) and the Brodal–Okasaki (1996) and Brodal (1996) papers cover the worst-case end of the spectrum. None of these has displaced the binary heap from standard libraries.
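For a feel of the mergeable-heap end of that table, here is a minimal leftist-heap sketch (min-heap; `LNode`, `merge`, and the helper names are illustrative). The entire structure is the `merge` function — insert and extract-min are one-liners on top of it:

```typescript
// Leftist heap node: rank is the length of the right spine.
interface LNode {
  key: number
  rank: number
  left: LNode | null
  right: LNode | null
}

function leaf(key: number): LNode {
  return { key, rank: 1, left: null, right: null }
}

function merge(a: LNode | null, b: LNode | null): LNode | null {
  if (!a) return b
  if (!b) return a
  if (b.key < a.key) [a, b] = [b, a] // a holds the smaller root
  a.right = merge(a.right, b)        // recurse down the right spine
  // Leftist invariant: left child's rank >= right child's rank
  const lr = a.left ? a.left.rank : 0
  const rr = a.right ? a.right.rank : 0
  if (lr < rr) [a.left, a.right] = [a.right, a.left]
  a.rank = Math.min(lr, rr) + 1
  return a
}

const insert = (h: LNode | null, key: number) => merge(h, leaf(key))
const extractMin = (h: LNode): [number, LNode | null] => [h.key, merge(h.left, h.right)]
```

The rank bookkeeping keeps the right spine O(log n) long, which is what bounds `merge` — the property the CMU lecture in the footnotes proves.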
Standard Library Implementations
Almost every mainstream standard library that ships a priority queue ships a binary heap. The interface differences matter more than the algorithmic ones.
Python: heapq Module
Python’s heapq provides functions operating on regular lists. It is min-heap only — for a max-heap, negate values (or rely on the private `_heapify_max`/`_heappop_max` helpers, which are not part of the public API).
```python
# heapq operates on lists, doesn't wrap them
import heapq

data = [5, 1, 8, 3, 2]
heapq.heapify(data)             # O(n) in-place transformation
heapq.heappush(data, 4)         # O(log n) insert
smallest = heapq.heappop(data)  # O(log n) extract min

# No decrease-key — use duplicate node strategy
# No max-heap — negate values or use custom comparison
```

CPython ships a C accelerator at Modules/_heapqmodule.c; import heapq transparently uses it when available and falls back to the pure-Python Lib/heapq.py7.
Warning
heapq is not stable. Equal-priority items may come out in any order. The Python docs explicitly recommend (priority, count, task) tuples with a monotonically increasing count from itertools.count() as the standard tiebreaker — this also avoids TypeError when task itself is unorderable18.
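The same counter tiebreaker translates directly to comparator-based heaps in any language: tag each entry with a monotonically increasing sequence number and compare `(priority, seq)` lexicographically. A TypeScript sketch (`StablePQ` is an illustrative name, not a library type):

```typescript
// Binary min-heap whose equal-priority entries pop in FIFO order.
class StablePQ<T> {
  private heap: { priority: number; seq: number; value: T }[] = []
  private counter = 0 // monotonically increasing insertion tag

  push(priority: number, value: T): void {
    this.heap.push({ priority, seq: this.counter++, value })
    let i = this.heap.length - 1
    while (i > 0) {
      const p = (i - 1) >> 1
      if (this.less(p, i)) break
      ;[this.heap[i], this.heap[p]] = [this.heap[p], this.heap[i]]
      i = p
    }
  }

  pop(): T | undefined {
    if (this.heap.length === 0) return undefined
    const top = this.heap[0]
    const last = this.heap.pop()!
    if (this.heap.length > 0) {
      this.heap[0] = last
      let i = 0
      for (;;) {
        const l = 2 * i + 1, r = 2 * i + 2
        let m = i
        if (l < this.heap.length && this.less(l, m)) m = l
        if (r < this.heap.length && this.less(r, m)) m = r
        if (m === i) break
        ;[this.heap[i], this.heap[m]] = [this.heap[m], this.heap[i]]
        i = m
      }
    }
    return top.value
  }

  // Compare (priority, seq) lexicographically; seq breaks ties FIFO
  private less(i: number, j: number): boolean {
    const a = this.heap[i], b = this.heap[j]
    return a.priority < b.priority || (a.priority === b.priority && a.seq < b.seq)
  }
}
```

Because `seq` values are unique, the comparison is a strict total order — which also sidesteps the unorderable-payload problem the Python docs warn about.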
Go: container/heap Package
Go’s container/heap requires implementing an interface rather than providing a ready-to-use type:
```go
// Go's heap requires implementing heap.Interface,
// which embeds sort.Interface (Len, Less, Swap) plus Push and Pop
package main

import "container/heap"

type IntHeap []int

func (h IntHeap) Len() int           { return len(h) }
func (h IntHeap) Less(i, j int) bool { return h[i] < h[j] }
func (h IntHeap) Swap(i, j int)      { h[i], h[j] = h[j], h[i] }

// Push and Pop are called by the heap package, not directly
func (h *IntHeap) Push(x any) { *h = append(*h, x.(int)) }

func (h *IntHeap) Pop() any {
	old := *h
	n := len(old)
	x := old[n-1]
	*h = old[0 : n-1]
	return x
}

// Usage
func main() {
	h := &IntHeap{5, 1, 8, 3, 2}
	heap.Init(h)       // O(n)
	heap.Push(h, 4)    // O(log n)
	min := heap.Pop(h) // O(log n)
	_ = min
}
```

Why Go’s design is confusing: the Push and Pop methods on your type are for the heap package to call internally — you call heap.Push(h, x) and heap.Pop(h). This indirection enables generic algorithms without generics (pre-Go 1.18 design), and container/heap was never re-typed after generics landed19.
C++ and Java
- C++ does not expose a “heap object”. The standard library provides the algorithm trio std::make_heap, std::push_heap, std::pop_heap operating on any random-access range, and std::priority_queue is a thin container adaptor over a std::vector plus those algorithms. Default is a max-heap; pass std::greater<T> as the comparator for min-heap behavior20.
- Java ships java.util.PriorityQueue, an array-backed binary heap. Default is a min-heap; pass Comparator.reverseOrder() for max-heap behavior. Like heapq, it is unbounded, unsynchronised, and not stable for equal priorities.
JavaScript: No Built-in Heap
JavaScript has no standard heap or priority queue — neither ECMAScript nor the WHATWG runtime specs define one. Common approaches:
- Third-party libraries — fastpriorityqueue is V8-tuned; heap-js is API-friendly.
- Roll your own — ~50 lines for a basic binary heap.
- Sorted array — O(n) insert but trivial; fine for very small n.
```typescript
// Production-grade implementation would add error handling
class MinHeap<T> {
  private heap: T[] = []

  constructor(private compare: (a: T, b: T) => number = (a, b) => (a as any) - (b as any)) {}

  push(val: T): void {
    this.heap.push(val)
    let i = this.heap.length - 1
    while (i > 0) {
      const p = (i - 1) >> 1
      if (this.compare(this.heap[i], this.heap[p]) >= 0) break
      ;[this.heap[i], this.heap[p]] = [this.heap[p], this.heap[i]]
      i = p
    }
  }

  pop(): T | undefined {
    if (this.heap.length <= 1) return this.heap.pop()
    const top = this.heap[0]
    this.heap[0] = this.heap.pop()!
    let i = 0
    while (true) {
      const l = 2 * i + 1, r = 2 * i + 2
      let min = i
      if (l < this.heap.length && this.compare(this.heap[l], this.heap[min]) < 0) min = l
      if (r < this.heap.length && this.compare(this.heap[r], this.heap[min]) < 0) min = r
      if (min === i) break
      ;[this.heap[i], this.heap[min]] = [this.heap[min], this.heap[i]]
      i = min
    }
    return top
  }

  peek(): T | undefined {
    return this.heap[0]
  }

  get size(): number {
    return this.heap.length
  }
}
```

Real-World Applications
Dijkstra’s Shortest Path
The canonical priority-queue application. Each vertex gets a tentative distance; the heap efficiently finds the next vertex to process:
```typescript
// Graph represented as adjacency list
// Edge: { to: number, weight: number }
interface Edge {
  to: number
  weight: number
}

type Graph = Edge[][]

interface HeapEntry {
  vertex: number
  distance: number
}

function dijkstra(graph: Graph, source: number): number[] {
  const n = graph.length
  const dist = Array(n).fill(Infinity)
  dist[source] = 0

  const heap = new MinHeap<HeapEntry>((a, b) => a.distance - b.distance)
  heap.push({ vertex: source, distance: 0 })
  const visited = new Set<number>()

  while (heap.size > 0) {
    const { vertex, distance } = heap.pop()!
    if (visited.has(vertex)) continue // Skip stale entries
    visited.add(vertex)

    for (const { to, weight } of graph[vertex]) {
      const newDist = distance + weight
      if (newDist < dist[to]) {
        dist[to] = newDist
        heap.push({ vertex: to, distance: newDist }) // Duplicate node strategy
      }
    }
  }
  return dist
}
```

Complexity: O((V + E) log V) with a binary heap. For dense graphs (E ≈ V²) this becomes O(V² log V); a Fibonacci heap improves it to O(E + V log V), but the constant-factor penalty is rarely worth it on real hardware11.
K-way Merge (External Sorting)
Merging k sorted streams using a heap of size k:
```typescript
// Merge k sorted iterators into one sorted output
// Used in external sorting, merge sort, database joins
interface StreamEntry<T> {
  value: T
  streamIndex: number
}

function* kWayMerge<T>(streams: Iterator<T>[], compare: (a: T, b: T) => number): Generator<T> {
  const heap = new MinHeap<StreamEntry<T>>((a, b) => compare(a.value, b.value))

  // Initialize heap with the first element from each stream
  for (let i = 0; i < streams.length; i++) {
    const result = streams[i].next()
    if (!result.done) {
      heap.push({ value: result.value, streamIndex: i })
    }
  }

  // Extract min and refill from the same stream
  while (heap.size > 0) {
    const { value, streamIndex } = heap.pop()!
    yield value
    const next = streams[streamIndex].next()
    if (!next.done) {
      heap.push({ value: next.value, streamIndex })
    }
  }
}
```

Complexity: O(n log k) where n is the total elements across all streams. Each of n elements enters and leaves the heap once at O(log k). This is the engine inside heapq.merge, external merge sort in databases, and the merge step of LSM-tree compaction. A loser-tree (tournament tree) variant cuts the per-pop comparison count from roughly 2 log₂ k — a binary heap examines two children per level — to about log₂ k.
Event-Driven Simulation / Task Scheduling
Priority queues order events by timestamp, enabling efficient simulation:
```typescript
// Generic event-driven simulation framework
interface Event {
  timestamp: number
  execute: () => Event[] // Returns new events to schedule
}

class EventScheduler {
  private queue = new MinHeap<Event>((a, b) => a.timestamp - b.timestamp)
  private currentTime = 0

  schedule(event: Event): void {
    if (event.timestamp < this.currentTime) {
      throw new Error("Cannot schedule event in the past")
    }
    this.queue.push(event)
  }

  run(until: number): void {
    while (this.queue.size > 0 && this.queue.peek()!.timestamp <= until) {
      const event = this.queue.pop()!
      this.currentTime = event.timestamp
      const newEvents = event.execute()
      for (const e of newEvents) {
        this.schedule(e)
      }
    }
  }
}
```

Finding K Largest/Smallest Elements
A min-heap of size k efficiently tracks the k largest elements in a stream:
```typescript
// O(n log k) for n elements, O(k) space
function topK<T>(items: Iterable<T>, k: number, compare: (a: T, b: T) => number): T[] {
  const heap = new MinHeap<T>(compare)

  for (const item of items) {
    if (heap.size < k) {
      heap.push(item)
    } else if (compare(item, heap.peek()!) > 0) {
      heap.pop()
      heap.push(item)
    }
  }

  // Extract in sorted order
  const result: T[] = []
  while (heap.size > 0) {
    result.push(heap.pop()!)
  }
  return result.reverse()
}
```

For the k largest, use a min-heap of size k and keep only elements larger than the current minimum. The heap always contains the k largest seen so far. This is exactly how heapq.nlargest and heapq.nsmallest work internally.
Edge Cases and Failure Modes
Empty Heap Operations
extractMin and peek on an empty heap should be handled gracefully:
```typescript
// Return undefined rather than throw
peek(): T | undefined {
  return this.heap[0] // undefined if empty
}

pop(): T | undefined {
  if (this.heap.length === 0) return undefined
  // ... rest of implementation
}
```
Incorrect comparisons cause subtle bugs:
```typescript
// Common mistakes in comparison functions

// WRONG: NaN comparison
const badCompare = (a: number, b: number) => a - b
// NaN - anything = NaN, which is neither < 0 nor >= 0

// WRONG: Inconsistent ordering
const unstable = <T>(a: T, b: T) => Math.random() - 0.5
// Heap operations require a transitive, antisymmetric comparison

// CORRECT: Handle edge cases
const safeCompare = (a: number, b: number) => {
  if (Number.isNaN(a)) return 1  // Push NaN to bottom
  if (Number.isNaN(b)) return -1
  return a - b
}
```
If stored objects are mutated, the heap property breaks:
```typescript
// DO NOT mutate objects in a heap without re-heapifying
interface Task {
  priority: number
  name: string
}

const heap = new MinHeap<Task>((a, b) => a.priority - b.priority)
const task = { priority: 5, name: "important" }
heap.push(task)

// WRONG: This corrupts the heap
task.priority = 1 // Heap doesn't know about this change

// CORRECT: Remove, update, re-insert
// Or use immutable objects
// Or implement decrease-key that maintains the heap invariant
```

Caution
A silent heap corruption surfaces as extractMin returning elements out of order, sometimes thousands of operations later. The original mutation site is long gone. If you cannot guarantee immutability, defensively re-heapify (O(n)) before any read-critical extraction.
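A defensive check along those lines is a one-pass scan of parent/child pairs (`isValidMinHeap` is an illustrative helper, not a library API):

```typescript
// O(n) invariant check: every node must dominate its children.
// Run before a read-critical extraction if immutability can't be
// guaranteed; on failure, re-heapify with buildHeap.
function isValidMinHeap<T>(heap: T[], compare: (a: T, b: T) => number): boolean {
  for (let i = 1; i < heap.length; i++) {
    const parent = (i - 1) >> 1
    if (compare(heap[parent], heap[i]) > 0) return false
  }
  return true
}
```

Checking child-against-parent (rather than parent-against-children) visits each edge exactly once with no index bounds juggling.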
Integer Overflow in Index Calculations
For very large heaps (>2³⁰ elements), 32-bit index arithmetic can overflow. In JavaScript, Number is double-precision so safe up to 2⁵³, but the bitwise shift trick (i - 1) >> 1 truncates to int32 — guard with Math.floor((i - 1) / 2) for heaps that could exceed 2³¹ entries.
```typescript
// Use Math.floor for very large heaps; bitwise shift truncates to int32
function leftChild(i: number): number {
  const result = 2 * i + 1
  if (result < i) throw new Error("Index overflow")
  return result
}
```

In practice, heaps rarely exceed millions of elements — a 4-byte integer index handles up to ~2 billion entries.
Conclusion
Binary heaps provide the right balance of simplicity, performance, and generality for most priority-queue needs. The array representation eliminates pointer overhead and exploits cache locality. Understanding the O(n) buildHeap analysis and why heap sort underperforms quicksort despite better worst-case bounds illuminates the gap between theoretical and practical performance.
For graph algorithms, the duplicate-node strategy — insert new entries instead of decreaseKey — usually beats sophisticated heaps by avoiding position-tracking overhead. When the heap exceeds cache, consider a 4-ary heap for the cache-friendlier wider tree. Fibonacci heaps remain a textbook curiosity except for specialised applications on very large, dense graphs.
The priority-queue abstraction itself matters more than the heap variant. Standard libraries provide adequate implementations; optimise the data structure only after profiling proves it’s the bottleneck.
Appendix
Prerequisites
- Big O notation and amortized analysis.
- Basic tree terminology (height, complete, balanced).
- Array indexing and memory layout concepts.
Terminology
- Heap property — parent dominates children (min-heap: parent ≤ children, max-heap: parent ≥ children).
- Complete binary tree — all levels filled except possibly the last, which fills left-to-right.
- Bubble up (sift up) — move an element toward the root to restore the heap property.
- Bubble down (sift down, heapify) — move an element toward the leaves to restore the heap property.
- Decrease-key — update an element’s priority and restore the heap property; critical for graph algorithms.
- Amortized analysis — average cost per operation over a sequence, allowing expensive operations if rare.
- D-ary heap — generalisation where each node has d children instead of 2.
- Implicit data structure — structure encoded in array positions rather than explicit pointers.
Summary
- Binary heaps store complete trees in arrays; parent/child relationships come from index arithmetic.
- Insert bubbles up (O(log n)), extract bubbles down (O(log n)), peek is O(1).
- `buildHeap` runs in O(n), not O(n log n) — most nodes are cheap to heapify.
- Heap sort guarantees O(n log n) but loses to quicksort due to cache effects.
- 4-ary heaps run roughly 10-30% faster than binary heaps on out-of-cache workloads.
- Fibonacci heaps have better theoretical bounds but lose to pairing heaps and binary heaps in practice.
- For Dijkstra without explicit decrease-key, use the duplicate-node strategy with a binary heap.
- JavaScript lacks a built-in heap; Python’s heapq is min-heap only; Go requires interface implementation; C++ ships max-heap algorithms; Java ships a min-heap class.
Footnotes
1. Binary heap — Wikipedia. Comprehensive overview of binary heap structure and array-index relationships.
2. CS 161 Lecture 4 — Stanford. The bottom-up buildHeap O(n) proof from CLRS, with the converging Σ h/2^h series.
3. Fibonacci heap — Wikipedia. Detailed analysis of Fibonacci heap amortized bounds.
4. “What are the disadvantages of Fibonacci Heaps?” — CS Stack Exchange. Summary of constant-factor and cache penalties; cross-validates against the Larkin/Sen/Tarjan paper.
5. A. LaMarca and R. E. Ladner, “The Influence of Caches on the Performance of Heaps”, ACM Journal of Experimental Algorithmics, 1996. Foundational measurement of cache misses across heap fanouts; introduces aligned 4-heaps.
6. d-ary heap — Wikipedia. Summary of cache-performance benefits and reported speedups for d > 2.
7. Lib/heapq.py — CPython source. Comments at the top of the file describe the bottom-up _siftup optimization and explicitly cite Knuth, TAOCP Vol. 3, §5.2.3.
8. A. LaMarca and R. E. Ladner, “The Influence of Caches on the Performance of Sorting”, Journal of Algorithms, 1999. Companion paper covering heap sort, merge sort, and quicksort cache behaviour.
9. Heapsort — Wikipedia. Stability and cache-performance characteristics, plus links to bottom-up variants.
10. P. Sanders, “Fast Priority Queues for Cached Memory”, ACM JEA, 1999. Shows aligned 4-ary and sequence heaps running markedly faster than binary heaps in cache hierarchies.
11. “Time Complexity of Dijkstra’s Algorithm” — Baeldung on CS. Side-by-side derivation of binary-heap and Fibonacci-heap variants; covers the duplicate-node strategy.
12. J. T. Stasko and J. S. Vitter, “Pairing Heaps: Experiments and Analysis”, CACM, 1987. Earliest large experiment showing pairing heaps beat Fibonacci heaps in practice.
13. D. H. Larkin, S. Sen, and R. E. Tarjan, “A Back-to-Basics Empirical Study of Priority Queues”, ALENEX 2014. Modern replication of pairing-vs-Fibonacci experiments.
14. Pairing heap — Wikipedia. Survey of pairing-heap experiments, including Larkin-Sen-Tarjan.
15. “15-210 Lecture 27 — Priority Queues and Leftist Heaps”, Carnegie Mellon. Proves the O(log n) work bound for leftist-heap meld.
16. Brodal queue — Wikipedia. Summary of the worst-case bounds achieved by Brodal (1996) and Brodal–Okasaki (1996).
17. G. S. Brodal, “Worst-case efficient priority queues”, SODA 1996. Original construction with worst-case O(1) insert / find-min / meld / decrease-key.
18. heapq — Heap queue algorithm — Python documentation. Documents min-heap-only behaviour, the (priority, count, task) tiebreaker pattern, and the lack of stability.
19. “Why Are Golang Heaps So Complicated?” — DoltHub. Walkthrough of container/heap’s pre-generics interface design.
20. std::priority_queue — cppreference. Specifies the binary-heap algorithms (make_heap/push_heap/pop_heap) underneath the adaptor.