Node.js Runtime Architecture: Event Loop, Streams, and APIs
Node.js is a host environment that pairs V8 with libuv, a C++ bindings layer, and a large set of JavaScript core modules and Application Programming Interfaces (APIs). This article focuses on the runtime boundaries that matter at scale: event loop ordering, microtasks, thread pool backpressure, buffer and stream memory, and Application Binary Interface (ABI) stable extension points. Coverage is current as of Node.js 22 LTS (libuv 1.49+, V8 12.4+).
Abstract
Node.js runtime behavior is defined by three boundaries: the V8 JavaScript-to-native edge (where JIT tiers and GC pauses originate), the libuv event loop (where phase ordering and microtask interleaving determine callback timing), and the thread pool (where nominally “async” file I/O becomes a blocking queue). Mastering the runtime means understanding what crosses each boundary and when.
Key mental models:
- Scheduling: process.nextTick drains before Promise microtasks, which drain before any event loop phase advances. Unbounded recursion in either queue starves I/O.
- Thread pool bottleneck: File system ops, dns.lookup, and CPU-bound crypto share a default 4-thread pool. Network I/O and dns.resolve* bypass it entirely.
- Backpressure contract: Writable.write() returning false is a hard stop signal; ignoring it causes unbounded memory growth until OOM or abort.
- ABI stability: Node-API is the only stable native addon interface across majors. Direct V8/libuv header use requires a rebuild per major.
Runtime Layers and Call Path
Node.js is a host for ECMAScript. V8 executes JavaScript, libuv implements the event loop and I/O multiplexing, and a C++ bindings layer translates JavaScript calls into libuv and native library calls. libuv runs a single-threaded event loop and polls non-blocking sockets using the best platform backend (epoll, kqueue, I/O Completion Ports (IOCP)). The loop itself is not thread-safe, so cross-thread work must hop via libuv’s queues.
The design trade-off is intentional: a single event loop keeps scheduling deterministic and lightweight, but any synchronous Central Processing Unit (CPU)-bound work on the main thread blocks every connection.
Example: A gateway handling 30k idle keep-alive sockets is fine as long as request handlers avoid synchronous JSON parsing. A single 20 ms CPU spike blocks all 30k sockets.
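A minimal sketch of that failure mode, with a busy-wait standing in for synchronous parsing (durations are illustrative):

```js
// Schedule a timer for 10 ms, then hold the thread for 20 ms.
// The callback cannot run until the synchronous work yields.
const start = Date.now()
setTimeout(() => {
  console.log(`timer fired after ${Date.now() - start} ms`) // ~20 ms, not 10
}, 10)

const busyUntil = Date.now() + 20 // stand-in for sync JSON.parse on a large payload
while (Date.now() < busyUntil) { /* burn CPU on the main thread */ }
```

Every pending socket callback is delayed the same way; the spike is invisible to any single request but taxes all of them at once.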
Event Loop, Microtasks, and Ordering
The Node.js event loop executes callbacks in six phases:
- timers - setTimeout() and setInterval() callbacks
- pending callbacks - I/O callbacks deferred from the previous iteration
- idle, prepare - internal libuv use
- poll - retrieve new I/O events, execute I/O callbacks
- check - setImmediate() callbacks
- close callbacks - close event handlers (socket.on('close'))
As of libuv 1.45 (Node.js 20+), timers run only after the poll phase, changing the timing of setTimeout() relative to setImmediate(). Prior to libuv 1.45, timers could fire both before and after poll, making the setTimeout(0) vs setImmediate() race nondeterministic even inside I/O callbacks.
process.nextTick() is not part of the event loop phases. Its queue drains after the current JavaScript stack completes and before the loop advances to the next phase. The draining order is:
- All process.nextTick() callbacks (recursively, until empty)
- All Promise microtask callbacks (recursively, until empty)
- Next event loop phase
ECMAScript defines job queues for Promise reactions but leaves inter-queue ordering to the host:
“This specification does not define the order in which multiple Job Queues are serviced.” (ECMA-262)
Node’s implementation choice to drain nextTick before Promises is a deliberate departure from browser behavior, where only the microtask queue exists. This creates a subtle portability trap: code relying on nextTick priority behaves differently in browsers using queueMicrotask().
```js
import { readFile } from "node:fs"

// Inside an I/O callback, scheduling order is deterministic.
readFile(import.meta.filename, () => {
  setTimeout(() => console.log("timeout"), 0)
  setImmediate(() => console.log("immediate"))
})
```

Example: Inside an I/O callback on libuv 1.45+, setImmediate always fires before setTimeout(0) because the check phase precedes the next timers phase. Outside an I/O callback, the order depends on process timing and remains nondeterministic.
Edge case: Recursive nextTick chains starve I/O indefinitely. A callback that reschedules itself with process.nextTick() blocks the event loop forever, because the loop cannot advance to the next phase until the nextTick queue is empty.
libuv Thread Pool: When Async I/O Is Not Truly Non-Blocking
libuv uses a shared thread pool to offload operations that lack non-blocking OS primitives. The pool is global, shared across all event loops in the process, and initialized lazily when the first operation is submitted to it.
Pool configuration:
- Default size: 4 threads
- Maximum: 1024 (increased from 128 in libuv 1.30.0)
- Set via the UV_THREADPOOL_SIZE environment variable at process startup
- Cannot be resized after the first I/O operation triggers pool initialization
Operations using the thread pool:
| Category | Operations |
|---|---|
| File system | All fs APIs except fs.FSWatcher() and explicitly synchronous methods |
| DNS | dns.lookup(), dns.lookupService() |
| Crypto | crypto.pbkdf2(), crypto.scrypt(), crypto.randomBytes(), crypto.randomFill(), crypto.generateKeyPair() |
| Compression | All zlib APIs except explicitly synchronous methods |
Operations bypassing the thread pool:
| Category | Operations | Mechanism |
|---|---|---|
| Network I/O | TCP, UDP, HTTP sockets | OS non-blocking sockets (epoll/kqueue/IOCP) |
| DNS resolution | dns.resolve*() methods | c-ares library (true async) |
The distinction between dns.lookup() (uses thread pool, honors /etc/hosts) and dns.resolve*() (uses c-ares, DNS protocol only) is a common source of confusion. Under load, dns.lookup() can saturate the thread pool and block file system operations.
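A quick contrast using the promise-based dns API (the hostname is illustrative):

```js
import { lookup, resolve4 } from "node:dns/promises"

// lookup() calls getaddrinfo on the libuv thread pool and honors
// /etc/hosts; under load it competes with fs and crypto work.
console.log(await lookup("example.com"))   // { address: '…', family: 4 }

// resolve4() goes through c-ares: a real DNS query over the network,
// bypassing the thread pool entirely (and never reading /etc/hosts).
console.log(await resolve4("example.com")) // [ '…' ]
```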
Example: On a 16-core machine, 64 concurrent fs.readFile calls still execute only four at a time by default. Raising UV_THREADPOOL_SIZE reduces queueing but increases memory footprint and context-switch overhead. A common production value is 64–128 for I/O-heavy workloads.
Edge case: Setting UV_THREADPOOL_SIZE after any I/O operation has already occurred has no effect. The pool size is fixed at first use. This catches teams who try to configure it dynamically based on runtime conditions.
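A common workaround, assuming nothing earlier in the process has touched the pool: set the variable as the very first statement of the entry module, before loading anything that might submit pool work.

```js
// entry.js (ESM) — must execute before any fs/dns.lookup/crypto/zlib call.
// Setting this later is silently ignored once the pool is initialized.
process.env.UV_THREADPOOL_SIZE = "64"

// Static imports are hoisted above this assignment, so use dynamic
// import() for anything that could initialize the pool.
const { readFile } = await import("node:fs/promises")
await readFile(new URL(import.meta.url)) // now runs on the 64-thread pool
```

Setting it in the shell (`UV_THREADPOOL_SIZE=64 node entry.js`) avoids the hoisting trap entirely.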
V8 Execution and Garbage Collection in Node
V8 runs a four-tier Just-In-Time (JIT) pipeline, each tier trading compilation speed for generated code quality:
| Tier | Role | Trigger threshold | Speed vs quality |
|---|---|---|---|
| Ignition | Interpreter | First call | Instant start, slowest execution |
| Sparkplug | Baseline compiler | ~8 invocations | Compiles bytecode to machine code without optimization; no IR (Intermediate Representation) |
| Maglev | Mid-tier optimizer | ~500 invocations | SSA-based CFG (Control Flow Graph); ~10× slower than Sparkplug, ~10× faster than TurboFan compile |
| TurboFan | Advanced optimizer | ~6000 invocations | Aggressive speculative optimization; peak throughput |
Sparkplug was introduced in V8 9.1 (Chrome 91). Maglev shipped in V8 11.7 (Chrome M117) and is available in Node.js 22+.
The tiering thresholds reset if type feedback changes—for example, if a function previously saw only integers suddenly receives a string, it may deoptimize and re-tier. This explains why monomorphic call sites (single type) optimize better than polymorphic ones.
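A sketch of the effect (exact thresholds and tier choices vary by V8 version; observe tiering with `node --trace-opt --trace-deopt`):

```js
// Monomorphic: every call sees one hidden class, so after tier-up the
// property load compiles down to a single checked offset read.
function getX(p) { return p.x }
for (let i = 0; i < 1e6; i++) getX({ x: i })

// Polymorphic: mixing object shapes at the same call site widens the
// inline cache and can deoptimize already-tiered code.
for (let i = 0; i < 1e6; i++) {
  getX(i % 2 ? { x: i } : { x: i, y: i }) // two hidden classes
}
```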
V8 uses generational Garbage Collection (GC). The young generation is intentionally small (up to ~16 MiB in the Orinoco design), so it fills quickly and triggers frequent scavenges; survivors are promoted to the old generation. The Orinoco parallel scavenger splits young-generation collection across multiple threads, reducing pause times to sub-millisecond on typical workloads.
GC pause implications:
- Young-generation scavenges are frequent but fast (~0.5–2 ms)
- Old-generation major GCs can pause 10–100+ ms on large heaps
- Incremental marking interleaves old-generation work with application execution, reducing peak pause
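GC pauses can be observed from JavaScript via perf_hooks; a minimal sketch, using `entry.detail.kind` to separate scavenges from major collections:

```js
import { PerformanceObserver, constants } from "node:perf_hooks"

// Log every GC pause with its type and duration in milliseconds.
const obs = new PerformanceObserver((list) => {
  for (const entry of list.getEntries()) {
    const minor = entry.detail?.kind === constants.NODE_PERFORMANCE_GC_MINOR
    console.log(`GC ${minor ? "scavenge" : "major/other"}: ${entry.duration.toFixed(2)} ms`)
  }
})
obs.observe({ entryTypes: ["gc"] })
```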
Example: Long-lived services with stable object shapes benefit from Maglev and TurboFan tier-up. Short-lived serverless functions (sub-100ms) spend most execution time in Ignition and Sparkplug, making startup latency and interpreter performance the dominant factors. For serverless, minimizing cold-start bytecode size matters more than peak TurboFan throughput.
Buffers and Streams: Moving Bytes Without Blowing RAM
Buffers
Buffer is a subclass of Uint8Array; Node.js APIs accept plain Uint8Array where buffers are supported. Buffer.allocUnsafe() returns uninitialized memory—this is a security consideration because the memory may contain sensitive data from previous allocations.
Pool behavior:
- Buffer.poolSize default: 8192 bytes (8 KiB)
- Pool used when allocation size < Buffer.poolSize >>> 1 (4096 bytes)
- Methods using the pool: Buffer.allocUnsafe(), Buffer.from(array), Buffer.from(string), Buffer.concat()
- Buffer.allocUnsafeSlow() never uses the pool (allocates a dedicated ArrayBuffer)
process.memoryUsage().arrayBuffers includes memory held by Buffer and SharedArrayBuffer objects, distinct from the V8 heap.
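The pooling is directly observable, because a pooled Buffer is a view over a larger shared ArrayBuffer (slab reuse depends on allocation order, so treat this as a sketch):

```js
const a = Buffer.allocUnsafe(16)
const b = Buffer.allocUnsafe(16)
console.log(a.buffer.byteLength)   // 8192: the shared pool slab
console.log(a.buffer === b.buffer) // typically true: sliced from the same slab

const c = Buffer.allocUnsafeSlow(16)
console.log(c.buffer.byteLength)   // 16: dedicated ArrayBuffer, no pool
```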
Example: If arrayBuffers grows by 200 MB while heapUsed is flat, you have large binary payloads or pooled buffers, not a JavaScript heap leak. Conversely, thousands of small allocUnsafe calls are sliced from shared 8 KiB slabs, so they show up as memory growth only when a slab fills and a new one is allocated.
Streams and backpressure
Writable.write() returns false when the internal buffer exceeds highWaterMark. This is a hard signal to stop writing until the drain event fires.
Default highWaterMark values:
- Binary streams: 16384 bytes (16 KiB)
- Object mode streams: 16 objects
highWaterMark is a threshold, not a hard limit. Specific stream implementations may enforce stricter limits, but the base classes do not prevent writes when the threshold is exceeded—they only signal via the return value.
```js
const ok = writable.write(chunk)
if (!ok) {
  // Hard stop: resume producing only after the internal buffer drains.
  writable.once("drain", resume)
}
```

Failure mode: Ignoring write() → false forces Node to buffer indefinitely. Memory grows until the process hits heap limits and aborts, or the OS kills it. This is also a security vector: a slow or malicious client that never drains can cause unbounded memory growth (a slowloris-style attack on memory).
Example: A 200 MB upload piped through stream.pipeline() stays bounded by highWaterMark. The same upload pushed via a tight write() loop can spike Resident Set Size (RSS) to gigabytes and trigger an OOM abort.
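A bounded version of that upload path, using the promise form of pipeline (file names are illustrative):

```js
import { createReadStream, createWriteStream } from "node:fs"
import { pipeline } from "node:stream/promises"

// pipeline() propagates backpressure and errors end to end: the source
// pauses whenever the sink's buffered bytes exceed highWaterMark, so
// RSS stays bounded no matter how large the input is.
await pipeline(
  createReadStream("upload.bin"),
  createWriteStream("/tmp/out.bin")
)
```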
Web Streams alignment
Node’s Web Streams API implements the WHATWG Streams Standard and has been stable since Node.js 21.0.0 (Stability 2). Prior to v21, it was experimental and emitted runtime warnings.
Web Streams define a queuing strategy with a highWaterMark that can be any non-negative number, including Infinity (which disables backpressure). The semantics differ from Node streams: Web Streams use a “pull” model with explicit controller signaling rather than the Node “push + drain” model.
Interop helpers:
- Readable.toWeb() / Writable.toWeb() - convert Node streams to Web Streams
- Readable.fromWeb() / Writable.fromWeb() - convert Web Streams to Node streams
Example: When building a library shared between browser and Node, prefer Web Streams for the public API and convert internally with toWeb() / fromWeb(). This keeps backpressure semantics consistent across environments.
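A sketch of that pattern (the file path and function name are illustrative):

```js
import { Readable } from "node:stream"
import { createReadStream } from "node:fs"

// Public API returns a WHATWG ReadableStream, backed by a Node source.
export function openLogStream() {
  return Readable.toWeb(createReadStream("./app.log"))
}

// Consumers use the standard reader protocol in Node or the browser.
const reader = openLogStream().getReader()
const { value, done } = await reader.read()
```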
Native Extensions and ABI Stability
Node-API (formerly N-API) is the only ABI-stable surface for native addons. It was introduced in Node.js 8.0 as experimental, marked stable in Node.js 8.12, and has been the recommended approach since.
ABI stability guarantee:
- Functions in node_api.h are ABI-stable across major Node.js versions
- Addons compiled against Node-API run on later Node.js versions without recompilation
- Node-API versions are additive: a higher version includes all previous version symbols
Current Node-API versions (as of Node.js 22):
| Node-API version | Minimum Node.js |
|---|---|
| 10 | 22.14.0, 23.6.0 |
| 9 | 18.17.0, 20.3.0 |
| 8 | 12.22.0, 14.17.0, 16.0.0 |
If NAPI_VERSION is not defined at compile time, it defaults to 8.
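The supported version is exposed at runtime, which helps when validating a prebuilt binary against the host:

```js
// The highest Node-API version this runtime implements; an addon
// compiled with NAPI_VERSION <= this value will load.
console.log(process.versions.napi) // e.g. '10' on recent Node.js 22.x
```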
What is NOT ABI-stable:
- Node.js C++ APIs (node.h, node_buffer.h)
- libuv APIs (uv.h)
- V8 APIs (v8.h)
Using any of these headers directly requires rebuilding for each major Node.js version. V8’s C++ API changes frequently; even minor V8 updates can break compilation.
Example: If you ship a native addon using only node_api.h, a single build runs across Node.js 16, 18, 20, 22, and 24. If you include v8.h for direct V8 access, expect to maintain separate builds per major version and handle API churn (e.g., V8’s handle scope changes, isolate API updates).
Module Loading and Package Scope
Within a package, the package.json "type" field defines how Node.js treats .js files:
"type" value | .js treatment | Default |
|---|---|---|
"module" | ES module | No |
"commonjs" | CommonJS | Yes (when "type" absent) |
File extensions override the "type" field: .mjs is always ES module, .cjs is always CommonJS.
require(esm) support (Node.js 22.12+)
As of Node.js 22.12.0 (backported to 20.19.0), require() can load ES modules without flags. This eliminates the historical barrier where CommonJS code could not synchronously import ES modules.
Limitation: require(esm) only works for ES modules without top-level await. If the module uses top-level await, Node throws ERR_REQUIRE_ASYNC_MODULE.
```js
// Node.js 22.12+ allows this:
const esModule = require("./es-module.mjs")
console.log(esModule.default)
```

Prior to Node.js 22.12, require() could not load ES modules at all. The only way to use ESM from CJS was dynamic import(), which returns a Promise and cannot be used synchronously. This forced many libraries to ship dual CJS/ESM builds.
Check process.features.require_module to verify if your runtime supports require(esm). The feature can be disabled with --no-experimental-require-module.
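A feature-detecting loader that stays compatible with older runtimes (the module path is illustrative):

```js
// CommonJS entry point.
async function loadConfig() {
  if (process.features.require_module) {
    return require("./config.mjs")  // Node.js 22.12+ / 20.19+: synchronous
  }
  return import("./config.mjs")     // older runtimes: asynchronous fallback
}

loadConfig().then((config) => console.log(config.default))
```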
Example: In a "type": "module" package, name legacy config files *.cjs to keep tooling that still uses require() working. In Node.js 22.12+, you can simplify by using require() to load .mjs files directly, avoiding the dual-build complexity.
Observability and Failure Modes
Event loop delay monitoring
perf_hooks.monitorEventLoopDelay() samples loop delays in nanoseconds and exposes them as a histogram. This is the primary tool for detecting event loop blocking.
```js
import { monitorEventLoopDelay } from "node:perf_hooks"

const h = monitorEventLoopDelay({ resolution: 20 })
h.enable()
```

Interpreting delay metrics:
| Metric | Healthy value | Investigation path |
|---|---|---|
| p50 delay | < 5ms | Normal operation |
| p99 delay | < 50ms | Occasional GC or sync work |
| p99 delay | > 100ms | Synchronous CPU blocking, large GC, or thread pool starvation |
| min/max spread | < 10× | Consistent behavior |
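To turn those thresholds into an alert, sample the histogram periodically. Values are nanoseconds, so the conversion to milliseconds is the easy thing to get wrong (the thresholds below mirror the table and are tunable):

```js
import { monitorEventLoopDelay } from "node:perf_hooks"

const h = monitorEventLoopDelay({ resolution: 20 })
h.enable()

setInterval(() => {
  const p50 = h.percentile(50) / 1e6 // ns → ms
  const p99 = h.percentile(99) / 1e6
  if (p99 > 100) console.warn(`event loop p99 ${p99.toFixed(1)} ms`)
  console.log({ p50, p99, max: h.max / 1e6 })
  h.reset() // start a fresh measurement window
}, 10_000).unref()
```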
Memory breakdown
process.memoryUsage() returns:
- heapUsed / heapTotal - V8 JavaScript heap
- external - C++ objects bound to JavaScript (e.g., native addon memory)
- arrayBuffers - Buffer, ArrayBuffer, SharedArrayBuffer (outside the V8 heap)
- rss - total resident set size (includes all of the above plus code, stacks, etc.)
Example: If arrayBuffers grows by 200 MB while heapUsed is flat, you have binary payloads or stream buffers accumulating, not a JavaScript object leak. Profile with --heapsnapshot-near-heap-limit to catch JavaScript leaks; use /proc/[pid]/smaps on Linux to diagnose native memory.
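A sketch that logs deltas between samples, attributing growth to the right pool without attaching a debugger:

```js
let prev = process.memoryUsage()
const mb = (n) => (n / 1048576).toFixed(1)

setInterval(() => {
  const cur = process.memoryUsage()
  console.log(
    `Δheap ${mb(cur.heapUsed - prev.heapUsed)} MB, ` +
    `ΔarrayBuffers ${mb(cur.arrayBuffers - prev.arrayBuffers)} MB, ` +
    `Δrss ${mb(cur.rss - prev.rss)} MB`
  )
  prev = cur
}, 30_000).unref()
```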
Diagnostic failure patterns
| Symptom | Likely cause | Investigation |
|---|---|---|
| Low loop delay, high file latency | Thread pool saturation | Check UV_THREADPOOL_SIZE, reduce concurrent fs calls |
| High loop delay spikes | Synchronous CPU work | Profile with --cpu-prof, look for JSON.parse/stringify on large payloads |
| Steady loop delay increase | Recursive nextTick/Promise chains | Add nextTick depth limits, break long chains with setImmediate |
| arrayBuffers growth | Stream backpressure ignored | Ensure write() → false pauses producers |
| heapUsed growth | JavaScript object leak | Heap snapshot comparison; look for retained closures and growing caches |
Edge case: process.nextTick() recursion is invisible to monitorEventLoopDelay() because the delay is measured between loop iterations, not within microtask draining. Infinite nextTick chains freeze the process without spiking the delay histogram.
Conclusion
Node.js performance lives at three boundaries: the V8 JIT/GC edge (tiering thresholds, pause times), the libuv event loop (phase ordering, microtask priority), and the thread pool (4-thread default, blocking semantics). Treat these boundaries as explicit design surfaces. Instrument them with monitorEventLoopDelay(), process.memoryUsage(), and thread pool sizing—or they surface as tail latency, memory spikes, and mysterious blocking under load.
Appendix
Prerequisites
- JavaScript execution model and event-driven programming
- OS I/O multiplexing primitives (epoll, kqueue, IOCP)
- Basic understanding of native addons and shared libraries
Terminology
| Term | Definition |
|---|---|
| ABI (Application Binary Interface) | Binary contract between compiled modules; determines if they can link without recompilation |
| API (Application Programming Interface) | Callable surface exposed to users; source-level contract |
| CFG (Control Flow Graph) | Compiler IR representing program control flow as a directed graph |
| GC (Garbage Collection) | Automatic memory reclamation |
| I/O (Input/Output) | Data movement to or from external devices |
| IR (Intermediate Representation) | Compiler’s internal data structure representing code between parsing and code generation |
| JIT (Just-In-Time) | Compilation strategy that compiles code during execution rather than ahead of time |
| OOM (Out of Memory) | Condition where memory allocation fails due to exhausted resources |
| RSS (Resident Set Size) | Memory currently resident in physical RAM for a process |
| SSA (Static Single Assignment) | IR form where each variable is assigned exactly once |
Summary
- Node.js layers V8, libuv, and C++ bindings; performance depends on understanding all three boundaries
- Event loop phases run in fixed order; nextTick > Promises > timers/poll/check
- Thread pool (default 4) handles fs, dns.lookup(), crypto; network I/O and dns.resolve*() bypass it
- V8 tiering (Ignition → Sparkplug → Maglev → TurboFan) explains warmup; GC pauses dominate tail latency on large heaps
- Stream backpressure (write() → false) must be respected or memory grows unbounded
- Node-API is the only ABI-stable addon interface; direct V8/libuv use requires rebuild per major
- require(esm) in Node.js 22.12+ simplifies ESM/CJS interop for synchronous modules
References
Specifications:
- ECMAScript Language Specification - Job queues, Promise semantics
- WHATWG Streams Standard - Queuing strategy, highWaterMark
Node.js Official Documentation:
- Node.js Event Loop, Timers, and nextTick
- Node.js Don’t Block the Event Loop - Thread pool operations
- Node.js Streams API - write(), drain, backpressure
- Node.js Backpressuring in Streams
- Node.js Buffer API - Pool behavior, allocUnsafe
- Node.js Web Streams API
- Node.js Packages (ESM/CJS)
- Node.js ESM Documentation - require(esm) support
- Node.js Performance Hooks - monitorEventLoopDelay
- Node.js process API - memoryUsage, nextTick
- Node-API Documentation
- Node.js ABI Stability
V8 Engine (Core Maintainer):
- Sparkplug: V8’s non-optimizing JavaScript compiler
- Maglev: V8’s fastest optimizing JIT
- Orinoco: young generation garbage collection