Design Collaborative Document Editing (Google Docs)

A comprehensive system design for real-time collaborative document editing covering synchronization algorithms, presence broadcasting, conflict resolution, storage patterns, and offline support. This design addresses sub-second convergence for concurrent edits while maintaining document history and supporting 10-50 simultaneous editors.

High-level architecture: WebSocket-based real-time sync with operation log persistence and periodic snapshots.

Abstract

Collaborative document editing requires solving three interrelated problems: real-time synchronization (all users see changes within milliseconds), conflict resolution (concurrent edits don’t corrupt the document), and durability (no edit is ever lost).

Core architectural decisions:

Decision	Choice	Rationale
Sync algorithm	OT with server ordering	Avoids TP2 complexity; proven at Google scale
Transport	WebSocket	Full-duplex, 1-10ms latency after handshake
Persistence	Event-sourced operation log	Enables revision history, undo, and conflict replay
Presence	Ephemeral broadcast	Cursors don’t need durability; memory-only
Offline	Operation queue with reconciliation	Local-first editing, sync on reconnect

Key trade-offs accepted:

Server dependency for ordering (no true P2P) in exchange for correctness guarantees
Unbounded operation log growth requiring periodic snapshots
Higher memory on collaboration servers (one process per active document)

What this design optimizes:

Sub-100ms operation propagation to all connected clients
Guaranteed convergence regardless of network conditions
Full revision history with efficient retrieval

Requirements

Functional Requirements

Requirement	Priority	Notes
Real-time text editing	Core	Character-level granularity
Concurrent multi-user editing	Core	10-50 simultaneous editors
Live cursor/selection display	Core	See where others are editing
Revision history	Core	View/restore any previous version
Rich text formatting	Core	Bold, italic, headings, lists
Comments and suggestions	Extended	Anchored to text ranges
Offline editing	Extended	Queue operations, sync on reconnect
Tables, images, embeds	Extended	Block-level elements

Non-Functional Requirements

Requirement	Target	Rationale
Availability	99.9% (3 nines)	User-facing, but brief outages acceptable
Edit propagation latency	p99 < 200ms	Real-time feel requires sub-second
Document load time	p99 < 2s	Cold start with full history
Concurrent editors	50 per document	Google Sheets supports ~50; Docs ~10
Operation durability	99.999%	No edit should ever be lost
Revision retention	Indefinite	Full history for compliance

Scale Estimation

Users:

Monthly Active Users (MAU): 500M (Google Docs scale)
Daily Active Users (DAU): 100M (20% of MAU)
Peak concurrent users: 10M

Documents:

Total documents: 5B
Active documents (edited in last 30 days): 500M (10%)
Documents open concurrently at peak: 50M

Traffic:

Operations per active editor: 1-5 per second (typing)
Average editing session: 15 minutes
Peak concurrent editors: 50M documents × 3 editors avg = 150M editing sessions
Operations per second at peak: 150M × 2 ops/sec = 300M ops/sec globally

Storage:

Average operation size: 100 bytes (insert/delete + metadata)
Operations per document per day: 10,000 (active document)
Daily operation storage: 500M docs × 10K ops × 100B = 500TB/day
With snapshots (daily): 500M × 50KB = 25TB/day

Design Paths

Path A: Operational Transformation (Server-Ordered)

Best when:

Always-online with reliable connectivity
Central infrastructure already exists
Correctness is paramount (financial, legal documents)
Team has OT implementation experience or uses existing library

Architecture:

Key characteristics:

Server assigns canonical operation order
Clients transform incoming ops against pending local ops
Single source of truth eliminates TP2 requirement

Trade-offs:

✅ Proven correct (Google Docs, Wave, CKEditor)
✅ Simpler transformation functions (only TP1 needed)
✅ Efficient wire format (operations are small)
❌ Server round-trip required for each operation batch
❌ Limited offline capability (buffer only)
❌ Server is single point of failure per document

Real-world example: Google Docs uses Jupiter-derived OT since 2010. Every character change is saved as an event in a revision log. The document renders by replaying the log from the start (with periodic checkpoints for performance).

Path B: CRDT-Based (Decentralized)

Best when:

Offline-first is critical requirement
P2P scenarios (no server available)
Multi-device sync with unreliable networks
Mathematical convergence proofs required

Architecture:

Key characteristics:

Operations commute without server coordination
Each device maintains full CRDT state
Convergence guaranteed by mathematical properties

Trade-offs:

✅ True offline support
✅ P2P synchronization possible
✅ No server bottleneck
❌ Higher memory (tombstones, metadata)
❌ Slower document loading (replay history)
❌ More complex intent preservation for rich text

Real-world example: Figma uses a CRDT-inspired approach with server reconciliation. They deliberately stop short of full CRDT to truncate history and reduce overhead. Yjs and Automerge are pure CRDT implementations used by many collaborative editors.

Path C: Hybrid (Server-Ordered with CRDT Properties)

Best when:

Need offline support but have server infrastructure
Want CRDT convergence guarantees with OT efficiency
Building on modern algorithms (Eg-walker, Fugue)

Architecture:

Store append-only operation DAG (like CRDT)
Use server for canonical ordering (like OT)
Merge branches using CRDT-like algorithms
Free memory when not actively merging

Trade-offs:

✅ Best of both: efficient steady-state, robust merging
✅ Order of magnitude better performance than pure CRDT
✅ Supports true offline with branch merging
❌ Newest approach, less production validation
❌ More complex implementation

Real-world example: Figma adopted Eg-walker for their code layers feature (2024). Joseph Gentle and Martin Kleppmann proved it achieves O(n log n) merge complexity versus O(n²) for traditional OT.

Path Comparison

Factor	OT (Server)	CRDT	Hybrid
Correctness proof	Straightforward	Mathematical	Mathematical
Offline support	Buffer only	Native	Native
Server dependency	Required	Optional	Optional
Memory overhead	Low	High	Medium
Implementation	Moderate	Complex	Complex
Production examples	Google Docs	Notion, Linear	Figma Code

This Article’s Focus

This article focuses on Path A (OT with server ordering) because:

It’s the most battle-tested approach (15+ years in production at Google)
Most use cases have reliable connectivity
Simpler to implement correctly
Existing libraries (ShareDB, ot.js) provide solid foundations

Path B (CRDT) details are covered in CRDTs for Collaborative Systems.

Connection lifecycle (connect, heartbeat, disconnect)
Route messages to document processors
Broadcast presence updates
Handle reconnection and state recovery

Design decisions:

Decision	Choice	Rationale
Protocol	WebSocket	Full-duplex, 2-14 byte overhead vs HTTP
Session affinity	Sticky by document	All editors of a document hit same server
Heartbeat	30 second interval	Detect dead connections
Reconnection	Exponential backoff	Prevent thundering herd

Scaling approach:

Horizontal scaling with consistent hashing by document ID
One server “owns” each active document
Ownership transfers on server failure via distributed lock

Document Processor (OT Engine)

The core synchronization component that transforms and orders operations.

State per active document:

1
interface DocumentState {
2
  documentId: string
3
  revision: number // Monotonic operation counter
4
  content: DocumentContent // Current document state
5
  pendingOps: Map<ClientId, Operation[]> // Ops awaiting transform
6
  clients: Map<ClientId, ClientState> // Connected clients
7
}
8

9
interface ClientState {
10
  clientId: string
11
  lastAckedRevision: number
12
  cursor: CursorPosition | null
13
  color: string // For presence display
14
}

Operation flow:

Receive: Client sends operation with base revision
Validate: Check revision is not stale beyond buffer
Transform: Transform against all operations since base revision
Apply: Update document state
Persist: Write to operation log
Broadcast: Send transformed operation to all clients

Memory management:

Keep document state in memory while active
Evict after 5 minutes of inactivity
Reload from latest snapshot + recent operations

Presence Service

Handles ephemeral state: cursors, selections, user indicators.

Design decisions:

No persistence: Presence is reconstructed on reconnect
Throttled broadcast: Max 20 updates/second per client
Coalesced updates: Batch cursor movements before broadcast

Data structure:

1
interface PresenceUpdate {
2
  clientId: string
3
  documentId: string
4
  cursor: {
5
    anchor: number // Selection start (character position)
6
    head: number // Selection end (cursor position)
7
  } | null
8
  user: {
9
    id: string
10
    name: string
11
    avatar: string
12
    color: string // Assigned per-document
13
  }
14
  timestamp: number
15
}

Document API

Handles document CRUD, access control, and version retrieval.

Endpoints:

Endpoint	Method	Purpose
`/documents`	POST	Create document
`/documents/{id}`	GET	Load document (latest or specific revision)
`/documents/{id}/operations`	GET	Fetch operation range for history
`/documents/{id}/snapshot`	POST	Create manual snapshot
`/documents/{id}/revisions`	GET	List revision metadata
`/documents/{id}/permissions`	PUT	Update access control

1
{
2
  "type": "operation",
3
  "documentId": "doc_abc123",
4
  "clientId": "client_xyz",
5
  "baseRevision": 142,
6
  "operation": {
7
    "ops": [{ "retain": 50 }, { "insert": "Hello, " }, { "retain": 100 }, { "delete": 5 }]
8
  },
9
  "timestamp": 1706886400000
10
}

Update Presence:

1
{
2
  "type": "presence",
3
  "documentId": "doc_abc123",
4
  "cursor": { "anchor": 150, "head": 150 },
5
  "selection": null
6
}

Server → Client Messages

Operation Acknowledgment:

1
{
2
  "type": "ack",
3
  "documentId": "doc_abc123",
4
  "revision": 143,
5
  "transformedOp": { ... }
6
}

Broadcast Operation (to other clients):

1
{
2
  "type": "remote_operation",
3
  "documentId": "doc_abc123",
4
  "clientId": "client_other",
5
  "revision": 143,
6
  "operation": { ... },
7
  "user": {
8
    "id": "user_123",
9
    "name": "Alice"
10
  }
11
}

Presence Broadcast:

1
{
2
  "type": "remote_presence",
3
  "documentId": "doc_abc123",
4
  "presences": [
5
    {
6
      "clientId": "client_other",
7
      "cursor": { "anchor": 200, "head": 210 },
8
      "user": { "id": "user_123", "name": "Alice", "color": "#4285f4" }
9
    }
10
  ]
11
}

REST API

Create Document

Endpoint: POST /api/v1/documents

Request:

1
{
2
  "title": "Untitled Document",
3
  "content": "",
4
  "folderId": "folder_abc",
5
  "templateId": "template_xyz"
6
}

Response (201 Created):

1
{
2
  "id": "doc_abc123",
3
  "title": "Untitled Document",
4
  "revision": 0,
5
  "createdAt": "2024-02-03T10:00:00Z",
6
  "createdBy": {
7
    "id": "user_123",
8
    "name": "Alice"
9
  },
10
  "permissions": {
11
    "owner": "user_123",
12
    "editors": [],
13
    "viewers": []
14
  },
15
  "wsEndpoint": "wss://collab.example.com/ws/doc_abc123"
16
}

Load Document

Endpoint: GET /api/v1/documents/{id}?revision={optional}

Response (200 OK):

1
{
2
  "id": "doc_abc123",
3
  "title": "Project Proposal",
4
  "revision": 1542,
5
  "content": {
6
    "type": "doc",
7
    "content": [
8
      {
9
        "type": "heading",
10
        "attrs": { "level": 1 },
11
        "content": [{ "type": "text", "text": "Introduction" }]
12
      },
13
      {
14
        "type": "paragraph",
15
        "content": [{ "type": "text", "text": "..." }]
16
      }
17
    ]
18
  },
19
  "snapshot": {
20
    "revision": 1500,
21
    "createdAt": "2024-02-03T09:00:00Z"
22
  },
23
  "pendingOperations": 42,
24
  "collaborators": [{ "id": "user_456", "name": "Bob", "online": true }]
25
}

List Revisions

Endpoint: GET /api/v1/documents/{id}/revisions?limit=50&before={revision}

Response (200 OK):

1
{
2
  "revisions": [
3
    {
4
      "revision": 1542,
5
      "timestamp": "2024-02-03T10:30:00Z",
6
      "user": { "id": "user_123", "name": "Alice" },
7
      "summary": "Edited section 3",
8
      "operationCount": 15
9
    },
10
    {
11
      "revision": 1500,
12
      "timestamp": "2024-02-03T09:00:00Z",
13
      "user": { "id": "user_456", "name": "Bob" },
14
      "summary": "Added introduction",
15
      "operationCount": 203,
16
      "isSnapshot": true
17
    }
18
  ],
19
  "hasMore": true,
20
  "nextCursor": "rev_1499"
21
}

Error Responses

Code	Error	When
400	`INVALID_OPERATION`	Operation format invalid
409	`REVISION_CONFLICT`	Base revision too old
410	`DOCUMENT_DELETED`	Document was deleted
423	`DOCUMENT_LOCKED`	Document temporarily locked
429	`RATE_LIMITED`	Too many operations

Revision conflict handling:

1
{
2
  "error": "REVISION_CONFLICT",
3
  "message": "Base revision 100 is too old. Current: 150",
4
  "currentRevision": 150,
5
  "missingOperations": "/api/v1/documents/doc_abc/operations?from=100&to=150"
6
}

Client must fetch missing operations, transform local pending operations, and retry.

Data Modeling

Document Metadata (PostgreSQL)

1
CREATE TABLE documents (
2
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
3
    title TEXT NOT NULL,
4
    owner_id UUID NOT NULL REFERENCES users(id),
5
    folder_id UUID REFERENCES folders(id),
6
    current_revision BIGINT DEFAULT 0,
7
    latest_snapshot_revision BIGINT,
8
    content_type VARCHAR(50) DEFAULT 'rich_text',
9
    created_at TIMESTAMPTZ DEFAULT NOW(),
10
    updated_at TIMESTAMPTZ DEFAULT NOW(),
11
    deleted_at TIMESTAMPTZ,
12

13
    -- Denormalized for read performance
14
    collaborator_count INT DEFAULT 0,
15
    word_count INT DEFAULT 0,
16
    last_edited_by UUID REFERENCES users(id),
17
    last_edited_at TIMESTAMPTZ
18
);
19

20
-- Access control
21
CREATE TABLE document_permissions (
22
    document_id UUID REFERENCES documents(id) ON DELETE CASCADE,
23
    user_id UUID REFERENCES users(id),
24
    role VARCHAR(20) NOT NULL, -- 'owner', 'editor', 'commenter', 'viewer'
25
    granted_at TIMESTAMPTZ DEFAULT NOW(),
26
    granted_by UUID REFERENCES users(id),
27
    PRIMARY KEY (document_id, user_id)
28
);
29

30
CREATE INDEX idx_documents_owner ON documents(owner_id, updated_at DESC);
31
CREATE INDEX idx_documents_folder ON documents(folder_id, updated_at DESC);
32
CREATE INDEX idx_permissions_user ON document_permissions(user_id);

Operation Log (DynamoDB)

Table design for append-heavy workload:

Partition Key	Sort Key	Attributes
`document_id`	`revision`	`operation`, `client_id`, `user_id`, `timestamp`, `checksum`

Schema:

1
{
2
  "document_id": "doc_abc123",
3
  "revision": 1542,
4
  "operation": {
5
    "ops": [{ "retain": 50 }, { "insert": "Hello" }]
6
  },
7
  "client_id": "client_xyz",
8
  "user_id": "user_123",
9
  "timestamp": 1706886400000,
10
  "checksum": "sha256:abc123...",
11
  "ttl": null
12
}

Why DynamoDB:

Append-only workload (write-optimized)
Predictable latency at scale
Built-in TTL for old operations (after snapshot)
Range queries by revision efficient

Capacity planning:

Write capacity: 300M ops/sec globally → partition across documents
Single document: 100 ops/sec max (50 editors × 2 ops/sec)
Read capacity: Burst on document load, then minimal

Snapshots (S3)

Naming convention:

1
s3://doc-snapshots/{document_id}/{revision}.json.gz

Snapshot content:

1
{
2
  "documentId": "doc_abc123",
3
  "revision": 1500,
4
  "createdAt": "2024-02-03T09:00:00Z",
5
  "content": {
6
    "type": "doc",
7
    "content": [...]
8
  },
9
  "metadata": {
10
    "wordCount": 5420,
11
    "characterCount": 32150,
12
    "imageCount": 12
13
  },
14
  "checksum": "sha256:..."
15
}

Snapshot strategy:

Create snapshot every 1000 operations
Or every 1 hour of activity
Or on manual request (revision history view)
Keep all snapshots for compliance

Active Document Cache (Redis)

Data structures:

1
# Document state (hash)
2
HSET doc:{id}:state
3
    revision 1542
4
    content "{serialized_content}"
5
    last_updated 1706886400000
6

7
# Connected clients (sorted set by last activity)
8
ZADD doc:{id}:clients {timestamp} {client_id}
9

10
# Pending operations queue (list)
11
RPUSH doc:{id}:pending "{operation_json}"
12

13
# Presence (hash with TTL per client)
14
HSET doc:{id}:presence:{client_id}
15
    cursor_anchor 150
16
    cursor_head 150
17
    user_name "Alice"
18
    user_color "#4285f4"
19
EXPIRE doc:{id}:presence:{client_id} 60

Eviction policy:

Documents evicted after 5 minutes of no activity
Presence entries auto-expire after 60 seconds without refresh

1
type Operation = {
2
  ops: (RetainOp | InsertOp | DeleteOp)[]
3
}
4

5
type RetainOp = {
6
  retain: number
7
  attributes?: Record<string, any> // For formatting changes
8
}
9

10
type InsertOp = {
11
  insert: string | { image: string } | { embed: any }
12
  attributes?: Record<string, any>
13
}
14

15
type DeleteOp = {
16
  delete: number
17
}

Example operations:

1
// Insert "Hello" at position 0
2
{
3
  ops: [{ insert: "Hello" }]
4
}
5

6
// Delete 3 characters at position 10
7
{
8
  ops: [{ retain: 10 }, { delete: 3 }]
9
}
10

11
// Bold characters 5-10
12
{
13
  ops: [{ retain: 5 }, { retain: 5, attributes: { bold: true } }]
14
}

Transformation Functions


10 collapsed lines
1
function transform(op1: Operation, op2: Operation, priority: "left" | "right"): [Operation, Operation] {
2
  // op1' = transform(op1, op2) - op1 transformed against op2
3
  // op2' = transform(op2, op1) - op2 transformed against op1
4
  // Guarantee: apply(apply(doc, op1), op2') === apply(apply(doc, op2), op1')
5

6
  const ops1 = [...op1.ops]
7
  const ops2 = [...op2.ops]
8
  const result1: Op[] = []
9
  const result2: Op[] = []
10

11
  let i1 = 0,
12
    i2 = 0
13

14
  while (i1 < ops1.length || i2 < ops2.length) {
15
    const o1 = ops1[i1]
16
    const o2 = ops2[i2]
17

18
    // Case: insert vs anything - inserts go first
19
    if (o1 && "insert" in o1) {
20
      if (priority === "left") {
21
        result2.push({ retain: insertLength(o1) })
22
        result1.push(o1)
23
        i1++
24
        continue
25
      }
26
    }
27
    if (o2 && "insert" in o2) {
28
      result1.push({ retain: insertLength(o2) })
29
      result2.push(o2)
30
      i2++
31
      continue
32
    }
33

34
    // Case: retain vs retain
35
    if (o1 && "retain" in o1 && o2 && "retain" in o2) {
36
      const len = Math.min(o1.retain, o2.retain)
37
      result1.push({ retain: len, attributes: o1.attributes })
38
      result2.push({ retain: len, attributes: o2.attributes })
39
      consumeLength(ops1, i1, len)
40
      consumeLength(ops2, i2, len)
41
      continue
42
    }
43

44
    // Case: delete vs retain
45
    if (o1 && "delete" in o1 && o2 && "retain" in o2) {
46
      const len = Math.min(o1.delete, o2.retain)
47
      result1.push({ delete: len })
48
      // o2 doesn't produce output - deleted content
49
      consumeLength(ops1, i1, len)
50
      consumeLength(ops2, i2, len)
51
      continue
52
    }
53

54
    // Case: retain vs delete
55
    if (o1 && "retain" in o1 && o2 && "delete" in o2) {
56
      const len = Math.min(o1.retain, o2.delete)
57
      // o1 doesn't produce output - deleted content
58
      result2.push({ delete: len })
59
      consumeLength(ops1, i1, len)
60
      consumeLength(ops2, i2, len)
61
      continue
62
    }
63

64
    // Case: delete vs delete - both delete same content
65
    if (o1 && "delete" in o1 && o2 && "delete" in o2) {
66
      const len = Math.min(o1.delete, o2.delete)
67
      // Neither produces output - already deleted
68
      consumeLength(ops1, i1, len)
69
      consumeLength(ops2, i2, len)
70
      continue
71
    }
72
  }
73

74
  return [{ ops: result1 }, { ops: result2 }]
75
}

Server-Side Processing


15 collapsed lines
1
class DocumentProcessor {
2
  private state: DocumentState
3
  private opLog: OperationLog
4
  private broadcaster: Broadcaster
5

6
  async processOperation(clientId: string, baseRevision: number, operation: Operation): Promise<ProcessResult> {
7
    // 1. Validate base revision
8
    if (baseRevision < this.state.revision - MAX_REVISION_LAG) {
9
      throw new RevisionConflictError(this.state.revision)
10
    }
11

12
    // 2. Transform against all operations since base
13
    let transformedOp = operation
14
    for (let rev = baseRevision + 1; rev <= this.state.revision; rev++) {
15
      const serverOp = await this.opLog.getOperation(this.state.documentId, rev)
16
      ;[transformedOp] = transform(transformedOp, serverOp, "right")
17
    }
18

19
    // 3. Apply to document state
20
    const newContent = applyOperation(this.state.content, transformedOp)
21
    const newRevision = this.state.revision + 1
22

23
    // 4. Persist operation (async, but before ack)
24
    await this.opLog.append({
25
      documentId: this.state.documentId,
26
      revision: newRevision,
27
      operation: transformedOp,
28
      clientId,
29
      timestamp: Date.now(),
30
    })
31

32
    // 5. Update in-memory state
33
    this.state.content = newContent
34
    this.state.revision = newRevision
35

36
    // 6. Broadcast to other clients
37
    this.broadcaster.broadcastOperation(
38
      this.state.documentId,
39
      clientId, // Exclude sender
40
      newRevision,
41
      transformedOp,
42
    )
43

44
    // 7. Return acknowledgment
45
    return {
46
      revision: newRevision,
47
      transformedOp,
48
    }
49
  }
50
}

Client-Side State Machine


12 collapsed lines
1
type ClientOTState =
2
  | { type: "synchronized"; serverRevision: number }
3
  | { type: "awaitingAck"; serverRevision: number; pending: Operation }
4
  | { type: "awaitingWithBuffer"; serverRevision: number; pending: Operation; buffer: Operation }
5

6
class ClientOT {
7
  private state: ClientOTState = { type: "synchronized", serverRevision: 0 }
8
  private document: DocumentContent
9

10
  onLocalEdit(operation: Operation): void {
11
    switch (this.state.type) {
12
      case "synchronized":
13
        // Send immediately
14
        this.sendToServer(operation, this.state.serverRevision)
15
        this.state = {
16
          type: "awaitingAck",
17
          serverRevision: this.state.serverRevision,
18
          pending: operation,
19
        }
20
        break
21

22
      case "awaitingAck":
23
        // Buffer - compose with existing buffer or create new
24
        this.state = {
25
          type: "awaitingWithBuffer",
26
          serverRevision: this.state.serverRevision,
27
          pending: this.state.pending,
28
          buffer: operation,
29
        }
30
        break
31

32
      case "awaitingWithBuffer":
33
        // Compose into buffer
34
        this.state = {
35
          ...this.state,
36
          buffer: compose(this.state.buffer, operation),
37
        }
38
        break
39
    }
40

41
    // Apply locally immediately
42
    this.document = applyOperation(this.document, operation)
43
  }
44

45
  onServerAck(revision: number): void {
46
    switch (this.state.type) {
47
      case "awaitingAck":
48
        this.state = { type: "synchronized", serverRevision: revision }
49
        break
50

51
      case "awaitingWithBuffer":
52
        // Send buffered operations
53
        this.sendToServer(this.state.buffer, revision)
54
        this.state = {
55
          type: "awaitingAck",
56
          serverRevision: revision,
57
          pending: this.state.buffer,
58
        }
59
        break
60
    }
61
  }
62

63
  onRemoteOperation(revision: number, operation: Operation): void {
64
    // Transform remote op against pending/buffer
65
    let transformedRemote = operation
66

67
    if (this.state.type === "awaitingAck" || this.state.type === "awaitingWithBuffer") {
68
      ;[, transformedRemote] = transform(this.state.pending, operation, "left")
69

70
      // Also transform pending against remote
71
      const [newPending] = transform(this.state.pending, operation, "left")
72
      this.state = { ...this.state, pending: newPending }
73
    }
74

75
    if (this.state.type === "awaitingWithBuffer") {
76
      ;[, transformedRemote] = transform(this.state.buffer, transformedRemote, "left")
77

78
      const [newBuffer] = transform(this.state.buffer, operation, "left")
79
      this.state = { ...this.state, buffer: newBuffer }
80
    }
81

82
    // Apply transformed remote operation
83
    this.document = applyOperation(this.document, transformedRemote)
84
  }
85
}

Snapshot and Compaction

Snapshot Worker


8 collapsed lines
1
class SnapshotWorker {
2
  private readonly SNAPSHOT_THRESHOLD = 1000 // Operations since last snapshot
3
  private readonly SNAPSHOT_INTERVAL_MS = 3600000 // 1 hour
4

5
  async processDocument(documentId: string): Promise<void> {
6
    const doc = await this.documentStore.getMetadata(documentId)
7
    const latestSnapshot = await this.snapshotStore.getLatest(documentId)
8

9
    const opsSinceSnapshot = doc.currentRevision - (latestSnapshot?.revision ?? 0)
10
    const timeSinceSnapshot = Date.now() - (latestSnapshot?.createdAt ?? 0)
11

12
    if (opsSinceSnapshot < this.SNAPSHOT_THRESHOLD && timeSinceSnapshot < this.SNAPSHOT_INTERVAL_MS) {
13
      return // No snapshot needed
14
    }
15

16
    // Build document state
17
    let content = latestSnapshot?.content ?? emptyDocument()
18
    const operations = await this.opLog.getRange(documentId, (latestSnapshot?.revision ?? 0) + 1, doc.currentRevision)
19

20
    for (const op of operations) {
21
      content = applyOperation(content, op.operation)
22
    }
23

24
    // Store snapshot
25
    await this.snapshotStore.create({
26
      documentId,
27
      revision: doc.currentRevision,
28
      content,
29
      createdAt: Date.now(),
30
    })
31

32
    // Mark old operations for TTL expiry (keep last 100 for recent history)
33
    await this.opLog.setTTL(documentId, 0, doc.currentRevision - 100, TTL_30_DAYS)
34
  }
35
}

Frontend Considerations

Editor Integration

Rich text editors with OT support:

Editor	OT/CRDT Support	Notes
ProseMirror	Steps (OT-like)	Used by Notion, Atlassian
Slate	Plugin-based	Flexible, needs OT library
Quill	Delta format	Native OT support
TipTap	ProseMirror-based	Modern API

Integration pattern (ProseMirror example):


15 collapsed lines
1
class CollaborativeEditor {
2
  private view: EditorView
3
  private otClient: ClientOT
4
  private ws: WebSocket
5

6
  constructor(container: HTMLElement, documentId: string) {
7
    // Initialize OT client
8
    this.otClient = new ClientOT()
9

10
    // Connect WebSocket
11
    this.ws = new WebSocket(`wss://collab.example.com/ws/${documentId}`)
12
    this.ws.onmessage = this.handleServerMessage.bind(this)
13

14
    // Initialize editor with collaboration plugin
15
    this.view = new EditorView(container, {
16
      state: EditorState.create({
17
        plugins: [collab({ version: 0 }), this.cursorPlugin(), this.presencePlugin()],
18
      }),
19
      dispatchTransaction: this.handleLocalChange.bind(this),
20
    })
21
  }
22

23
  private handleLocalChange(tr: Transaction): void {
24
    const newState = this.view.state.apply(tr)
25
    this.view.updateState(newState)
26

27
    if (tr.docChanged) {
28
      // Convert ProseMirror steps to OT operations
29
      const steps = sendableSteps(newState)
30
      if (steps) {
31
        const operation = stepsToOperation(steps.steps)
32
        this.otClient.onLocalEdit(operation)
33
        this.ws.send(
34
          JSON.stringify({
35
            type: "operation",
36
            operation,
37
            baseRevision: this.otClient.serverRevision,
38
          }),
39
        )
40
      }
41
    }
42
  }
43
}

Presence Rendering

Cursor overlay approach:


20 collapsed lines
1
interface RemoteCursor {
2
  clientId: string
3
  user: { name: string; color: string }
4
  anchor: number
5
  head: number
6
}
7

8
class CursorOverlay {
9
  private cursors: Map<string, RemoteCursor> = new Map()
10

11
  updateCursor(cursor: RemoteCursor): void {
12
    this.cursors.set(cursor.clientId, cursor)
13
    this.render()
14
  }
15

16
  removeCursor(clientId: string): void {
17
    this.cursors.delete(clientId)
18
    this.render()
19
  }
20

21
  private render(): void {
22
    // Convert character positions to screen coordinates
23
    for (const [clientId, cursor] of this.cursors) {
24
      const coords = this.positionToCoords(cursor.head)
25

26
      // Render cursor caret
27
      this.renderCaret(clientId, coords, cursor.user.color)
28

29
      // Render selection highlight if anchor !== head
30
      if (cursor.anchor !== cursor.head) {
31
        this.renderSelection(clientId, cursor.anchor, cursor.head, cursor.user.color)
32
      }
33

34
      // Render name label
35
      this.renderNameLabel(clientId, coords, cursor.user)
36
    }
37
  }
38
}

Performance optimizations:

Technique	Purpose	Implementation
Throttle cursor updates	Reduce network traffic	Max 20 updates/sec
Batch presence broadcasts	Reduce message count	Collect 50ms, send batch
Use CSS transforms	Avoid layout thrashing	`transform: translate()`
Virtual cursor layer	Don’t modify editor DOM	Absolute positioned overlay

Offline Support

Operation queue for offline editing:


10 collapsed lines
1
class OfflineQueue {
2
  private db: IDBDatabase
3
  private queueName = "pendingOperations"
4

5
  async enqueue(documentId: string, operation: Operation): Promise<void> {
6
    const tx = this.db.transaction(this.queueName, "readwrite")
7
    const store = tx.objectStore(this.queueName)
8

9
    await store.add({
10
      documentId,
11
      operation,
12
      timestamp: Date.now(),
13
      id: crypto.randomUUID(),
14
    })
15
  }
16

17
  async syncPending(documentId: string): Promise<void> {
18
    const pending = await this.getPending(documentId)
19

20
    for (const item of pending) {
21
      try {
22
        await this.sendOperation(item)
23
        await this.remove(item.id)
24
      } catch (e) {
25
        if (e instanceof RevisionConflictError) {
26
          // Fetch missing ops, transform, retry
27
          await this.handleConflict(documentId, item)
28
        } else {
29
          throw e
30
        }
31
      }
32
    }
33
  }
34
}

Infrastructure

Cloud-Agnostic Components

Component	Purpose	Options
WebSocket Gateway	Real-time connections	Nginx, HAProxy, Envoy
Message Queue	Operation streaming	Kafka, RabbitMQ, NATS
KV Store	Active document state	Redis, Memcached, KeyDB
Document Store	Operation log	Cassandra, ScyllaDB, DynamoDB
Object Store	Snapshots, media	MinIO, Ceph, S3-compatible
Relational DB	Metadata, ACL	PostgreSQL, CockroachDB

AWS Reference Architecture

Service configurations:

Service	Configuration	Rationale
WebSocket (Fargate)	4 vCPU, 8GB RAM	Memory for active documents
API (Fargate)	2 vCPU, 4GB RAM	Stateless, scale on traffic
Workers (Fargate Spot)	2 vCPU, 4GB RAM	Cost optimization for async work
ElastiCache	r6g.xlarge cluster	Sub-ms latency for hot documents
RDS PostgreSQL	db.r6g.2xlarge Multi-AZ	Metadata queries, ACL
DynamoDB	On-demand	Predictable per-op pricing
S3	Standard + Intelligent-Tiering	Hot snapshots, cold history

Scaling Considerations

WebSocket connection limits:

Single server: ~65K connections (Linux file descriptor limit)
Solution: Consistent hashing by document ID across server pool
Active documents per server: ~10K (memory constrained)

Document processor memory:

Average document state: 100KB
Active document with history buffer: 500KB
8GB server → ~16K active documents max

Operation log partitioning:

DynamoDB partition key: document_id
Hot partition: 3000 WCU per partition
Solution: Document sharding if single doc exceeds limits (rare)

Conclusion

This design provides real-time collaborative document editing with:

Sub-200ms operation propagation via WebSocket and server-ordered OT
Strong convergence guarantees without P2P complexity
Full revision history through event-sourced operation log
Offline resilience with IndexedDB operation queue and conflict resolution

Key architectural decisions:

Server-ordered OT eliminates TP2 correctness concerns
Periodic snapshots bound operation replay cost
Ephemeral presence avoids persistence overhead for cursors
Per-document process isolation simplifies scaling

Known limitations:

Server dependency for real-time sync (no true P2P)
Memory pressure at high concurrent editor counts
Snapshot creation adds latency for very active documents

Future enhancements:

Hybrid OT/CRDT for better offline support (Eg-walker approach)
Incremental snapshot deltas to reduce storage
Smarter presence coalescing for large collaborator counts

Appendix

Prerequisites

Distributed systems fundamentals (eventual consistency, vector clocks)
Real-time communication patterns (WebSocket, SSE)
Event sourcing concepts
Understanding of OT or CRDTs (see related articles)

Terminology

Term	Definition
OT	Operational Transformation - algorithm for transforming concurrent operations
TP1/TP2	Transformation properties ensuring convergence
Revision	Monotonic counter representing document state version
Operation	Atomic change to document (insert, delete, format)
Snapshot	Full document state at a specific revision
Presence	Ephemeral state like cursors and selections
Tombstone	Marker for deleted content in CRDT systems

Summary

Real-time collaborative editing requires synchronization algorithms (OT or CRDT), presence broadcasting, and event-sourced persistence
Server-ordered OT dominates production use (Google Docs, CKEditor) because it avoids TP2 correctness issues
WebSocket provides full-duplex communication with 1-10ms latency after handshake
Operation log + periodic snapshots enables full revision history while bounding replay cost
Presence is ephemeral—cursors and selections stored in memory only, reconstructed on reconnect
Scale to 50 concurrent editors per document with ~200ms operation propagation latency

References

Architecture and Implementation:

How Figma’s Multiplayer Technology Works - Figma Engineering Blog
Making Multiplayer More Reliable - Figma transaction journal design
Realtime Editing of Ordered Sequences - Fractional indexing at Figma
The Data Model Behind Notion - Block-based architecture
Sharding Postgres at Notion - Database scaling patterns
Scaling the Linear Sync Engine - Local-first sync architecture

Operational Transformation:

Apache Wave OT Whitepaper - Detailed protocol specification
Google Drive Blog: What’s Different About New Google Docs - Architecture overview
Lessons Learned from CKEditor 5 - Production OT for rich text

Algorithms and Research:

Eg-walker: Collaborative Text Editing - Gentle & Kleppmann, EuroSys 2025
Real Differences between OT and CRDT - ACM 2020 comparison
Performance of Real-Time Collaborative Editors at Large Scale - Scaling analysis

Related Articles:

Operational Transformation - Deep dive into OT algorithms
CRDTs for Collaborative Systems - Alternative approach for offline-first

Read more