Design Google Calendar

A comprehensive system design for a calendar and scheduling application handling recurring events, timezone complexity, and real-time collaboration. This design addresses event recurrence at scale (RRULE expansion), global timezone handling across DST boundaries, availability aggregation for meeting scheduling, and multi-client synchronization with conflict resolution.

High-level architecture: Clients connect through an API gateway to core services backed by a hybrid data layer with async processing for notifications and recurrence expansion.

Abstract

Calendar systems solve three interconnected problems: temporal data modeling (representing events, recurrence rules, and exceptions), timezone arithmetic (displaying the same event correctly across global participants), and availability computation (finding meeting slots across multiple calendars).

The core data model stores recurring event masters with RRULE strings (RFC 5545) rather than individual instances. Expansion happens in a hybrid approach: materialize instances 30-90 days ahead for query performance, expand dynamically beyond that window. Exceptions (cancellations, single-instance modifications) are stored separately and merged at read time.

Timezone handling requires storing events in local time with named IANA timezone identifiers—never raw UTC offsets. This ensures a “9 AM daily standup” remains at 9 AM local time across DST transitions.

Conflict-free synchronization uses sync-tokens (RFC 6578) for incremental updates. Each calendar has a monotonically increasing token; clients send their last token and receive only changes since that state. For concurrent edits, the server maintains the event history and uses last-write-wins with user notification for conflicts.

Requirements

Functional Requirements

Feature	Priority	Scope
Single events (create, read, update, delete)	Core	Full
Recurring events (RRULE support)	Core	Full
Event exceptions (cancel/modify single instance)	Core	Full
Time zone handling with DST	Core	Full
Meeting invitations (RSVP workflow)	Core	Full
Free/busy queries	Core	Full
Calendar sharing and delegation	Core	Full
Reminders and notifications	Core	Full
Multi-client sync (CalDAV)	Core	Full
Calendar search	High	Full
Meeting room/resource booking	High	Overview
Video conferencing integration	Medium	Brief
Task management (VTODO)	Low	Out of scope

Non-Functional Requirements

Requirement	Target	Rationale
Availability	99.99%	Calendar access is mission-critical for business operations
Read latency (calendar view)	p99 < 200ms	Month view may expand hundreds of recurring events
Write latency (event creation)	p99 < 500ms	Acceptable for user-initiated actions
Sync latency	< 5 seconds	Changes should propagate across devices quickly
Data consistency	Eventual (< 5s)	Strong consistency not required for calendar data
Data retention	10+ years	Historical calendar data has legal/compliance value

Scale Estimation

Users:

MAU: 500M (Google Workspace + consumer Gmail)
DAU: 100M
Peak concurrent: 10M (10% of DAU)

Events:

Average events per user: 50 active recurring + 200 single events
Total events: 500M users × 250 events = 125B event records
But with recurrence masters (not instances): ~25B records

Traffic:

Calendar loads: 100M DAU × 10 loads/day = 1B/day = ~12K RPS
Event writes: 100M DAU × 2 writes/day = 200M/day = ~2.3K RPS
Free/busy queries: 100M DAU × 0.5/day = 50M/day = ~580 RPS
Peak multiplier: 3x → 36K RPS reads, 7K RPS writes

Storage:

Event master: ~2KB average (metadata, description, RRULE, attendees)
25B events × 2KB = 50TB primary storage
With indexes, replicas, and history: ~200TB total

Design Paths

Path A: RRULE-Centric (Store Rules, Expand on Read)

Best when:

Events have long or infinite recurrence (daily standups forever)
Storage cost is a primary concern
Updates to recurring series are frequent

Key characteristics:

Store only the recurrence rule in the events table
Expand instances dynamically when querying a date range
Cache expansion results in Redis for frequently accessed calendars

Trade-offs:

✅ Minimal storage (one record per recurring series)
✅ Updating series changes all future instances instantly
✅ Supports infinite recurrence naturally
❌ CPU-intensive expansion for complex RRULEs
❌ Slow queries spanning long date ranges
❌ Exception handling adds query complexity

Real-world example: Many open-source CalDAV servers (Radicale, DAViCal) use this approach because storage efficiency matters more than query speed for personal calendars.

Path B: Instance-Centric (Materialize All Instances)

Best when:

Queries span arbitrary date ranges frequently
Meeting scheduling and free/busy aggregation are critical
Most events have bounded recurrence (end dates)

Key characteristics:

Pre-expand all instances into a separate table
Recurring series modifications trigger batch updates to instances
Indexes on start_time enable fast range queries

Trade-offs:

✅ O(1) range queries—just filter by date
✅ Simple free/busy aggregation (SUM over intervals)
✅ Exception instances are just rows with modified fields
❌ Storage explosion (daily event for 10 years = 3,650 rows)
❌ Series updates require updating thousands of rows
❌ Cannot support infinite recurrence

Real-world example: Microsoft Outlook’s Exchange uses materialization for corporate calendars where meeting scheduling performance is paramount.

Path C: Hybrid (Chosen Approach)

Best when:

Mix of short-term and long-term recurring events
Need both fast queries and storage efficiency
Workload varies (view calendar vs. schedule meetings)

Key characteristics:

Store recurrence rules in the master events table
Materialize instances for a rolling window (30-90 days)
Expand dynamically beyond the materialized window
Background jobs refresh materialized instances nightly

Trade-offs:

✅ Fast queries within the materialized window
✅ Reasonable storage (30-90 instances per series, not thousands)
✅ Can support infinite recurrence (expand on demand)
✅ Series updates only touch instances within window
❌ More complex architecture (two code paths)
❌ Stale data possible if background jobs lag

Path Comparison

Factor	Path A (RRULE)	Path B (Instance)	Path C (Hybrid)
Storage	Minimal	High	Moderate
Read latency	High (expansion)	Low	Low within window
Write complexity	Low	High (batch updates)	Moderate
Infinite recurrence	Yes	No	Yes
Free/busy speed	Slow	Fast	Fast within window
Best for	Personal calendars	Enterprise scheduling	General-purpose

This Article’s Focus

This article implements Path C (Hybrid) because Google Calendar serves both consumer users (long-running personal recurring events) and enterprise users (meeting-heavy scheduling). The hybrid approach optimizes for the common case (viewing this week/month) while supporting edge cases (events repeating forever).

High-Level Design

Service Architecture

Event Service

Handles CRUD operations for events and recurring masters:

Create/update/delete single events
Create/update/delete recurring series (stores RRULE)
Create exceptions (modified or cancelled instances)
Query events by date range (calls Recurrence Service for expansion)

Recurrence Service

Expands RRULE strings into concrete instances:

Parse RRULE using RFC 5545 grammar
Generate instances within a date range
Apply EXDATE (exclusions) and RDATE (additions)
Merge with exception instances from database
Cache expansions in Redis (TTL = 1 hour)

Scheduling Service

Handles meeting coordination:

Aggregate free/busy across attendees
Find available meeting slots
Send invitations (iTIP REQUEST method)
Process RSVPs (iTIP REPLY method)
Resource (room) availability and booking

Sync Service

Manages multi-client synchronization:

Implement CalDAV protocol (RFC 4791)
Maintain sync-tokens per calendar
Push notifications for real-time updates (WebSocket/FCM)
Handle conflict detection and resolution

Notification Service

Delivers reminders and alerts:

Schedule reminders based on event VALARM
Deliver via push notification, email, SMS
Handle timezone-aware scheduling (reminder at 9 AM local time)
Batch notification delivery for efficiency

Data Flow: Creating a Recurring Event

Data Flow: Querying Calendar View

API Design

Event Resource

Create Event

Endpoint: POST /api/v1/calendars/{calendarId}/events


3 collapsed lines
1
// Headers
2
Authorization: Bearer {access_token}
3
Content-Type: application/json
4

5
// Request body
6
{
7
  "summary": "Weekly Team Standup",
8
  "description": "Discuss blockers and priorities",
9
  "start": {
10
    "dateTime": "2024-01-15T09:00:00",
11
    "timeZone": "America/New_York"
12
  },
13
  "end": {
14
    "dateTime": "2024-01-15T09:30:00",
15
    "timeZone": "America/New_York"
16
  },
17
  "recurrence": ["RRULE:FREQ=WEEKLY;BYDAY=MO,WE,FR"],
18
  "attendees": [
19
    {"email": "alice@example.com"},
20
    {"email": "bob@example.com", "optional": true}
21
  ],
22
  "reminders": {
23
    "useDefault": false,
24
    "overrides": [
25
      {"method": "popup", "minutes": 10},
26
      {"method": "email", "minutes": 60}
27
    ]
28
  },
6 collapsed lines
29
  "conferenceData": {
30
    "createRequest": {"requestId": "unique-request-id"}
31
  },
32
  "visibility": "default",
33
  "transparency": "opaque"
34
}

Response (201 Created):


5 collapsed lines
1
{
2
  "kind": "calendar#event",
3
  "etag": "\"3148476458000000\"",
4
  "id": "abc123xyz",
5
  "status": "confirmed",
6
  "htmlLink": "https://calendar.example.com/event?eid=abc123xyz",
7
  "created": "2024-01-10T15:30:00.000Z",
8
  "updated": "2024-01-10T15:30:00.000Z",
9
  "summary": "Weekly Team Standup",
10
  "description": "Discuss blockers and priorities",
11
  "creator": {
12
    "email": "organizer@example.com",
13
    "self": true
14
  },
15
  "organizer": {
16
    "email": "organizer@example.com",
17
    "self": true
18
  },
19
  "start": {
20
    "dateTime": "2024-01-15T09:00:00-05:00",
21
    "timeZone": "America/New_York"
22
  },
23
  "end": {
24
    "dateTime": "2024-01-15T09:30:00-05:00",
25
    "timeZone": "America/New_York"
26
  },
27
  "recurrence": ["RRULE:FREQ=WEEKLY;BYDAY=MO,WE,FR"],
28
  "iCalUID": "abc123xyz@calendar.example.com",
29
  "sequence": 0,
30
  "attendees": [
31
    { "email": "alice@example.com", "responseStatus": "needsAction" },
32
    { "email": "bob@example.com", "responseStatus": "needsAction", "optional": true }
33
  ],
34
  "reminders": {
15 collapsed lines
35
    "useDefault": false,
36
    "overrides": [
37
      { "method": "popup", "minutes": 10 },
38
      { "method": "email", "minutes": 60 }
39
    ]
40
  },
41
  "conferenceData": {
42
    "conferenceId": "meet123",
43
    "conferenceSolution": {
44
      "name": "Google Meet",
45
      "iconUri": "https://..."
46
    },
47
    "entryPoints": [{ "entryPointType": "video", "uri": "https://meet.example.com/meet123" }]
48
  }
49
}

Error Responses:

400 Bad Request: Invalid RRULE syntax, missing required fields
401 Unauthorized: Missing or invalid auth token
403 Forbidden: No write access to calendar
409 Conflict: Event conflicts with existing event (if strict mode)
429 Too Many Requests: Rate limit exceeded

Rate Limits: 600 requests/minute per user, 10,000/minute per project

Query Events

Endpoint: GET /api/v1/calendars/{calendarId}/events

Query Parameters:

Parameter	Type	Description
`timeMin`	ISO8601	Lower bound (inclusive) for event end time
`timeMax`	ISO8601	Upper bound (exclusive) for event start time
`singleEvents`	boolean	If true, expand recurring events into instances
`orderBy`	string	`startTime` (requires singleEvents=true) or `updated`
`maxResults`	integer	Maximum entries returned (default: 250, max: 2500)
`pageToken`	string	Token for pagination
`syncToken`	string	Token from previous sync for incremental updates
`showDeleted`	boolean	Include cancelled events (for sync)

Design Decision: Pagination Strategy

Why cursor-based (pageToken/syncToken), not offset-based:

Calendar data is highly dynamic (events created/deleted constantly)
Offset pagination breaks when data changes between pages
Sync tokens enable efficient incremental sync (only fetch changes)

Sync flow:

Initial full sync: GET /events?timeMin=...&timeMax=... → returns nextSyncToken
Incremental sync: GET /events?syncToken={token} → returns changed items + new syncToken
If sync token expires (410 Gone): perform full sync again

Modify Single Instance of Recurring Event

Endpoint: PUT /api/v1/calendars/{calendarId}/events/{recurringEventId}/instances/{instanceId}

This creates an exception instance that overrides the recurring pattern for one occurrence.

1
{
2
  "start": {
3
    "dateTime": "2024-01-17T10:00:00",
4
    "timeZone": "America/New_York"
5
  },
6
  "end": {
7
    "dateTime": "2024-01-17T10:30:00",
8
    "timeZone": "America/New_York"
9
  }
10
}

The instanceId encodes the original instance date (e.g., abc123xyz_20240117T140000Z).

Design Decision: How Exceptions Are Stored

The exception is stored as a separate row linked to the recurring master via recurring_event_id with the original_start_time preserved. This allows:

Querying the modified instance by its new time
Reverting to the original time by deleting the exception
Identifying which instance was modified (via original_start_time)

Free/Busy Query

Endpoint: POST /api/v1/freeBusy

1
{
2
  "timeMin": "2024-01-15T00:00:00Z",
3
  "timeMax": "2024-01-22T00:00:00Z",
4
  "items": [
5
    { "id": "alice@example.com" },
6
    { "id": "bob@example.com" },
7
    { "id": "conference-room-a@resource.example.com" }
8
  ]
9
}

Response:


3 collapsed lines
1
{
2
  "kind": "calendar#freeBusy",
3
  "timeMin": "2024-01-15T00:00:00Z",
4
  "timeMax": "2024-01-22T00:00:00Z",
5
  "calendars": {
6
    "alice@example.com": {
7
      "busy": [
8
        { "start": "2024-01-15T14:00:00Z", "end": "2024-01-15T15:00:00Z" },
9
        { "start": "2024-01-16T09:00:00Z", "end": "2024-01-16T10:00:00Z" }
10
      ]
11
    },
12
    "bob@example.com": {
13
      "busy": [{ "start": "2024-01-15T14:00:00Z", "end": "2024-01-15T14:30:00Z" }]
14
    },
15
    "conference-room-a@resource.example.com": {
16
      "busy": [{ "start": "2024-01-15T10:00:00Z", "end": "2024-01-15T11:00:00Z" }],
17
      "errors": []
18
    }
19
  },
20
  "groups": {}
21
}

Design Decision: Free/Busy Privacy

Free/busy queries return only time intervals, not event details. This allows users to share availability without exposing meeting contents. The transparency field on events controls whether they appear as busy:

opaque (default): Shows as busy
transparent: Doesn’t block time (e.g., “Working from home” all-day event)

Data Modeling

Event Schema

Primary Store: PostgreSQL (ACID for writes, complex queries for recurrence)


5 collapsed lines
1
-- Users and calendars (simplified)
2
CREATE TABLE users (
3
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
4
    email VARCHAR(255) UNIQUE NOT NULL,
5
    timezone VARCHAR(50) DEFAULT 'UTC',
6
    created_at TIMESTAMPTZ DEFAULT NOW()
7
);
8

9
CREATE TABLE calendars (
10
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
11
    owner_id UUID NOT NULL REFERENCES users(id),
12
    name VARCHAR(255) NOT NULL,
13
    timezone VARCHAR(50) NOT NULL,
14
    sync_token BIGINT DEFAULT 0,
15
    created_at TIMESTAMPTZ DEFAULT NOW()
16
);
17

18
-- Event master table (stores both single and recurring events)
19
CREATE TABLE events (
20
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
21
    calendar_id UUID NOT NULL REFERENCES calendars(id),
22
    ical_uid VARCHAR(255) NOT NULL,  -- RFC 5545 UID for iCal interop
23
    summary VARCHAR(500),
24
    description TEXT,
25
    location VARCHAR(500),
26

27
    -- Time fields stored in local time with timezone
28
    start_datetime TIMESTAMP NOT NULL,
29
    end_datetime TIMESTAMP NOT NULL,
30
    start_timezone VARCHAR(50) NOT NULL,
31
    end_timezone VARCHAR(50) NOT NULL,
32
    is_all_day BOOLEAN DEFAULT FALSE,
33

34
    -- Recurrence (NULL for single events)
35
    recurrence_rule TEXT,  -- RRULE string, e.g., "FREQ=WEEKLY;BYDAY=MO,WE,FR"
36
    recurrence_exceptions TEXT[],  -- EXDATE array
37
    recurrence_additions TEXT[],   -- RDATE array
38

39
    -- Metadata
40
    status VARCHAR(20) DEFAULT 'confirmed',  -- confirmed, tentative, cancelled
41
    visibility VARCHAR(20) DEFAULT 'default',  -- default, public, private
42
    transparency VARCHAR(20) DEFAULT 'opaque',  -- opaque, transparent
43
    sequence INTEGER DEFAULT 0,  -- Increment on updates (iCal SEQUENCE)
44

11 collapsed lines
45
    -- Organizer and creator
46
    organizer_email VARCHAR(255),
47
    creator_email VARCHAR(255),
48

49
    created_at TIMESTAMPTZ DEFAULT NOW(),
50
    updated_at TIMESTAMPTZ DEFAULT NOW(),
51
    deleted_at TIMESTAMPTZ,  -- Soft delete
52

53
    UNIQUE(calendar_id, ical_uid)
54
);
55

56
-- Indexes for common query patterns
57
CREATE INDEX idx_events_calendar_time ON events(calendar_id, start_datetime, end_datetime)
58
    WHERE deleted_at IS NULL;
59
CREATE INDEX idx_events_updated ON events(calendar_id, updated_at)
60
    WHERE deleted_at IS NULL;
61
CREATE INDEX idx_events_recurring ON events(calendar_id)
62
    WHERE recurrence_rule IS NOT NULL AND deleted_at IS NULL;

Design Decision: Local Time Storage

Why store start_datetime as local time with a separate start_timezone instead of UTC?

DST correctness: A “9 AM daily standup” should always be at 9 AM local time. If stored as UTC, it would shift by an hour during DST transitions.
RRULE expansion: The RRULE BYDAY=MO means Monday in the event’s timezone, not UTC Monday.
Display simplicity: No conversion needed when displaying in the organizer’s timezone.

Trade-off: Queries that span multiple timezones require conversion. The materialized instances table stores computed UTC times for efficient range queries.

Materialized Instances


3 collapsed lines
1
-- Materialized instances for query performance
2
-- Regenerated nightly for rolling 90-day window
3
CREATE TABLE event_instances (
4
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
5
    event_id UUID NOT NULL REFERENCES events(id) ON DELETE CASCADE,
6
    calendar_id UUID NOT NULL REFERENCES calendars(id),
7

8
    -- Instance timing (UTC for efficient range queries)
9
    instance_start_utc TIMESTAMPTZ NOT NULL,
10
    instance_end_utc TIMESTAMPTZ NOT NULL,
11

12
    -- Original occurrence date (for exception matching)
13
    original_start_utc TIMESTAMPTZ NOT NULL,
14

15
    -- Instance-specific overrides (NULL = inherit from master)
16
    summary_override VARCHAR(500),
17
    description_override TEXT,
18
    location_override VARCHAR(500),
19
    start_override TIMESTAMP,
20
    end_override TIMESTAMP,
21
    timezone_override VARCHAR(50),
22

23
    -- Exception status
24
    status VARCHAR(20) NOT NULL DEFAULT 'confirmed',  -- confirmed, cancelled
25
    is_exception BOOLEAN DEFAULT FALSE,
26

27
    created_at TIMESTAMPTZ DEFAULT NOW()
28
);
29

6 collapsed lines
30
-- Primary query index: calendar + date range
31
CREATE INDEX idx_instances_calendar_range
32
    ON event_instances(calendar_id, instance_start_utc, instance_end_utc)
33
    WHERE status != 'cancelled';
34

35
-- Free/busy aggregation index
36
CREATE INDEX idx_instances_freebusy
37
    ON event_instances(calendar_id, instance_start_utc, instance_end_utc)
38
    WHERE status = 'confirmed';
39

40
-- Exception lookup (find if this occurrence has been modified)
41
CREATE INDEX idx_instances_exception
42
    ON event_instances(event_id, original_start_utc)
43
    WHERE is_exception = TRUE;

Attendees and RSVPs


3 collapsed lines
1
-- Attendees for meetings
2
CREATE TABLE event_attendees (
3
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
4
    event_id UUID NOT NULL REFERENCES events(id) ON DELETE CASCADE,
5
    email VARCHAR(255) NOT NULL,
6
    display_name VARCHAR(255),
7

8
    -- Response status (RFC 5545 PARTSTAT)
9
    response_status VARCHAR(20) DEFAULT 'needsAction',
10
        -- needsAction, declined, tentative, accepted
11

12
    -- Role
13
    is_organizer BOOLEAN DEFAULT FALSE,
14
    is_optional BOOLEAN DEFAULT FALSE,
15
    is_resource BOOLEAN DEFAULT FALSE,  -- Conference room, equipment
16

17
    -- Response metadata
18
    response_comment TEXT,
19
    responded_at TIMESTAMPTZ,
20

21
    UNIQUE(event_id, email)
22
);
23

24
CREATE INDEX idx_attendees_email ON event_attendees(email, event_id);
1 collapsed line
25
CREATE INDEX idx_attendees_event ON event_attendees(event_id);

Database Selection Matrix

Data Type	Store	Rationale
Events and instances	PostgreSQL	ACID, complex RRULE queries, date range filtering
Free/busy cache	Redis Sorted Sets	Sub-ms latency, TTL, efficient range queries
Full-text search	Elasticsearch	Event content search, attendee search
Attachments	Object Storage (S3)	Large files, CDN delivery
Notification queue	Redis Streams / Kafka	High throughput, at-least-once delivery
Sync tokens	PostgreSQL	Transactional consistency with events

Sharding Strategy

Primary shard key: calendar_id

Rationale:

Co-locates all events for a calendar (most queries filter by calendar)
Calendar view queries hit single shard
Cross-calendar queries (free/busy) require scatter-gather, but these are less frequent

Shard distribution:

Hash-based sharding on calendar_id
256 logical shards, distributed across physical nodes
Rebalancing via consistent hashing

Low-Level Design

Recurrence Expansion Algorithm

The recurrence service expands RRULE strings into concrete instances. RFC 5545 defines the algorithm, but edge cases require careful handling.

RRULE Parsing and Expansion


10 collapsed lines
1
// Using a library like rrule.js or python-dateutil for parsing
2
import { RRule, RRuleSet, rrulestr } from "rrule"
3

4
interface RecurrenceExpansionRequest {
5
  rruleString: string // e.g., "FREQ=WEEKLY;BYDAY=MO,WE,FR"
6
  dtstart: Date // Series start in local time
7
  timezone: string // IANA timezone
8
  rangeStart: Date // Query range start (UTC)
9
  rangeEnd: Date // Query range end (UTC)
10
  exdates?: Date[] // Excluded dates
11
  rdates?: Date[] // Additional dates
12
}
13

14
function expandRecurrence(req: RecurrenceExpansionRequest): Date[] {
15
  // Parse the RRULE with timezone awareness
16
  const rule = RRule.fromString(req.rruleString)
17

18
  const rruleSet = new RRuleSet()
19
  rruleSet.rrule(rule)
20

21
  // Add exclusions (EXDATE)
22
  for (const exdate of req.exdates ?? []) {
23
    rruleSet.exdate(exdate)
24
  }
25

26
  // Add additional dates (RDATE)
27
  for (const rdate of req.rdates ?? []) {
28
    rruleSet.rdate(rdate)
29
  }
30

31
  // Expand within range
32
  // CRITICAL: between() uses the RRULE's timezone for DST handling
33
  const instances = rruleSet.between(req.rangeStart, req.rangeEnd, true)
34

35
  return instances
36
}
37

38
// Example: Weekly standup at 9 AM, Mon/Wed/Fri
39
const instances = expandRecurrence({
40
  rruleString: "FREQ=WEEKLY;BYDAY=MO,WE,FR",
41
  dtstart: new Date("2024-01-15T09:00:00"),
42
  timezone: "America/New_York",
43
  rangeStart: new Date("2024-01-01T00:00:00Z"),
44
  rangeEnd: new Date("2024-03-31T23:59:59Z"),
3 collapsed lines
45
  exdates: [new Date("2024-01-17T09:00:00")], // Skip Jan 17
46
})
47
// Returns: [Jan 15, Jan 19, Jan 22, Jan 24, Jan 26, ...]

DST Edge Cases

Spring Forward (2 AM → 3 AM):

When an event is scheduled at 2:30 AM on the night clocks spring forward, the time doesn’t exist.


5 collapsed lines
1
// Handling non-existent times during spring forward
2
function adjustForDST(localTime: Date, timezone: string): Date {
3
  const { DateTime } = require("luxon")
4

5
  const dt = DateTime.fromJSDate(localTime, { zone: timezone })
6

7
  if (!dt.isValid && dt.invalidReason === "time zone offset transition") {
8
    // Time doesn't exist—shift forward to the next valid time
9
    return dt.plus({ hours: 1 }).toJSDate()
10
  }
11

12
  return localTime
13
}

Fall Back (2 AM occurs twice):

When clocks fall back, the 1:00-2:00 AM hour repeats. The iCalendar spec recommends using the first occurrence.

Design Decision: Follow the VTIMEZONE specification by storing and expanding in local time with TZID. The TZID references the IANA database, which contains the complete DST rules. Libraries like Luxon, date-fns-tz, and moment-timezone handle this correctly.

Free/Busy Aggregation

Free/busy aggregation is the core of meeting scheduling. It must be fast (< 100ms for 10 attendees over 1 week) and respect privacy.

Redis-Based Free/Busy Cache


8 collapsed lines
1
import { Redis } from "ioredis"
2

3
interface BusyInterval {
4
  start: number // Unix timestamp
5
  end: number
6
  eventId?: string // Only for the calendar owner
7
}
8

9
// Store busy intervals as sorted set members
10
// Key: freebusy:{calendarId}
11
// Score: start timestamp
12
// Member: JSON { start, end, eventId }
13

14
async function updateFreeBusy(redis: Redis, calendarId: string, instances: EventInstance[]): Promise<void> {
15
  const key = `freebusy:${calendarId}`
16
  const pipeline = redis.pipeline()
17

18
  // Clear existing entries in the affected range
19
  const rangeStart = Math.min(...instances.map((i) => i.startUtc.getTime() / 1000))
20
  const rangeEnd = Math.max(...instances.map((i) => i.endUtc.getTime() / 1000))
21
  pipeline.zremrangebyscore(key, rangeStart, rangeEnd)
22

23
  // Add new busy intervals
24
  for (const instance of instances) {
25
    if (instance.status === "confirmed" && instance.transparency === "opaque") {
26
      const interval: BusyInterval = {
27
        start: instance.startUtc.getTime() / 1000,
28
        end: instance.endUtc.getTime() / 1000,
29
        eventId: instance.eventId,
30
      }
31
      pipeline.zadd(key, interval.start, JSON.stringify(interval))
32
    }
33
  }
34

35
  // Set TTL to 7 days (refresh weekly)
36
  pipeline.expire(key, 7 * 24 * 60 * 60)
37

38
  await pipeline.exec()
39
}
11 collapsed lines
40

41
async function queryFreeBusy(
42
  redis: Redis,
43
  calendarId: string,
44
  rangeStart: Date,
45
  rangeEnd: Date,
46
): Promise<BusyInterval[]> {
47
  const key = `freebusy:${calendarId}`
48
  const start = rangeStart.getTime() / 1000
49
  const end = rangeEnd.getTime() / 1000
50

51
  // Get all intervals that START within the range
52
  const members = await redis.zrangebyscore(key, start, end)
53

54
  return members.map((m) => JSON.parse(m) as BusyInterval).filter((interval) => interval.end > start) // Exclude ended before range
55
}

Finding Available Slots


5 collapsed lines
1
interface TimeSlot {
2
  start: Date
3
  end: Date
4
}
5

6
function findAvailableSlots(
7
  busyIntervalsByAttendee: Map<string, BusyInterval[]>,
8
  rangeStart: Date,
9
  rangeEnd: Date,
10
  duration: number, // minutes
11
  workingHours?: { start: number; end: number }, // e.g., { start: 9, end: 17 }
12
): TimeSlot[] {
13
  // Merge all busy intervals
14
  const allBusy: BusyInterval[] = []
15
  for (const intervals of busyIntervalsByAttendee.values()) {
16
    allBusy.push(...intervals)
17
  }
18

19
  // Sort by start time
20
  allBusy.sort((a, b) => a.start - b.start)
21

22
  // Merge overlapping intervals
23
  const merged: BusyInterval[] = []
24
  for (const interval of allBusy) {
25
    if (merged.length === 0 || merged[merged.length - 1].end < interval.start) {
26
      merged.push({ ...interval })
27
    } else {
28
      merged[merged.length - 1].end = Math.max(merged[merged.length - 1].end, interval.end)
29
    }
30
  }
31

32
  // Find gaps that fit the duration
33
  const durationSec = duration * 60
34
  const available: TimeSlot[] = []
35
  let cursor = rangeStart.getTime() / 1000
36

37
  for (const busy of merged) {
38
    if (busy.start - cursor >= durationSec) {
39
      available.push({
40
        start: new Date(cursor * 1000),
41
        end: new Date(busy.start * 1000),
42
      })
43
    }
44
    cursor = Math.max(cursor, busy.end)
45
  }
46

47
  // Check final gap
48
  const endSec = rangeEnd.getTime() / 1000
49
  if (endSec - cursor >= durationSec) {
11 collapsed lines
50
    available.push({
51
      start: new Date(cursor * 1000),
52
      end: rangeEnd,
53
    })
54
  }
55

56
  // Filter by working hours if specified
57
  if (workingHours) {
58
    return available.filter((slot) => {
59
      const startHour = slot.start.getHours()
60
      return startHour >= workingHours.start && startHour < workingHours.end
61
    })
62
  }
63

64
  return available
65
}

Time Complexity: O(N log N) for sorting, O(N) for merging, where N = total busy intervals across all attendees.

Sync Token Implementation

Sync tokens enable efficient incremental sync for CalDAV clients and mobile apps.


5 collapsed lines
1
-- Track changes for sync
2
CREATE TABLE calendar_changes (
3
    id BIGSERIAL PRIMARY KEY,
4
    calendar_id UUID NOT NULL REFERENCES calendars(id),
5
    event_id UUID NOT NULL,
6
    change_type VARCHAR(10) NOT NULL,  -- 'created', 'updated', 'deleted'
7
    changed_at TIMESTAMPTZ DEFAULT NOW(),
8
    sync_token BIGINT NOT NULL  -- Matches calendars.sync_token at time of change
9
);
10

11
CREATE INDEX idx_changes_sync ON calendar_changes(calendar_id, sync_token);
12

13
-- On event change, record it
14
CREATE OR REPLACE FUNCTION record_event_change()
15
RETURNS TRIGGER AS $$
16
BEGIN
17
  -- Increment calendar's sync token
18
  UPDATE calendars SET sync_token = sync_token + 1 WHERE id = NEW.calendar_id;
19

20
  -- Record the change
21
  INSERT INTO calendar_changes (calendar_id, event_id, change_type, sync_token)
22
  SELECT NEW.calendar_id, NEW.id, TG_OP, sync_token FROM calendars WHERE id = NEW.calendar_id;
23

24
  RETURN NEW;
25
END;
26
$$ LANGUAGE plpgsql;

Sync flow:

Initial sync: Client receives all events + current syncToken (e.g., 15)
Incremental sync: Client sends syncToken=15, server returns changes where sync_token > 15 + new token (e.g., 23)
Token expiration: If changes for token 15 have been purged (older than 30 days), return 410 Gone → client performs full sync

Invitation Workflow (iTIP/iMIP)

When an organizer invites attendees, the system generates iTIP REQUEST messages:

iMIP Email Format:

1
Content-Type: multipart/alternative; boundary="boundary"
2

3
--boundary
4
Content-Type: text/plain
5

6
You've been invited to: Weekly Team Standup
7
When: Monday, January 15, 2024 9:00 AM - 9:30 AM (EST)
8

9
--boundary
10
Content-Type: text/calendar; method=REQUEST
11

12
BEGIN:VCALENDAR
13
VERSION:2.0
14
METHOD:REQUEST
15
BEGIN:VEVENT
16
UID:abc123xyz@calendar.example.com
17
DTSTART;TZID=America/New_York:20240115T090000
18
DTEND;TZID=America/New_York:20240115T093000
19
SUMMARY:Weekly Team Standup
20
ORGANIZER:mailto:organizer@example.com
21
ATTENDEE;PARTSTAT=NEEDS-ACTION:mailto:attendee@example.com
22
END:VEVENT
23
END:VCALENDAR
24

25
--boundary--

Frontend Considerations

Calendar View Performance

Problem: A month view showing 30+ days with recurring events may need to display hundreds of event instances.

Solution: Virtual Scrolling + Batched Loading


10 collapsed lines
1
// Load events in batches as user scrolls
2
interface CalendarViewState {
3
  visibleRange: { start: Date; end: Date }
4
  loadedRanges: Array<{ start: Date; end: Date }>
5
  events: Map<string, CalendarEvent>
6
}
7

8
function useCalendarEvents(calendarId: string) {
9
  const [state, setState] = useState<CalendarViewState>({
10
    visibleRange: getCurrentWeek(),
11
    loadedRanges: [],
12
    events: new Map(),
13
  })
14

15
  // Load events for visible range + buffer
16
  useEffect(() => {
17
    const rangeToLoad = expandRange(state.visibleRange, { days: 7 }) // ±1 week buffer
18

19
    if (!isRangeCovered(rangeToLoad, state.loadedRanges)) {
20
      fetchEvents(calendarId, rangeToLoad).then((newEvents) => {
21
        setState((prev) => ({
22
          ...prev,
23
          loadedRanges: mergeRanges([...prev.loadedRanges, rangeToLoad]),
24
          events: new Map([...prev.events, ...newEvents.map((e) => [e.id, e])]),
25
        }))
26
      })
27
    }
28
  }, [state.visibleRange, calendarId])
29

30
  return state.events
31
}

Key optimizations:

Request singleEvents=true from API to get pre-expanded instances
Cache responses by date range (events within a range don’t change often)
Use ETag / If-None-Match for conditional requests
Virtualize day cells in month view (render only visible weeks)

Real-Time Updates

Strategy: WebSocket for active browser tabs, push notifications for background/mobile.


5 collapsed lines
1
// Real-time sync via WebSocket
2
const useCalendarSync = (calendarId: string) => {
3
  const queryClient = useQueryClient()
4

5
  useEffect(() => {
6
    const ws = new WebSocket(`wss://api.calendar.com/sync/${calendarId}`)
7

8
    ws.onmessage = (event) => {
9
      const change = JSON.parse(event.data)
10

11
      switch (change.type) {
12
        case "event.created":
13
        case "event.updated":
14
          queryClient.setQueryData(["events", calendarId], (old: CalendarEvent[]) => upsertEvent(old, change.event))
15
          break
16
        case "event.deleted":
17
          queryClient.setQueryData(["events", calendarId], (old: CalendarEvent[]) =>
18
            old.filter((e) => e.id !== change.eventId),
19
          )
20
          break
21
      }
22
    }
23

24
    return () => ws.close()
2 collapsed lines
25
  }, [calendarId, queryClient])
26
}

Timezone Display

User expectations:

Event times shown in user’s local timezone by default
Option to view in event’s original timezone
All-day events should span the full day in any timezone


5 collapsed lines
1
// Convert and display event times
2
function formatEventTime(event: CalendarEvent, userTimezone: string): string {
3
  const { DateTime } = require("luxon")
4

5
  if (event.isAllDay) {
6
    // All-day events: show date only, no timezone conversion
7
    return DateTime.fromISO(event.start.date).toLocaleString(DateTime.DATE_MED)
8
  }
9

10
  // Timed events: convert to user's timezone
11
  const start = DateTime.fromISO(event.start.dateTime, { zone: event.start.timeZone })
12
  const userStart = start.setZone(userTimezone)
13

14
  // Show original timezone if different
15
  if (event.start.timeZone !== userTimezone) {
16
    return `${userStart.toLocaleString(DateTime.TIME_SIMPLE)} (${userStart.toFormat("ZZZZ")})`
17
  }
18

19
  return userStart.toLocaleString(DateTime.TIME_SIMPLE)
20
}

Drag-and-Drop Rescheduling

Optimistic updates with rollback:


5 collapsed lines
1
// Drag event to new time slot
2
async function handleEventDrop(eventId: string, newStart: Date, newEnd: Date) {
3
  const previousEvent = queryClient.getQueryData(["event", eventId])
4

5
  // Optimistic update
6
  queryClient.setQueryData(["event", eventId], (old: CalendarEvent) => ({
7
    ...old,
8
    start: { dateTime: newStart.toISOString(), timeZone: old.start.timeZone },
9
    end: { dateTime: newEnd.toISOString(), timeZone: old.end.timeZone },
10
  }))
11

12
  try {
13
    await updateEvent(eventId, { start: newStart, end: newEnd })
14
  } catch (error) {
15
    // Rollback on failure
16
    queryClient.setQueryData(["event", eventId], previousEvent)
17
    toast.error("Failed to reschedule event")
18
  }
19
}
20

21
// For recurring event instance: prompt user for scope
22
function handleRecurringEventDrop(eventId: string, instanceDate: Date, newTime: Date) {
23
  showDialog({
24
    title: "Edit recurring event",
25
    options: [
26
      { label: "This event only", action: () => updateInstance(eventId, instanceDate, newTime) },
27
      { label: "This and future events", action: () => splitSeries(eventId, instanceDate, newTime) },
28
      { label: "All events", action: () => updateSeries(eventId, newTime) },
29
    ],
2 collapsed lines
30
  })
31
}

Infrastructure Design

Cloud-Agnostic Concepts

Component	Requirement	Options
Primary Database	ACID, complex queries	PostgreSQL, MySQL
Cache	Sub-ms reads, TTL	Redis, Memcached
Search	Full-text, aggregations	Elasticsearch, OpenSearch
Message Queue	At-least-once, ordering	Kafka, RabbitMQ, Redis Streams
Object Storage	Attachments, large files	S3-compatible (MinIO)
Job Scheduler	Cron, delayed jobs	Temporal, Celery, pg-boss

AWS Reference Architecture

Component	AWS Service	Configuration
API Service	ECS Fargate	2-50 tasks, 1 vCPU / 2GB each
Background Workers	ECS Fargate Spot	5-20 tasks, Spot for cost
Primary Database	RDS PostgreSQL	db.r6g.xlarge, Multi-AZ, 1TB gp3
Read Replicas	RDS Read Replicas	2 replicas across AZs
Cache	ElastiCache Redis	cache.r6g.large, 3-node cluster
Search	OpenSearch	m6g.large.search, 3-node
Message Queue	Amazon SQS / MSK	SQS for simplicity, MSK for ordering
Object Storage	S3 + CloudFront	Intelligent-Tiering, CDN for attachments
Notifications	Lambda + SNS	Push via FCM/APNs

Self-Hosted Alternatives

Managed Service	Self-Hosted	When to Self-Host
RDS PostgreSQL	PostgreSQL on EC2	Cost at scale, specific extensions (pg_cron)
ElastiCache	Redis on EC2	Redis modules (RedisJSON, RediSearch)
OpenSearch	Elasticsearch on EC2	Cost, specific plugins
MSK	Kafka on EC2	Cost at scale, Kafka Streams

Conclusion

This design prioritizes the hybrid approach for recurring events—materializing instances within a rolling window while supporting on-demand expansion for arbitrary ranges. This balances storage efficiency with query performance for the most common use cases (viewing this week/month).

Key architectural decisions:

Local time + TZID storage: Events stored in local time with named timezones, ensuring DST correctness for recurring events.
Sync tokens for incremental sync: Monotonically increasing tokens per calendar enable efficient CalDAV/mobile sync without polling.
Redis-cached free/busy: Pre-computed busy intervals in sorted sets provide sub-100ms scheduling queries.
iTIP/iMIP for interoperability: Standards-based invitation workflow ensures email-based RSVP works across calendar providers.

Limitations and future improvements:

Conflict detection: Current design uses last-write-wins; could implement operational transforms for real-time collaborative editing.
AI scheduling: Could add ML-based suggestions for optimal meeting times based on attendee patterns.
Calendar federation: Cross-organization free/busy queries require additional privacy controls and federation protocols.

Appendix

Prerequisites

Distributed systems fundamentals (CAP theorem, eventual consistency)
Database design (indexing, sharding, replication)
REST API design principles
Basic understanding of timezone concepts (UTC, offsets, DST)

Terminology

RRULE: Recurrence Rule—RFC 5545 syntax for defining repeating patterns (e.g., FREQ=WEEKLY;BYDAY=MO)
EXDATE: Exception Date—dates excluded from a recurring series
iTIP: iCalendar Transport-Independent Interoperability Protocol—defines methods for scheduling (REQUEST, REPLY, CANCEL)
iMIP: iCalendar Message-Based Interoperability Protocol—iTIP over email
CalDAV: Calendaring Extensions to WebDAV—protocol for calendar access and sync
Sync Token: Opaque string representing calendar state for incremental synchronization
TZID: Timezone Identifier—IANA timezone name (e.g., America/New_York)

Summary

Calendar systems require a hybrid recurrence model: store RRULE masters, materialize instances for a rolling window (30-90 days), expand dynamically beyond
Time storage must be local time with TZID, not UTC, to handle DST transitions correctly for recurring events
Free/busy aggregation is optimized via Redis sorted sets with pre-computed busy intervals
Sync tokens enable efficient incremental sync—clients receive only changes since their last sync
iTIP/iMIP provide interoperability with other calendar systems via standardized invitation workflows
Scale to 500M users requires PostgreSQL sharding by calendar_id, Redis caching, and async notification delivery

References

RFC 5545 - iCalendar Specification - Core data format for calendar interchange
RFC 4791 - CalDAV - Calendar access protocol
RFC 5546 - iTIP - Scheduling protocol (REQUEST, REPLY, CANCEL)
RFC 6047 - iMIP - Email transport for calendar invitations
RFC 6578 - Collection Synchronization - Sync token mechanism for WebDAV
IANA Time Zone Database - Authoritative timezone data
Google Calendar API Documentation - Reference implementation patterns
rrule.js - JavaScript library for RRULE expansion

Read more