Design Instagram: Photo Sharing at Scale

A photo-sharing social platform at Instagram scale handles 1+ billion photos uploaded daily, serves feeds to 500M+ daily active users, and delivers sub-second playback for Stories. This design covers the image upload pipeline, feed generation with fan-out strategies, Stories architecture, and the recommendation systems that power Explore—focusing on the architectural decisions that enable Instagram to process 95M+ daily uploads while maintaining real-time feed delivery.

Mermaid diagram — High-level architecture: upload → process → store → deliver. Feed generation uses hybrid fan-out; Stories have separate TTL-aware caching. Discovery systems run 1000+ ML models for personalization.

Abstract

Instagram’s architecture addresses three fundamental challenges:

Write amplification vs. read latency: A single post from a user with 10M followers could generate 10M timeline cache writes. The hybrid fan-out strategy (push for regular users, pull for celebrities) bounds write amplification while keeping read latency under 100ms.
Ephemeral vs. persistent content: Stories (24-hour TTL) and posts (permanent) require different storage strategies. Stories use aggressive client-side caching with TTL sync; posts use tiered storage with CDN caching.
Cold start vs. engagement optimization: New users need immediate value (trending content); returning users need personalized feeds. The Explore system runs 1000+ ML models simultaneously, using Two Towers neural networks for candidate retrieval and multi-task ranking for final selection.

Core mechanisms:

Hybrid fan-out: Push to followers’ timeline caches for users with <10K followers; merge celebrity content at read time
Image processing pipeline: Generate 6+ resolution variants synchronously; apply filters via GPU shaders
Stories architecture: 24-hour TTL with aggressive prefetch; target <200ms load time
Feed ranking: 100K+ dense features processed by neural networks; models fine-tuned hourly
MQTT for real-time: Powers DMs, notifications, and presence with 6-8% less power than HTTP

Requirements

Functional Requirements

Requirement	Priority	Notes
Photo/video upload	Core	Multiple resolutions, filters, up to 10 items per post
Feed (home timeline)	Core	Ranked, personalized, infinite scroll
Stories	Core	24-hour ephemeral, ring UI, reply capability
Follow/unfollow	Core	Social graph management
Likes and comments	Core	Real-time counts, threaded comments
Direct messages	Core	Real-time chat, media sharing
Explore page	Core	Content discovery, personalized recommendations
Search	Core	Users, hashtags, locations
Notifications	Core	Likes, comments, follows, DM alerts
Reels	Extended	Short-form video (separate video pipeline)
Shopping	Out of scope	E-commerce integration
Ads	Out of scope	Separate ad-tech stack

Non-Functional Requirements

Requirement	Target	Rationale
Upload availability	99.9%	Brief maintenance acceptable
Feed availability	99.99%	Core engagement driver
Feed load latency	p99 < 500ms	User experience threshold
Stories load latency	p99 < 200ms	Tap-and-swipe UX requires instant response
Image processing time	< 10s	User waits for upload confirmation
DM delivery latency	p99 < 500ms	Real-time conversation expectation
Notification delivery	< 2s	Engagement driver
Feed freshness	< 30s for non-celebrities	Balance freshness vs. ranking quality

Scale Estimation

Instagram-scale baseline (2025):

1
Monthly active users: 3 billion
2
Daily active users: 500M+
3
Photos uploaded daily: 95M-100M (conservative estimate)
4

5
Upload traffic:
6
- 100M uploads/day = ~1,150 uploads/second average
7
- Peak: 3x average = ~3,500 uploads/second
8
- Average image size: 2MB (after client compression)
9
- Daily upload ingestion: ~200TB/day
10

11
Storage per image:
12
- Original: 2MB
13
- Resolutions: 1080p, 640p, 320p, 150p (thumbnail) × 2 (square + original aspect)
14
- Total variants: ~8 files
15
- Average total per image: ~5MB (compressed variants)
16
- Daily storage growth: ~500TB/day
17

18
Feed reads:
19
- 500M DAU × 20 feed opens/day = 10B feed reads/day
20
- = ~115K feed reads/second average
21
- Peak: 350K+ reads/second
22

23
Social graph:
24
- Average followers per user: ~150
25
- Celebrity accounts (>1M followers): ~50,000
26
- Graph edges: billions

CDN efficiency:

1
Without CDN: All reads from origin
2
With 95% cache hit rate (power-law distribution):
3
- 5% cache misses = 17.5K requests/second to origin
4
- Popular content served entirely from edge

Design Paths

Path A: Push-First Fan-out

Best when:

Smaller scale (<100M users)
Read latency is critical
Most users have similar follower counts (no extreme outliers)

Architecture:

On post creation, push the post ID to every follower’s timeline cache. Reads are O(1) cache lookups.

Trade-offs:

✅ Extremely fast reads (pre-computed timelines)
✅ Simple read path (single cache lookup)
❌ Massive write amplification for popular accounts
❌ Wasted writes for inactive followers
❌ Storage explosion (N copies per post where N = followers)

Real-world example: Twitter (2010-2012) used pure push model initially. A single Bieber tweet caused 10M+ cache writes.

Path B: Pull-Only Fan-out

Best when:

Read latency tolerance is higher
Storage cost is primary concern
Feed freshness can be slightly stale

Architecture:

On feed request, query the social graph for followed users, fetch their recent posts, merge and rank.

Trade-offs:

✅ No write amplification
✅ Minimal storage (posts stored once)
✅ Always fresh (computed at read time)
❌ High read latency (multiple DB queries)
❌ Expensive computation per request
❌ Difficult to rank effectively (limited time for ML)

Real-world example: Early Facebook News Feed used pull model; abandoned due to latency issues at scale.

Path C: Hybrid Fan-out (Instagram Model)

Best when:

Massive scale with power-law follower distribution
Sub-second read latency required
Celebrity accounts exist (>1M followers)

Architecture:

Regular users (<10K followers): Push to followers’ timeline caches on post
Celebrity accounts (>10K followers): Store posts separately; merge at read time
Inactive users: Skip fan-out; compute on demand if they return

Trade-offs:

✅ Bounded write amplification (max 10K writes per post)
✅ Fast reads for most users (cache + small merge)
✅ Handles celebrity scale without storage explosion
❌ Two code paths to maintain
❌ Merge logic adds complexity
❌ Celebrity posts have slight latency penalty

Real-world example: Instagram and Twitter (post-2012) use hybrid models. Instagram’s feed team explicitly tuned the threshold based on write cost analysis.

Path Comparison

Factor	Push-First	Pull-Only	Hybrid
Read latency	O(1)	O(following × posts)	O(1) + O(celebrities)
Write amplification	O(followers)	O(1)	O(min(followers, 10K))
Storage per post	O(followers)	O(1)	O(min(followers, 10K))
Code complexity	Low	Low	Medium
Freshness	Immediate	Immediate	Immediate (regular), slight delay (celebrity merge)
Best scale	<100M users	<10M users	Billions

This Article’s Focus

This article focuses on Path C (Hybrid Fan-out) because:

Instagram’s scale (3B MAU) requires bounding write amplification
Celebrity accounts (Ronaldo: 600M+ followers) would otherwise cause catastrophic write storms
The hybrid model is well-documented in Instagram engineering posts

High-Level Design

Component Overview

Service Responsibilities

Service	Responsibility	Data Store	Key Operations
Upload Service	Media ingestion, validation, processing	S3, PostgreSQL	Resize, filter, generate variants
Post Service	Post CRUD, metadata management	PostgreSQL	Create, update, delete, soft-delete
Feed Service	Timeline generation, ranking	Redis, PostgreSQL	Fan-out, merge, rank
Stories Service	Ephemeral content management	Redis (TTL), S3	Create, expire, ring ordering
Social Service	Follow graph management	Cassandra	Follow, unfollow, follower lists
Search Service	User/hashtag/location search	Elasticsearch	Index, query, autocomplete
Explore Service	Content discovery, recommendations	ML platform	Candidate retrieval, ranking
DM Service	Real-time messaging	Cassandra, Redis	Send, receive, sync
Notification Service	Push and in-app notifications	PostgreSQL, Redis	Queue, dedupe, deliver

Image Upload Service

Upload Flow

Image Processing Pipeline

Input validation:

Maximum file size: 30MB (client-side compression typically yields 2-5MB)
Supported formats: JPEG, PNG, HEIC (converted to JPEG)
Minimum resolution: 320px (smaller images upscaled)
Maximum resolution: 1080px width (larger images downscaled)

Resolution variants generated:

Variant	Dimensions	Use Case
Original	Up to 1080px	Full-screen view
Large	1080 × 1080	Feed (high-DPI devices)
Medium	640 × 640	Feed (standard devices)
Small	320 × 320	Grid view, thumbnails
Thumbnail	150 × 150	Notifications, search results

Filter processing:

Instagram applies filters using GPU shaders (GPUImage framework on mobile, server-side for web uploads).

1
Filter = Color LUT + Adjustments + Blending
2
- LUT (Look-Up Table): 3D color mapping
3
- Adjustments: Brightness, contrast, saturation, warmth
4
- Blending: Vignette, frame overlays

Processing time budget:

Operation	Target	Notes
Upload to storage	< 2s	Depends on connection
Generate variants	< 3s	Parallel processing
Filter application	< 1s	GPU-accelerated
CDN propagation	< 5s	Edge cache warming
Total	< 10s	User-perceived upload time

Storage Strategy

Object storage layout:

1
s3://instagram-media/
2
  /{user_id}/
3
    /{media_id}/
4
      original.jpg          # Raw upload
5
      1080.jpg             # Full resolution
6
      640.jpg              # Medium
7
      320.jpg              # Small
8
      150.jpg              # Thumbnail
9
      metadata.json        # EXIF, dimensions, filter applied

CDN caching rules:

Content Type	Cache Duration	Cache Key
Original	1 year	`{media_id}/original`
Variants	1 year	`{media_id}/{size}`
Profile pictures	1 hour	`{user_id}/profile`
Stories media	24 hours	`{story_id}/media`

Storage tiering (power-law optimization):

1
Hot tier (SSD): Last 7 days of uploads, frequently accessed
2
Warm tier (HDD): 7 days - 1 year, moderate access
3
Cold tier (archive): > 1 year, rare access
4

5
Migration policy:
6
- Content accessed > 10x/day stays hot
7
- Content accessed < 1x/week moves to warm
8
- Content not accessed in 90 days moves to cold

# Timeline cache (sorted set by timestamp)
ZADD timeline:{user_id} {timestamp} {post_id}

# Keep last 800 posts per timeline
ZREMRANGEBYRANK timeline:{user_id} 0 -801

# Post metadata cache (hash)
HSET post:{post_id}
  author_id "{user_id}"
  media_url "{cdn_url}"
  caption "{text}"
  like_count {count}
  created_at {timestamp}

# Celebrity posts (separate sorted set per celebrity)
ZADD celebrity:{user_id}:posts {timestamp} {post_id}

Timeline composition at read:

1
def get_feed(user_id, cursor=None, limit=20):
2
    # 1. Get cached timeline posts
3
    cached_posts = redis.zrevrange(
4
        f"timeline:{user_id}",
5
        start=cursor or 0,
6
        end=(cursor or 0) + limit * 2  # Fetch extra for ranking
7
    )
8

9
    # 2. Get followed celebrities
10
    celebrities = get_followed_celebrities(user_id)
11

12
    # 3. Fetch recent celebrity posts (last 24h)
13
    celebrity_posts = []
14
    for celeb_id in celebrities:
15
        posts = redis.zrevrangebyscore(
16
            f"celebrity:{celeb_id}:posts",
17
            max=now(),
18
            min=now() - 86400,  # 24 hours
19
            limit=5
20
        )
21
        celebrity_posts.extend(posts)
22

23
    # 4. Merge and rank
24
    all_posts = cached_posts + celebrity_posts
25
    ranked_posts = ranking_service.rank(user_id, all_posts)
26

27
    return ranked_posts[:limit]

Feed Ranking

Instagram’s ranking system uses deep neural networks with 100K+ features.

Signal categories:

Category	Signals	Weight (approx)
Relationship	DM history, profile visits, comments, tags	High
Interest	Content type engagement, hashtag affinity	High
Timeliness	Post age, time since last seen	Medium
Popularity	Like velocity, comment rate, share count	Medium
Creator	Posting frequency, content quality score	Low

Ranking model architecture:

1
Input: User embeddings + Post embeddings + Context features
2
  ↓
3
Feature extraction (100K+ dense features)
4
  ↓
5
Multi-task neural network
6
  ↓
7
Outputs:
8
  - P(like)
9
  - P(comment)
10
  - P(save)
11
  - P(share)
12
  - P(time_spent > 10s)
13
  ↓
14
Weighted combination → Final score

Model training:

Trained on billions of engagement events
Fine-tuned hourly with recent interactions
A/B tested continuously (10+ experiments running at any time)

Consistency and Pagination

Consistency model:

Operation	Consistency	Rationale
Own post visibility	Strong (immediate)	User expects to see own post
Follower timeline update	Eventual (< 30s)	Acceptable delay for feed freshness
Like/comment counts	Eventual (< 5s)	Tolerable for social proof
Unfollow propagation	Strong (immediate)	Privacy expectation

Cursor-based pagination:

1
// Request
2
GET /feed?cursor=eyJ0cyI6MTY0...&limit=20
3

4
// Response
5
{
6
  "posts": [...],
7
  "next_cursor": "eyJ0cyI6MTY0...",
8
  "has_more": true
9
}
10

11
// Cursor structure (base64-encoded)
12
{
13
  "ts": 1640000000,  // Timestamp of last item
14
  "pid": "abc123",   // Post ID (for tie-breaking)
15
  "v": 2             // Cursor version (for migrations)
16
}

Why cursor-based (not offset-based):

Timeline changes between requests (new posts arrive)
Offset pagination causes duplicates or missed posts
Cursor is stable: “posts older than X” always returns consistent results

Stories Service

Architecture

Stories have fundamentally different requirements than feed posts:

Property	Posts	Stories
Lifetime	Permanent	24 hours
Load time target	< 500ms	< 200ms
Caching strategy	CDN + Redis	Aggressive prefetch
Ranking	Complex ML	Recency + engagement

Story Ring Ordering

The “Stories ring” (horizontal tray at top) orders accounts by engagement signals:

Ordering factors:

Accounts with unseen stories (always first)
DM interaction frequency
Profile visit frequency
Comment/like history
Story view history (accounts you consistently view)

Data model:

# Story metadata (expires with TTL)
SETEX story:{story_id} 86400 '{
  "author_id": "123",
  "media_url": "https://...",
  "created_at": 1640000000,
  "viewers": [],
  "reply_enabled": true
}'

# User's active stories (sorted set, auto-cleanup)
ZADD user:{user_id}:stories {created_at} {story_id}
ZREMRANGEBYSCORE user:{user_id}:stories -inf {now - 86400}

# Story ring ordering per viewer
ZADD user:{viewer_id}:story_ring {engagement_score} {author_id}

Prefetch Strategy

Client-side behavior:

1
On app open:
2
1. Fetch story ring ordering (lightweight API call)
3
2. Prefetch first 3 story authors' media (background)
4
3. As user views stories, prefetch next 2 authors ahead
5

6
On story view:
7
1. Preload all segments of current story
8
2. Preload first segment of next story
9
3. Mark current story as viewed (async)

Why aggressive prefetch:

Stories UX is tap-tap-tap: any loading spinner breaks flow
Media is small (compressed images/short videos)
Users view multiple stories in sequence: sequential access pattern

TTL and Expiration

Server-side:

Redis keys set with 24-hour TTL
Background job cleans up S3 media at TTL+1 hour (grace period for in-flight views)

Client-side:

Local cache respiration synced with server TTL
Client computes ttl_remaining = story.created_at + 86400 - now()
Evict from local cache when TTL expires

Property	MQTT	WebSocket
Protocol overhead	2 bytes minimum	2-14 bytes
Power consumption	6-8% lower	Higher (keep-alive)
Reconnection	Built-in session resumption	Manual implementation
QoS levels	At-most-once, at-least-once, exactly-once	Manual

MQTT topic structure:

1
# User's DM inbox (subscribe on connect)
2
/u/{user_id}/inbox
3

4
# Thread-specific updates
5
/t/{thread_id}/messages
6

7
# Typing indicators
8
/t/{thread_id}/typing

Direct’s Mutation Manager (DMM)

Instagram’s engineering team built a dedicated mutation manager for DMs to handle:

Optimistic UI: Show sent message immediately, reconcile with server response
Offline support: Queue messages when offline, sync when reconnected
Ordering guarantees: Preserve message order even with network jitter
Retry logic: Automatic retry with exponential backoff

Client-side queue:

1
interface QueuedMessage {
2
  localId: string // Client-generated UUID
3
  threadId: string
4
  content: string
5
  timestamp: number
6
  status: "pending" | "sent" | "failed"
7
  retryCount: number
8
}
9

10
// Persisted to IndexedDB/SQLite
11
// Survives app restarts

Cassandra Data Model

DMs use Cassandra for high write throughput and partition-local queries.

1
-- Thread metadata
2
CREATE TABLE threads (
3
    thread_id UUID PRIMARY KEY,
4
    participant_ids SET<UUID>,
5
    created_at TIMESTAMP,
6
    last_message_at TIMESTAMP,
7
    last_message_preview TEXT
8
);
9

10
-- Messages partitioned by thread
11
CREATE TABLE messages (
12
    thread_id UUID,
13
    message_id TIMEUUID,
14
    sender_id UUID,
15
    content TEXT,
16
    media_url TEXT,
17
    created_at TIMESTAMP,
18
    PRIMARY KEY (thread_id, message_id)
19
) WITH CLUSTERING ORDER BY (message_id DESC);
20

21
-- User's inbox (materialized view for fast inbox loading)
22
CREATE TABLE user_inbox (
23
    user_id UUID,
24
    thread_id UUID,
25
    last_message_at TIMESTAMP,
26
    unread_count INT,
27
    PRIMARY KEY (user_id, last_message_at)
28
) WITH CLUSTERING ORDER BY (last_message_at DESC);

Explore and Recommendations

System Scale

Instagram’s Explore recommendation system:

Serves hundreds of millions of daily visitors
Chooses from billions of content options in real-time
Runs 1,000+ ML models simultaneously

Three-Stage Recommendation Pipeline

Two Towers Model (Retrieval)

Architecture:

1
User Tower:                     Item Tower:
2
[User features]                 [Item features]
3
      ↓                              ↓
4
   Dense layers                  Dense layers
5
      ↓                              ↓
6
User embedding (128d)           Item embedding (128d)
7
      ↓                              ↓
8
      └────── Dot product ──────────┘
9
                   ↓
10
           Similarity score

User features:

Account-level embeddings (topical interests)
Recent engagement history
Social graph signals
Demographic signals (age bucket, region)

Item features:

Content embeddings (visual + text)
Creator features
Engagement statistics
Content category

Multi-Task Ranking

The late-stage ranker predicts multiple objectives simultaneously:

1
Outputs:
2
- P(like)        weight: 1.0
3
- P(comment)     weight: 2.0  (higher engagement)
4
- P(save)        weight: 3.0  (strong intent signal)
5
- P(share)       weight: 3.0
6
- P(follow)      weight: 5.0  (acquisition metric)
7
- P(hide)        weight: -10.0 (negative signal)
8

9
Final score = Σ(weight × probability)

Model Training

Continual learning:

Models fine-tuned hourly with new engagement data
Base model retrained weekly with full dataset
Feature store updated in real-time

Scale:

1,000+ models running in production
Custom ML infrastructure (PyTorch-based)
GPU clusters for inference at <100ms p99

API Design

Photo Upload

1
POST /api/v1/media/upload
2
Content-Type: multipart/form-data
3

4
Request:
5
- file: <binary>
6
- media_type: "image" | "video"
7
- filter_id: "clarendon" | "gingham" | ... (optional)
8

9
Response (200 OK):
10
{
11
  "media_id": "abc123",
12
  "urls": {
13
    "1080": "https://cdn.instagram.com/abc123/1080.jpg",
14
    "640": "https://cdn.instagram.com/abc123/640.jpg",
15
    "320": "https://cdn.instagram.com/abc123/320.jpg",
16
    "150": "https://cdn.instagram.com/abc123/150.jpg"
17
  },
18
  "expires_at": "2024-01-02T00:00:00Z"  // Media must be posted within 24h
19
}

Create Post

1
POST /api/v1/posts
2

3
Request:
4
{
5
  "media_ids": ["abc123", "def456"],  // Up to 10 for carousel
6
  "caption": "Summer vibes 🌴",
7
  "location_id": "loc_789",           // Optional
8
  "tagged_users": ["user_111"],       // Optional
9
  "alt_text": "Beach sunset"          // Accessibility
10
}
11

12
Response (201 Created):
13
{
14
  "post_id": "post_xyz",
15
  "permalink": "https://instagram.com/p/xyz",
16
  "created_at": "2024-01-01T12:00:00Z"
17
}
18

19
Errors:
20
- 400: Invalid media_id (expired or not found)
21
- 400: Caption too long (> 2200 characters)
22
- 403: Tagged user has blocked you
23
- 429: Rate limited (> 25 posts/day)

Feed

1
GET /api/v1/feed?cursor={cursor}&limit=20
2

3
Response (200 OK):
4
{
5
  "posts": [
6
    {
7
      "post_id": "post_xyz",
8
      "author": {
9
        "user_id": "user_123",
10
        "username": "photographer",
11
        "profile_pic_url": "https://...",
12
        "is_verified": true
13
      },
14
      "media": [
15
        {
16
          "type": "image",
17
          "url": "https://cdn.instagram.com/...",
18
          "width": 1080,
19
          "height": 1080,
20
          "alt_text": "Beach sunset"
21
        }
22
      ],
23
      "caption": "Summer vibes 🌴",
24
      "like_count": 1234,
25
      "comment_count": 56,
26
      "created_at": "2024-01-01T12:00:00Z",
27
      "viewer_has_liked": false,
28
      "viewer_has_saved": false
29
    }
30
  ],
31
  "next_cursor": "eyJ0cyI6MTY0...",
32
  "has_more": true
33
}

Stories

1
GET /api/v1/stories/feed
2

3
Response (200 OK):
4
{
5
  "story_ring": [
6
    {
7
      "user_id": "user_123",
8
      "username": "friend1",
9
      "profile_pic_url": "https://...",
10
      "has_unseen": true,
11
      "latest_story_ts": "2024-01-01T11:00:00Z"
12
    }
13
  ],
14
  "stories": {
15
    "user_123": [
16
      {
17
        "story_id": "story_abc",
18
        "media_url": "https://...",
19
        "media_type": "image",
20
        "created_at": "2024-01-01T11:00:00Z",
21
        "expires_at": "2024-01-02T11:00:00Z",
22
        "seen": false,
23
        "reply_enabled": true
24
      }
25
    ]
26
  }
27
}

Direct Messages

1
POST /api/v1/direct/threads/{thread_id}/messages
2

3
Request:
4
{
5
  "text": "Hey, nice photo!",
6
  "reply_to_story_id": "story_abc"  // Optional
7
}
8

9
Response (201 Created):
10
{
11
  "message_id": "msg_xyz",
12
  "thread_id": "thread_123",
13
  "created_at": "2024-01-01T12:00:00Z",
14
  "status": "sent"
15
}

Data Modeling

PostgreSQL Schema (Core Entities)

1
-- Users
2
CREATE TABLE users (
3
    id BIGINT PRIMARY KEY,
4
    username VARCHAR(30) UNIQUE NOT NULL,
5
    email VARCHAR(255) UNIQUE,
6
    phone VARCHAR(20) UNIQUE,
7
    full_name VARCHAR(100),
8
    bio TEXT,
9
    profile_pic_url TEXT,
10
    is_private BOOLEAN DEFAULT false,
11
    is_verified BOOLEAN DEFAULT false,
12
    follower_count INT DEFAULT 0,
13
    following_count INT DEFAULT 0,
14
    post_count INT DEFAULT 0,
15
    created_at TIMESTAMPTZ DEFAULT NOW(),
16
    updated_at TIMESTAMPTZ DEFAULT NOW()
17
);
18

19
CREATE INDEX idx_users_username ON users(username);
20

21
-- Posts
22
CREATE TABLE posts (
23
    id BIGINT PRIMARY KEY,
24
    author_id BIGINT NOT NULL REFERENCES users(id),
25
    caption TEXT,
26
    location_id BIGINT REFERENCES locations(id),
27
    like_count INT DEFAULT 0,
28
    comment_count INT DEFAULT 0,
29
    is_archived BOOLEAN DEFAULT false,
30
    created_at TIMESTAMPTZ DEFAULT NOW(),
31
    deleted_at TIMESTAMPTZ
32
);
33

34
CREATE INDEX idx_posts_author ON posts(author_id, created_at DESC);
35

36
-- Post Media (supports carousel)
37
CREATE TABLE post_media (
38
    id BIGINT PRIMARY KEY,
39
    post_id BIGINT NOT NULL REFERENCES posts(id),
40
    media_type VARCHAR(10) NOT NULL,  -- 'image', 'video'
41
    url TEXT NOT NULL,
42
    width INT,
43
    height INT,
44
    alt_text TEXT,
45
    position SMALLINT DEFAULT 0,
46
    created_at TIMESTAMPTZ DEFAULT NOW()
47
);
48

49
CREATE INDEX idx_post_media_post ON post_media(post_id);
50

51
-- Comments
52
CREATE TABLE comments (
53
    id BIGINT PRIMARY KEY,
54
    post_id BIGINT NOT NULL REFERENCES posts(id),
55
    author_id BIGINT NOT NULL REFERENCES users(id),
56
    parent_id BIGINT REFERENCES comments(id),  -- For replies
57
    content TEXT NOT NULL,
58
    like_count INT DEFAULT 0,
59
    created_at TIMESTAMPTZ DEFAULT NOW(),
60
    deleted_at TIMESTAMPTZ
61
);
62

63
CREATE INDEX idx_comments_post ON comments(post_id, created_at DESC);
64

65
-- Likes (polymorphic)
66
CREATE TABLE likes (
67
    id BIGINT PRIMARY KEY,
68
    user_id BIGINT NOT NULL REFERENCES users(id),
69
    target_type VARCHAR(10) NOT NULL,  -- 'post', 'comment', 'story'
70
    target_id BIGINT NOT NULL,
71
    created_at TIMESTAMPTZ DEFAULT NOW(),
72
    UNIQUE(user_id, target_type, target_id)
73
);
74

75
CREATE INDEX idx_likes_target ON likes(target_type, target_id);

1
-- Follows (partitioned by follower for "who do I follow" queries)
2
CREATE TABLE follows (
3
    follower_id UUID,
4
    following_id UUID,
5
    created_at TIMESTAMP,
6
    PRIMARY KEY (follower_id, following_id)
7
);
8

9
-- Followers (partitioned by following for "who follows me" queries)
10
CREATE TABLE followers (
11
    following_id UUID,
12
    follower_id UUID,
13
    created_at TIMESTAMP,
14
    PRIMARY KEY (following_id, follower_id)
15
);
16

17
-- Activity feed (for "activity" tab)
18
CREATE TABLE activity (
19
    user_id UUID,
20
    activity_id TIMEUUID,
21
    actor_id UUID,
22
    activity_type TEXT,  -- 'like', 'comment', 'follow', 'mention'
23
    target_type TEXT,
24
    target_id UUID,
25
    created_at TIMESTAMP,
26
    PRIMARY KEY (user_id, activity_id)
27
) WITH CLUSTERING ORDER BY (activity_id DESC);

ID Generation (Instagram’s Approach)

Instagram’s famous sharding and ID generation system:

1
-- PL/pgSQL function for globally unique, time-sorted IDs
2
CREATE OR REPLACE FUNCTION instagram_id() RETURNS BIGINT AS $$
3
DECLARE
4
    epoch BIGINT := 1314220021721;  -- Custom epoch (Sep 2011)
5
    seq_id BIGINT;
6
    now_millis BIGINT;
7
    shard_id INT := 1;  -- Set per logical shard
8
    result BIGINT;
9
BEGIN
10
    SELECT nextval('instagram_id_seq') % 1024 INTO seq_id;
11
    SELECT FLOOR(EXTRACT(EPOCH FROM NOW()) * 1000) INTO now_millis;
12

13
    result := (now_millis - epoch) << 23;  -- 41 bits for timestamp
14
    result := result | (shard_id << 10);    -- 13 bits for shard
15
    result := result | (seq_id);            -- 10 bits for sequence
16

17
    RETURN result;
18
END;
19
$$ LANGUAGE PLPGSQL;

ID structure (64 bits):

Bits	Purpose	Range
41	Milliseconds since epoch	~69 years
13	Shard ID	8,192 shards
10	Sequence	1,024 IDs/ms/shard

Why this matters:

IDs are time-sorted: no separate timestamp index needed
IDs encode shard: routing without lookup
IDs are unique across shards: no coordination needed

Infrastructure

Cloud-Agnostic Concepts

Component	Purpose	Requirements
Object Storage	Media files	High durability, CDN integration
Relational DB	Users, posts, metadata	ACID, sharding support
Wide-column DB	Social graph, activity	High write throughput, partition-local queries
Cache	Timeline, hot data	Sub-ms latency, cluster support
Message Queue	Async processing	At-least-once delivery, partitioning
Search Index	Discovery	Full-text, faceted search
CDN	Media delivery	Global PoPs, cache efficiency
Push Gateway	Real-time notifications	MQTT/WebSocket support

AWS Reference Architecture

Component	Service	Configuration
API Gateway	ALB + API Gateway	Auto-scaling, WAF protection
Compute	EKS (Kubernetes)	Spot instances for workers
Primary DB	RDS PostgreSQL	Multi-AZ, read replicas
Social Graph	Amazon Keyspaces or self-managed Cassandra	Multi-region
Cache	ElastiCache Redis Cluster	Cluster mode, 6+ nodes
Object Storage	S3 + CloudFront	Intelligent tiering
Message Queue	Amazon MSK (Kafka) or SQS	For fan-out workers
Search	OpenSearch Service	3+ data nodes
Push	Amazon MQ (MQTT) or IoT Core	Managed MQTT broker

Multi-Region Deployment

Instagram’s Migration (AWS → Facebook)

Instagram migrated from AWS to Facebook’s data centers in 2014:

Before (AWS):

12 PostgreSQL instances (Quadruple Extra-Large memory)
12 read replicas
6 Memcached instances
S3 for media storage

After (Facebook infrastructure):

1 Facebook server ≈ 3 Amazon servers (efficiency)
Shared infrastructure with Facebook
Private fiber network between data centers
No service disruption during migration (8 engineers, ~1 year)

Frontend Considerations

Feed Virtualization

Instagram’s feed is an infinite scroll of variable-height items:

1
// Virtualized list configuration
2
const FeedList = () => {
3
  return (
4
    <VirtualizedList
5
      data={posts}
6
      renderItem={({ item }) => <PostCard post={item} />}
7
      estimatedItemSize={600}  // Average post height
8
      overscanCount={3}        // Render 3 items above/below viewport
9
      onEndReached={loadMore}
10
      onEndReachedThreshold={0.5}
11
    />
12
  );
13
};

Why virtualization:

Feed can have 1000+ posts
Each post has heavy media (images/video)
Without virtualization: memory explosion, jank

Image Loading Strategy

1
// Progressive image loading
2
const PostImage = ({ post }) => {
3
  const [loaded, setLoaded] = useState(false);
4

5
  return (
6
    <div className="post-image">
7
      {/* Blur placeholder (tiny, inline) */}
8
      <img
9
        src={post.thumbnail_blur}  // 10x10 base64
10
        className={loaded ? 'hidden' : 'blur'}
11
      />
12

13
      {/* Full image (lazy loaded) */}
14
      <img
15
        src={post.media_url}
16
        loading="lazy"
17
        onLoad={() => setLoaded(true)}
18
        className={loaded ? 'visible' : 'hidden'}
19
      />
20
    </div>
21
  );
22
};

Stories Ring Interaction

1
// Horizontal scroll with snap points
2
const StoriesRing = ({ stories }) => {
3
  return (
4
    <div className="stories-ring" style={{
5
      display: 'flex',
6
      overflowX: 'scroll',
7
      scrollSnapType: 'x mandatory',
8
      WebkitOverflowScrolling: 'touch'  // Smooth iOS scroll
9
    }}>
10
      {stories.map(story => (
11
        <div
12
          key={story.id}
13
          style={{ scrollSnapAlign: 'start' }}
14
        >
15
          <StoryAvatar story={story} />
16
        </div>
17
      ))}
18
    </div>
19
  );
20
};

Optimistic Updates

1
// Like button with optimistic UI
2
const LikeButton = ({ post }) => {
3
  const [optimisticLiked, setOptimisticLiked] = useState(post.viewer_has_liked);
4
  const [optimisticCount, setOptimisticCount] = useState(post.like_count);
5

6
  const handleLike = async () => {
7
    // Optimistic update (immediate feedback)
8
    setOptimisticLiked(!optimisticLiked);
9
    setOptimisticCount(prev => optimisticLiked ? prev - 1 : prev + 1);
10

11
    try {
12
      await api.toggleLike(post.id);
13
    } catch (error) {
14
      // Rollback on failure
15
      setOptimisticLiked(post.viewer_has_liked);
16
      setOptimisticCount(post.like_count);
17
      showError('Failed to like post');
18
    }
19
  };
20

21
  return (
22
    <button onClick={handleLike}>
23
      <HeartIcon filled={optimisticLiked} />
24
      <span>{formatCount(optimisticCount)}</span>
25
    </button>
26
  );
27
};

Conclusion

Instagram’s architecture demonstrates several key principles for building photo-sharing platforms at scale:

Architectural decisions:

Hybrid fan-out bounds write amplification while maintaining sub-second feed loads. The 10K follower threshold is tuned based on write cost analysis.
Separate storage strategies for ephemeral (Stories) vs. persistent (Posts) content optimize for their different access patterns and lifetime requirements.
Three-stage recommendation pipeline (retrieval → early ranking → late ranking) enables personalization across billions of content items with <100ms latency.
MQTT for real-time provides significant power and bandwidth savings over HTTP polling, critical for mobile-first platforms.

Optimizations this design achieves:

Feed load: p99 < 500ms through cached timelines + celebrity merge
Stories load: p99 < 200ms through aggressive prefetch
Upload processing: < 10s for immediate user feedback
Global delivery: 95%+ CDN cache hit rate exploiting power-law distribution

Known limitations:

Hybrid fan-out requires maintaining two code paths
Celebrity threshold (10K) is a tunable but imperfect heuristic
Ranking model hourly retraining introduces slight staleness
Multi-region eventual consistency means brief windows of inconsistency

Alternative approaches not chosen:

Pure push (write amplification at celebrity scale)
Pure pull (read latency unacceptable)
Single-region (latency for global users)

Appendix

Prerequisites

Distributed systems fundamentals (CAP theorem, eventual consistency)
Database concepts (sharding, replication, indexing)
Caching strategies (write-through, write-behind, cache invalidation)
CDN and content delivery concepts
Basic ML concepts (embeddings, neural networks)

Summary

Hybrid fan-out (push for <10K followers, pull for celebrities) bounds write amplification while keeping reads fast
Image processing pipeline generates 6+ variants synchronously; filters use GPU shaders
Stories architecture uses 24-hour TTL with aggressive prefetch for <200ms load times
Feed ranking uses 100K+ features with neural networks; models fine-tuned hourly
Explore recommendation runs 1000+ ML models; Two Towers for retrieval, multi-task ranking for final selection
MQTT powers real-time features (DMs, notifications) with 6-8% less power than HTTP

References

Sharding & IDs at Instagram - Original ID generation design
What Powers Instagram - Early architecture overview
Migrating from AWS to Facebook - Infrastructure migration case study
Making Direct Messages Reliable and Fast - DM architecture details
Scaling Instagram’s Explore Recommendations - Recommendation system architecture
Journey to 1000 Models - ML infrastructure at scale
Instagram Video Processing and Encoding Reduction - Video pipeline optimization
Introducing mcrouter - Caching infrastructure
Powered by AI: Instagram’s Explore Recommender System - Two Towers model details