Design a Social Feed (Facebook/Instagram)

A comprehensive system design for social feed generation and ranking covering fan-out strategies, ML-powered ranking, graph-based storage, and caching at scale. This design addresses sub-second feed delivery for billions of users with personalized content ranking, handling the “celebrity problem” where a single post can require millions of fan-out operations.

Mermaid diagram — High-level architecture: Aggregators query leaf servers for candidates, ranking service applies ML models, fan-out service distributes posts to follower feeds, and TAO provides graph storage.

Abstract

Social feed systems solve three interconnected problems: efficient content distribution (getting posts to followers), personalized ranking (showing the most relevant content first), and low-latency retrieval (sub-second feed loads at massive scale).

Core architectural decisions:

Decision	Choice	Rationale
Fan-out strategy	Hybrid push/pull	Push for regular users, pull for celebrities (>10K followers)
Content storage	TAO-like graph store	Objects + associations model fits social data naturally
Feed ranking	Multi-stage ML funnel	Billions → hundreds → ~50 candidates through progressive filtering
Caching	Two-tier regional	Followers handle reads, leaders maintain consistency
Consistency	Eventual (~1 min)	Acceptable for social content; strong only for engagement counts

Key trade-offs accepted:

Eventual consistency for feed updates (fresh-enough is good enough)
Higher write amplification for regular users (pre-computed feeds)
Complex hybrid logic for celebrity handling
ML model staleness (hourly retraining) in exchange for training efficiency

What this design optimizes:

p99 feed load < 500ms globally
Zero data loss for user-generated content
Personalized ranking with 1000+ signals
Linear horizontal scaling to billions of users

Requirements

Functional Requirements

Requirement	Priority	Notes
Home feed generation	Core	Aggregated posts from followed users
Post creation	Core	Text, images, videos with privacy settings
Feed ranking	Core	Personalized relevance ordering
Real-time feed updates	Core	New posts appear without full refresh
Engagement actions	Core	Like, comment, share, save
Following/followers	Core	Asymmetric social graph
Feed pagination	Core	Infinite scroll with cursor-based loading
Post visibility	Extended	Public, friends-only, custom lists
Stories/ephemeral content	Extended	24-hour expiring content
Algorithmic vs. chronological toggle	Extended	User preference for feed type

Non-Functional Requirements

Requirement	Target	Rationale
Availability	99.99% (4 nines)	Revenue-critical, user retention
Feed load latency	p99 < 500ms	User experience threshold
Post publish latency	p99 < 2s	Acceptable for async processing
Feed freshness	< 1 minute	Balance between freshness and efficiency
Ranking model latency	p99 < 100ms	Real-time personalization requirement
Data durability	99.999999%	No user content loss

Scale Estimation

Users:

Monthly Active Users (MAU): 2 billion
Daily Active Users (DAU): 1 billion (50% of MAU)
Peak concurrent users: 200 million (20% of DAU)

Traffic:

Feed loads per user per day: 20
Daily feed requests: 1B × 20 = 20 billion/day = 230K RPS
Peak multiplier: 3× → 700K RPS
Posts per user per day: 0.5 average (power law distribution)
Daily posts: 500 million posts/day = 5.8K posts/second

Storage:

Average post size: 2KB (metadata, excluding media)
Media per post: 500KB average (after compression, CDN-served)
Daily post storage: 500M × 2KB = 1TB/day (metadata)
Daily media storage: 500M × 500KB = 250TB/day
Social graph edges: 2B users × 500 avg connections = 1 trillion edges
Graph storage: 1T edges × 100 bytes = 100TB

Fan-out estimation:

Average followers: 500
Posts requiring fan-out: 500M/day
Fan-out writes: 500M × 500 = 250 billion writes/day
Celebrity optimization: Top 1% (20M users) average 50K followers
Without optimization: 20M × 50K × 0.5 posts = 500 trillion writes/day (impossible)

Design Paths

Path A: Fan-out on Write (Push Model)

Best when:

Uniform follower distribution (no celebrities)
Read-heavy workload (reads >> writes)
Real-time feed freshness is critical
Simpler operational model preferred

Architecture:

Key characteristics:

Pre-computed feeds in Redis/Memcached
O(1) feed reads, O(followers) writes
Feed cache stores post IDs in sorted order

Trade-offs:

✅ Extremely fast feed reads (single cache lookup)
✅ Simple feed retrieval logic
✅ Predictable read latency
❌ Massive write amplification for celebrities
❌ High storage cost (duplicate post references)
❌ Delayed delivery during fan-out processing

Real-world example: Twitter historically used this approach, storing up to 800 tweets per timeline in Redis. At 39M QPS with 105TB RAM across 10,000+ Redis instances, this works for most users but requires special handling for high-follower accounts.

Path B: Fan-out on Read (Pull Model)

Best when:

Celebrity-heavy platform (many high-follower users)
Write-heavy workload
Storage cost is a primary concern
Acceptable higher read latency

Architecture:

Key characteristics:

No write amplification
Feed computed on demand
Read complexity: O(followees × posts_per_user)

Trade-offs:

✅ No write amplification
✅ Always fresh content
✅ Lower storage requirements
❌ Higher read latency (aggregation required)
❌ More compute per request
❌ Harder to apply complex ranking

Real-world example: Facebook moved to this approach in 2007. The Multifeed system fetches tens of thousands of potential updates per user, then ranks and filters to ~45 items. This requires sophisticated indexing and caching to maintain sub-second latency.

Path C: Hybrid Model (Industry Standard)

Best when:

Mixed follower distribution (most users small, some celebrities)
Need to optimize both reads and writes
Can handle additional system complexity
Facebook/Instagram/Twitter scale

Architecture:

Key characteristics:

Threshold-based routing (typically 10K followers)
Pre-computed feeds + on-demand celebrity merging
Best of both worlds for common cases

Trade-offs:

✅ Optimal latency for typical users (pre-computed)
✅ Handles celebrities without write explosion
✅ Flexible threshold tuning
❌ Two code paths to maintain
❌ More complex feed read logic
❌ Edge cases around threshold (users crossing 10K)

Real-world example: Discord uses this approach: direct fan-out for small servers, Kafka-based distribution for large servers (100+ members). Instagram similarly uses hybrid fan-out with ML-based feed generation.

Path Comparison

Factor	Fan-out Write	Fan-out Read	Hybrid
Read latency	~10ms	~200ms	~50ms
Write latency	O(followers)	O(1)	O(followers) for small
Celebrity handling	Impossible	Native	Optimized
Storage cost	High	Low	Medium
Complexity	Low	Medium	High
Production examples	Twitter (pre-2015)	Facebook Multifeed	Instagram, Discord

This Article’s Focus

This article focuses on Path C (Hybrid) because:

Represents modern production architectures at Facebook/Instagram scale
Demonstrates trade-off thinking essential for system design
Handles the full spectrum from regular users to celebrities
Balances latency, storage, and operational complexity

CPU-intensive query processing and ranking
Stateless, horizontally scalable
Queries multiple leaf servers in parallel
Applies ML ranking models to candidates

Leaf Tier:

Memory-intensive, stores recent action indices
Indexes posts by author, sorted by time
Maintains in-memory structures for fast retrieval
Sharded by user ID

Tailer:

Real-time data pipeline from Kafka
Updates leaf indices as posts are created
Handles index rebuilding from persistent storage

Design decisions:

Decision	Choice	Rationale
Tier separation	Disaggregated aggregator/leaf	40% efficiency gain, independent scaling
Leaf sharding	By author user_id	Co-locates author’s posts for range queries
Index structure	Time-sorted skip list	Fast range queries with O(log n) insert
Memory management	LRU eviction, flash overflow	Balance hot data in RAM, cold on SSD

Manages the social graph (follows, friends, blocks) and content relationships:

Data model:

Objects: Users, posts, comments, pages (nodes)
Associations: Follows, likes, comments, tags (edges)

Design characteristics:

Two tables only: objects and associations
Associations stored on source object’s shard
Enables single-shard queries for common patterns

Fan-out Service

Handles post distribution to follower feeds:

Routing logic:

1
interface FanoutRouter {
2
  routePost(post: Post, author: User): FanoutStrategy
3
}
4

5
type FanoutStrategy =
6
  | { type: "push"; followers: string[] }
7
  | { type: "pull"; authorId: string }
8
  | { type: "hybrid"; pushTo: string[]; markForPull: boolean }

Threshold-based routing:

Authors with < 10K followers: Full push fan-out
Authors with 10K-1M followers: Hybrid (push to active followers, pull for rest)
Authors with > 1M followers: Pull only (mark post for read-time merging)

Parameter	Type	Description
cursor	string	Pagination cursor (opaque)
limit	int	Items per page (default: 20, max: 50)
refresh	boolean	Force fresh feed generation

Response (200 OK):

1
{
2
  "posts": [
3
    {
4
      "id": "post_abc123",
5
      "author": {
6
        "id": "user_456",
7
        "username": "johndoe",
8
        "displayName": "John Doe",
9
        "avatarUrl": "https://cdn.example.com/avatars/456.jpg",
10
        "isVerified": true
11
      },
12
      "content": {
13
        "type": "image",
14
        "text": "Beautiful sunset! 🌅",
15
        "media": [
16
          {
17
            "id": "media_789",
18
            "type": "image",
19
            "url": "https://cdn.example.com/posts/789.jpg",
20
            "thumbnailUrl": "https://cdn.example.com/posts/789_thumb.jpg",
21
            "width": 1080,
22
            "height": 1350,
23
            "altText": "Sunset over the ocean"
24
          }
25
        ]
26
      },
27
      "engagement": {
28
        "likeCount": 1542,
29
        "commentCount": 89,
30
        "shareCount": 23,
31
        "viewCount": 12450,
32
        "isLiked": false,
33
        "isSaved": false
34
      },
35
      "ranking": {
36
        "score": 0.89,
37
        "reason": "friend_interaction"
38
      },
39
      "createdAt": "2024-02-03T10:30:00Z",
40
      "visibility": "public"
41
    }
42
  ],
43
  "pagination": {
44
    "nextCursor": "eyJ0IjoxNzA2ODg2NDAwfQ",
45
    "hasMore": true
46
  },
47
  "meta": {
48
    "feedType": "ranked",
49
    "generatedAt": "2024-02-03T12:00:00Z"
50
  }
51
}

Create Post

Endpoint: POST /api/v1/posts

Request:

1
{
2
  "content": {
3
    "text": "Hello world!",
4
    "mediaIds": ["upload_123", "upload_456"]
5
  },
6
  "visibility": "public",
7
  "allowComments": true,
8
  "allowSharing": true,
9
  "location": {
10
    "latitude": 37.7749,
11
    "longitude": -122.4194,
12
    "name": "San Francisco, CA"
13
  }
14
}

Response (201 Created):

1
{
2
  "id": "post_new789",
3
  "author": {
4
    "id": "user_123",
5
    "username": "currentuser"
6
  },
7
  "content": {
8
    "text": "Hello world!",
9
    "media": [...]
10
  },
11
  "createdAt": "2024-02-03T12:05:00Z",
12
  "visibility": "public",
13
  "fanoutStatus": "pending"
14
}

Real-time Feed Updates (SSE)

Endpoint: GET /api/v1/feed/stream

Event Types:

1
event: new_post
2
data: {"postId": "post_xyz", "authorId": "user_456", "preview": "Check out..."}
3

4
event: engagement_update
5
data: {"postId": "post_abc", "likeCount": 1543, "commentCount": 90}
6

7
event: post_removed
8
data: {"postId": "post_old", "reason": "author_deleted"}

Engagement Endpoints

Like/Unlike Post

Endpoint: POST /api/v1/posts/{id}/like

Response (200 OK):

1
{
2
  "liked": true,
3
  "likeCount": 1543,
4
  "timestamp": "2024-02-03T12:10:00Z"
5
}

Get Comments

Endpoint: GET /api/v1/posts/{id}/comments

Query Parameters:

Parameter	Type	Description
cursor	string	Pagination cursor
limit	int	Comments per page (default: 20)
sort	string	”top” (engagement), “newest”, “oldest”

Response (200 OK):

1
{
2
  "comments": [
3
    {
4
      "id": "comment_123",
5
      "author": {
6
        "id": "user_789",
7
        "username": "commenter",
8
        "avatarUrl": "..."
9
      },
10
      "text": "Great post!",
11
      "likeCount": 45,
12
      "replyCount": 3,
13
      "isLiked": false,
14
      "createdAt": "2024-02-03T10:35:00Z",
15
      "replies": []
16
    }
17
  ],
18
  "pagination": {
19
    "nextCursor": "...",
20
    "hasMore": true
21
  },
22
  "totalCount": 89
23
}

Error Responses

Code	Error	When
400	`INVALID_CONTENT`	Post content violates rules
401	`UNAUTHORIZED`	Missing or invalid token
403	`FORBIDDEN`	User blocked or content restricted
404	`POST_NOT_FOUND`	Post doesn’t exist or is deleted
429	`RATE_LIMITED`	Too many requests
503	`FEED_UNAVAILABLE`	Feed generation temporarily unavailable

Rate limits:

Endpoint	Limit	Window
Feed load	60	per minute
Post creation	10	per hour
Like/comment	100	per minute
Media upload	50	per hour

1
CREATE TABLE objects (
2
    id BIGINT PRIMARY KEY,
3
    type VARCHAR(50) NOT NULL,
4
    data BLOB,
5
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
6
    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
7
    INDEX idx_type_created (type, created_at)
8
);

Object types:

Type	Data Fields
user	username, display_name, avatar_url, bio, follower_count
post	author_id, content, visibility, like_count, comment_count
comment	post_id, author_id, text, like_count
media	post_id, url, type, dimensions, alt_text

Associations Table

1
CREATE TABLE associations (
2
    id1 BIGINT NOT NULL,
3
    assoc_type VARCHAR(50) NOT NULL,
4
    id2 BIGINT NOT NULL,
5
    time BIGINT NOT NULL,
6
    data BLOB,
7
    PRIMARY KEY (id1, assoc_type, id2),
8
    INDEX idx_id1_type_time (id1, assoc_type, time DESC)
9
);

Association types:

Type	id1	id2	Data
follows	follower_id	followee_id	created_at
authored	user_id	post_id	-
liked	user_id	post_id	created_at
commented	user_id	comment_id	-
tagged	post_id	user_id	-

Sharding strategy:

Shard by id1 (source object)
Co-locates user’s follows, likes, authored posts
Enables single-shard queries for common patterns

Feed Cache Schema (Redis)

1
# Pre-computed feed (sorted set)
2
# Score = ranking score or timestamp
3
ZADD feed:{user_id} {score} {post_id}
4

5
# Keep last 500 posts per feed
6
ZREMRANGEBYRANK feed:{user_id} 0 -501
7

8
# Feed metadata (hash)
9
HSET feed:meta:{user_id}
10
    last_generated 1706886400000
11
    version 42
12
    type "ranked"
13

14
# Celebrity posts index (for pull-based merging)
15
ZADD celebrity_posts:{author_id} {timestamp} {post_id}
16

17
# User's recent engagement (for ranking features)
18
ZADD user:engaged:{user_id} {timestamp} {post_id}
19
EXPIRE user:engaged:{user_id} 604800  # 7 days

Post Index (Leaf Servers)

In-memory index structure on leaf servers:

1
interface PostIndex {
2
  // Posts by author, sorted by time (descending)
3
  authorIndex: Map<UserId, SortedSet<PostId, Timestamp>>
4

5
  // Posts by visibility for filtering
6
  visibilityIndex: Map<Visibility, Set<PostId>>
7

8
  // Recent posts global index (for trending)
9
  recentPosts: SortedSet<PostId, Timestamp>
10
}
11

12
interface SortedSet<K, S> {
13
  add(key: K, score: S): void
14
  range(start: S, end: S, limit: number): K[]
15
  remove(key: K): void
16
}

MySQL Persistent Storage

1
-- Users table
2
CREATE TABLE users (
3
    id BIGINT PRIMARY KEY AUTO_INCREMENT,
4
    username VARCHAR(50) UNIQUE NOT NULL,
5
    display_name VARCHAR(100),
6
    email VARCHAR(255) UNIQUE NOT NULL,
7
    password_hash VARCHAR(255) NOT NULL,
8
    avatar_url TEXT,
9
    bio TEXT,
10
    follower_count INT DEFAULT 0,
11
    following_count INT DEFAULT 0,
12
    post_count INT DEFAULT 0,
13
    is_verified BOOLEAN DEFAULT FALSE,
14
    is_celebrity BOOLEAN DEFAULT FALSE,  -- >10K followers
15
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
16
    INDEX idx_username (username),
17
    INDEX idx_email (email)
18
) ENGINE=InnoDB;
19

20
-- Posts table
21
CREATE TABLE posts (
22
    id BIGINT PRIMARY KEY AUTO_INCREMENT,
23
    author_id BIGINT NOT NULL,
24
    content_text TEXT,
25
    content_type ENUM('text', 'image', 'video', 'link') NOT NULL,
26
    visibility ENUM('public', 'friends', 'private') DEFAULT 'public',
27
    like_count INT DEFAULT 0,
28
    comment_count INT DEFAULT 0,
29
    share_count INT DEFAULT 0,
30
    view_count INT DEFAULT 0,
31
    is_deleted BOOLEAN DEFAULT FALSE,
32
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
33
    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
34
    INDEX idx_author_created (author_id, created_at DESC),
35
    INDEX idx_created (created_at DESC),
36
    FOREIGN KEY (author_id) REFERENCES users(id)
37
) ENGINE=InnoDB;
38

39
-- Social graph (follow relationships)
40
CREATE TABLE follows (
41
    follower_id BIGINT NOT NULL,
42
    followee_id BIGINT NOT NULL,
43
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
44
    PRIMARY KEY (follower_id, followee_id),
45
    INDEX idx_followee (followee_id),
46
    FOREIGN KEY (follower_id) REFERENCES users(id),
47
    FOREIGN KEY (followee_id) REFERENCES users(id)
48
) ENGINE=InnoDB;
49

50
-- Engagement (likes, saves)
51
CREATE TABLE post_likes (
52
    user_id BIGINT NOT NULL,
53
    post_id BIGINT NOT NULL,
54
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
55
    PRIMARY KEY (user_id, post_id),
56
    INDEX idx_post (post_id),
57
    FOREIGN KEY (user_id) REFERENCES users(id),
58
    FOREIGN KEY (post_id) REFERENCES posts(id)
59
) ENGINE=InnoDB;

Database Selection Matrix

Data Type	Store	Rationale
Social graph	TAO (MySQL-backed)	Optimized for graph queries, association lists
User profiles	MySQL	ACID, moderate scale
Posts	MySQL + TAO cache	Durability + fast graph queries
Feed cache	Redis	Sub-ms reads, sorted sets for ranking
Post index	Leaf servers (memory)	Ultra-fast candidate retrieval
Media	S3 + CDN	Cost-effective, globally distributed
ML features	Feature store	Consistent features for training/serving
Analytics	ClickHouse	Time-series, aggregations


12 collapsed lines
1
class CandidateRetriever {
2
  private readonly leafClient: LeafClient
3
  private readonly taoClient: TAOClient
4
  private readonly redis: RedisCluster
5

6
  async getCandidates(userId: string): Promise<CandidateSet> {
7
    // 1. Get user's followees
8
    const followees = await this.taoClient.getAssociations(userId, "follows", { limit: 5000 })
9

10
    // 2. Fetch recent posts from each followee (parallel)
11
    const postPromises = followees.map((followee) =>
12
      this.leafClient.getAuthorPosts(followee.id2, {
13
        since: Date.now() - 7 * 24 * 60 * 60 * 1000, // 7 days
14
        limit: 50,
15
      }),
16
    )
17

18
    const postsByAuthor = await Promise.all(postPromises)
19

20
    // 3. Fetch celebrity posts for pull-based merging
21
    const celebrityFollowees = followees.filter((f) => f.data.isCelebrity)
22
    const celebrityPosts = await this.getCelebrityPosts(celebrityFollowees.map((f) => f.id2))
23

24
    // 4. Merge and deduplicate
25
    const allPosts = [...postsByAuthor.flat(), ...celebrityPosts]
26

27
    // 5. Apply eligibility filters
28
    const eligible = await this.filterEligible(userId, allPosts)
29

30
    return {
31
      candidates: eligible,
32
      source: {
33
        fromCache: postsByAuthor.length,
34
        fromCelebrity: celebrityPosts.length,
35
      },
36
    }
37
  }
38

39
  private async filterEligible(userId: string, posts: Post[]): Promise<Post[]> {
40
    // Filter out: blocked authors, hidden posts, already seen
41
    const [blocked, hidden, seen] = await Promise.all([
42
      this.taoClient.getAssociations(userId, "blocks"),
43
      this.redis.smembers(`hidden:${userId}`),
44
      this.redis.smembers(`seen:${userId}`),
45
    ])
46

47
    const blockedSet = new Set(blocked.map((b) => b.id2))
48
    const hiddenSet = new Set(hidden)
49
    const seenSet = new Set(seen)
50

51
    return posts.filter(
52
      (post) =>
53
        !blockedSet.has(post.authorId) &&
54
        !hiddenSet.has(post.id) &&
55
        !seenSet.has(post.id) &&
56
        this.checkVisibility(userId, post),
57
    )
58
  }
59
}

Multi-Stage Ranking


15 collapsed lines
1
class FeedRanker {
2
  private readonly featureStore: FeatureStore
3
  private readonly modelServer: ModelServer
4

5
  async rank(userId: string, candidates: Post[]): Promise<RankedPost[]> {
6
    // Stage 1: First-pass lightweight ranking
7
    const stage1 = await this.firstPassRank(userId, candidates)
8
    const top500 = stage1.slice(0, 500)
9

10
    // Stage 2: Second-pass neural network
11
    const stage2 = await this.secondPassRank(userId, top500)
12
    const top100 = stage2.slice(0, 100)
13

14
    // Stage 3: Final reranking with diversity
15
    const final = await this.finalRerank(userId, top100)
16

17
    return final.slice(0, 50)
18
  }
19

20
  private async firstPassRank(userId: string, candidates: Post[]): Promise<RankedPost[]> {
21
    // Lightweight features (can compute in-process)
22
    const features = candidates.map((post) => ({
23
      postId: post.id,
24
      recency: this.computeRecency(post.createdAt),
25
      authorAffinity: this.getAuthorAffinity(userId, post.authorId),
26
      contentType: post.contentType,
27
      engagementVelocity: this.getEngagementVelocity(post),
28
    }))
29

30
    // Simple linear model for speed
31
    const scores = features.map(
32
      (f) =>
33
        0.3 * f.recency +
34
        0.4 * f.authorAffinity +
35
        0.2 * f.engagementVelocity +
36
        0.1 * this.contentTypeBoost(f.contentType),
37
    )
38

39
    return this.sortByScore(candidates, scores)
40
  }
41

42
  private async secondPassRank(userId: string, candidates: Post[]): Promise<RankedPost[]> {
43
    // Fetch rich features from feature store
44
    const features = await this.featureStore.getBatch(
45
      candidates.map((c) => ({
46
        userId,
47
        postId: c.id,
48
        authorId: c.authorId,
49
      })),
50
    )
51

52
    // Neural network scoring
53
    const scores = await this.modelServer.predict("feed_ranking_v2", features)
54

55
    return this.sortByScore(candidates, scores)
56
  }
57

58
  private async finalRerank(userId: string, candidates: RankedPost[]): Promise<RankedPost[]> {
59
    // Apply diversity rules
60
    const diversified = this.applyDiversity(candidates, {
61
      maxPerAuthor: 2,
62
      contentTypeMix: { image: 0.4, video: 0.3, text: 0.3 },
63
      maxAds: 3,
64
    })
65

66
    // Apply integrity filters (misinformation, policy violations)
67
    const filtered = await this.applyIntegrityFilters(diversified)
68

69
    return filtered
70
  }
71

72
  private applyDiversity(posts: RankedPost[], rules: DiversityRules): RankedPost[] {
73
    const result: RankedPost[] = []
74
    const authorCounts = new Map<string, number>()
75
    const typeCounts = new Map<string, number>()
76

77
    for (const post of posts) {
78
      const authorCount = authorCounts.get(post.authorId) || 0
79
      if (authorCount >= rules.maxPerAuthor) continue
80

81
      // Check content type distribution
82
      const typeCount = typeCounts.get(post.contentType) || 0
83
      const typeRatio = typeCount / (result.length + 1)
84
      const maxRatio = rules.contentTypeMix[post.contentType] || 0.5
85
      if (typeRatio > maxRatio && result.length > 10) continue
86

87
      result.push(post)
88
      authorCounts.set(post.authorId, authorCount + 1)
89
      typeCounts.set(post.contentType, typeCount + 1)
90
    }
91

92
    return result
93
  }
94
}

Fan-out Pipeline


20 collapsed lines
1
class FanoutService {
2
  private readonly CELEBRITY_THRESHOLD = 10_000
3
  private readonly BATCH_SIZE = 1000
4

5
  async fanoutPost(post: Post, author: User): Promise<FanoutResult> {
6
    const followerCount = author.followerCount
7

8
    if (followerCount < this.CELEBRITY_THRESHOLD) {
9
      return this.pushFanout(post, author)
10
    } else if (followerCount < 1_000_000) {
11
      return this.hybridFanout(post, author)
12
    } else {
13
      return this.pullOnly(post, author)
14
    }
15
  }
16

17
  private async pushFanout(post: Post, author: User): Promise<FanoutResult> {
18
    // Get all followers
19
    const followers = await this.getFollowers(author.id)
20

21
    // Process in batches to avoid memory pressure
22
    const batches = this.chunk(followers, this.BATCH_SIZE)
23
    let written = 0
24

25
    for (const batch of batches) {
26
      const pipeline = this.redis.pipeline()
27

28
      for (const followerId of batch) {
29
        // Add to follower's feed with timestamp score
30
        pipeline.zadd(`feed:${followerId}`, post.createdAt.getTime(), post.id)
31
        // Trim to max 500 posts
32
        pipeline.zremrangebyrank(`feed:${followerId}`, 0, -501)
33
      }
34

35
      await pipeline.exec()
36
      written += batch.length
37

38
      // Emit progress for monitoring
39
      this.metrics.increment("fanout.writes", batch.length)
40
    }
41

42
    return {
43
      strategy: "push",
44
      followersReached: written,
45
      duration: Date.now() - post.createdAt.getTime(),
46
    }
47
  }
48

49
  private async hybridFanout(post: Post, author: User): Promise<FanoutResult> {
50
    // Get active followers (engaged in last 7 days)
51
    const activeFollowers = await this.getActiveFollowers(author.id, 7)
52

53
    // Push to active followers only
54
    const pushResult = await this.pushToFollowers(post, activeFollowers)
55

56
    // Mark post for pull-based retrieval by inactive followers
57
    await this.redis.zadd(`celebrity_posts:${author.id}`, post.createdAt.getTime(), post.id)
58
    await this.redis.expire(`celebrity_posts:${author.id}`, 7 * 24 * 60 * 60)
59

60
    return {
61
      strategy: "hybrid",
62
      pushed: pushResult.count,
63
      markedForPull: author.followerCount - activeFollowers.length,
64
    }
65
  }
66

67
  private async pullOnly(post: Post, author: User): Promise<FanoutResult> {
68
    // Only index for pull-based retrieval
69
    await this.redis.zadd(`celebrity_posts:${author.id}`, post.createdAt.getTime(), post.id)
70

71
    // Update author's post index
72
    await this.leafClient.indexPost(post)
73

74
    return {
75
      strategy: "pull",
76
      indexed: true,
77
    }
78
  }
79
}

Cache Consistency (TAO Pattern)

TAO uses a two-tier caching architecture with leasing for consistency:


15 collapsed lines
1
class TAOCache {
2
  private readonly leaseTimeout = 10_000 // 10 seconds
3

4
  async get(id1: string, assocType: string): Promise<Association[] | null> {
5
    // Try follower cache first
6
    const followerResult = await this.followerCache.get(this.cacheKey(id1, assocType))
7

8
    if (followerResult) {
9
      this.metrics.increment("cache.hit.follower")
10
      return followerResult
11
    }
12

13
    // Try leader cache
14
    const leaderResult = await this.leaderCache.get(this.cacheKey(id1, assocType))
15

16
    if (leaderResult) {
17
      this.metrics.increment("cache.hit.leader")
18
      // Populate follower cache
19
      await this.followerCache.set(this.cacheKey(id1, assocType), leaderResult, { ttl: 300 })
20
      return leaderResult
21
    }
22

23
    // Cache miss - fetch from MySQL with lease
24
    return this.fetchWithLease(id1, assocType)
25
  }
26

27
  private async fetchWithLease(id1: string, assocType: string): Promise<Association[]> {
28
    const leaseKey = `lease:${id1}:${assocType}`
29

30
    // Try to acquire lease
31
    const acquired = await this.redis.set(leaseKey, "1", "NX", "PX", this.leaseTimeout)
32

33
    if (!acquired) {
34
      // Another request is fetching - wait and retry
35
      await this.sleep(100)
36
      return this.get(id1, assocType)
37
    }
38

39
    try {
40
      // Fetch from MySQL
41
      const data = await this.mysql.query(
42
        `SELECT * FROM associations
43
         WHERE id1 = ? AND assoc_type = ?
44
         ORDER BY time DESC`,
45
        [id1, assocType],
46
      )
47

48
      // Populate both cache tiers
49
      await Promise.all([
50
        this.leaderCache.set(this.cacheKey(id1, assocType), data, { ttl: 3600 }),
51
        this.followerCache.set(this.cacheKey(id1, assocType), data, { ttl: 300 }),
52
      ])
53

54
      return data
55
    } finally {
56
      // Release lease
57
      await this.redis.del(leaseKey)
58
    }
59
  }
60

61
  async invalidate(id1: string, assocType: string): Promise<void> {
62
    // Delete from both tiers
63
    const key = this.cacheKey(id1, assocType)
64
    await Promise.all([this.followerCache.del(key), this.leaderCache.del(key)])
65
  }
66
}

Engagement Counter (Eventual Consistency)

Engagement counts (likes, comments) use write-behind pattern for efficiency:


12 collapsed lines
1
class EngagementService {
2
  private readonly FLUSH_INTERVAL = 5000 // 5 seconds
3
  private pendingUpdates = new Map<string, EngagementDelta>()
4

5
  async incrementLike(postId: string): Promise<void> {
6
    // Immediate Redis increment for read consistency
7
    await this.redis.hincrby(`post:${postId}`, "like_count", 1)
8

9
    // Buffer MySQL update
10
    this.bufferUpdate(postId, { likes: 1 })
11
  }
12

13
  private bufferUpdate(postId: string, delta: EngagementDelta): void {
14
    const existing = this.pendingUpdates.get(postId) || {
15
      likes: 0,
16
      comments: 0,
17
      shares: 0,
18
    }
19

20
    this.pendingUpdates.set(postId, {
21
      likes: existing.likes + (delta.likes || 0),
22
      comments: existing.comments + (delta.comments || 0),
23
      shares: existing.shares + (delta.shares || 0),
24
    })
25
  }
26

27
  // Periodic flush to MySQL
28
  @Scheduled(FLUSH_INTERVAL)
29
  async flushToMySQL(): Promise<void> {
30
    const updates = new Map(this.pendingUpdates)
31
    this.pendingUpdates.clear()
32

33
    const queries = Array.from(updates.entries()).map(([postId, delta]) =>
34
      this.mysql.query(
35
        `UPDATE posts SET
36
          like_count = like_count + ?,
37
          comment_count = comment_count + ?,
38
          share_count = share_count + ?
39
         WHERE id = ?`,
40
        [delta.likes, delta.comments, delta.shares, postId],
41
      ),
42
    )
43

44
    await Promise.all(queries)
45
  }
46
}

Frontend Considerations

Feed Virtualization

For infinite scroll with thousands of potential posts:


15 collapsed lines
1
interface VirtualFeedConfig {
2
  containerHeight: number
3
  estimatedItemHeight: number
4
  overscan: number // Extra items above/below viewport
5
}
6

7
class VirtualFeed {
8
  private heightCache = new Map<string, number>()
9
  private offsetCache: number[] = []
10

11
  calculateVisibleRange(scrollTop: number, posts: Post[]): { start: number; end: number } {
12
    // Binary search for start index
13
    let start = this.binarySearchOffset(scrollTop - this.config.overscan * 300)
14

15
    // Calculate end based on viewport
16
    let accumulatedHeight = this.offsetCache[start] || 0
17
    let end = start
18

19
    while (
20
      end < posts.length &&
21
      accumulatedHeight < scrollTop + this.config.containerHeight + this.config.overscan * 300
22
    ) {
23
      accumulatedHeight += this.getItemHeight(posts[end])
24
      end++
25
    }
26

27
    return { start, end }
28
  }
29

30
  private getItemHeight(post: Post): number {
31
    // Check cache first
32
    if (this.heightCache.has(post.id)) {
33
      return this.heightCache.get(post.id)!
34
    }
35

36
    // Estimate based on content type
37
    let estimate = 100 // Base height
38
    if (post.content.media?.length > 0) {
39
      estimate += 400 // Image/video
40
    }
41
    if (post.content.text?.length > 200) {
42
      estimate += 50 // Long text
43
    }
44

45
    return estimate
46
  }
47

48
  // Called after actual render to update height cache
49
  onItemRendered(postId: string, actualHeight: number): void {
50
    this.heightCache.set(postId, actualHeight)
51
    this.rebuildOffsetCache()
52
  }
53
}

Optimistic Engagement Updates


10 collapsed lines
1
class FeedStore {
2
  private posts = new Map<string, Post>()
3
  private pendingLikes = new Set<string>()
4

5
  async likePost(postId: string): Promise<void> {
6
    const post = this.posts.get(postId)
7
    if (!post || this.pendingLikes.has(postId)) return
8

9
    // Optimistic update
10
    this.pendingLikes.add(postId)
11
    this.updatePost(postId, {
12
      engagement: {
13
        ...post.engagement,
14
        likeCount: post.engagement.likeCount + 1,
15
        isLiked: true,
16
      },
17
    })
18

19
    // Re-render immediately
20
    this.notifySubscribers(postId)
21

22
    try {
23
      await this.api.likePost(postId)
24
    } catch (error) {
25
      // Rollback on failure
26
      this.updatePost(postId, {
27
        engagement: {
28
          ...post.engagement,
29
          likeCount: post.engagement.likeCount,
30
          isLiked: false,
31
        },
32
      })
33
      this.notifySubscribers(postId)
34
    } finally {
35
      this.pendingLikes.delete(postId)
36
    }
37
  }
38
}

Real-time Feed Updates


12 collapsed lines
1
class FeedStreamManager {
2
  private eventSource: EventSource | null = null
3
  private reconnectAttempt = 0
4

5
  connect(userId: string): void {
6
    this.eventSource = new EventSource(`/api/v1/feed/stream?userId=${userId}`)
7

8
    this.eventSource.addEventListener("new_post", (event) => {
9
      const data = JSON.parse(event.data)
10
      this.onNewPost(data)
11
    })
12

13
    this.eventSource.addEventListener("engagement_update", (event) => {
14
      const data = JSON.parse(event.data)
15
      this.onEngagementUpdate(data)
16
    })
17

18
    this.eventSource.onerror = () => {
19
      this.scheduleReconnect()
20
    }
21
  }
22

23
  private onNewPost(data: NewPostEvent): void {
24
    // Show "New posts available" indicator instead of auto-inserting
25
    // This prevents jarring scroll position changes
26
    this.feedStore.setPendingPosts(data.count)
27
    this.ui.showNewPostsIndicator()
28
  }
29

30
  private onEngagementUpdate(data: EngagementUpdateEvent): void {
31
    // Update counts in place (non-disruptive)
32
    this.feedStore.updateEngagement(data.postId, data)
33
  }
34

35
  loadNewPosts(): void {
36
    // User clicked "Show new posts"
37
    const pending = this.feedStore.getPendingPosts()
38
    this.feedStore.prependPosts(pending)
39
    this.feedStore.clearPendingPosts()
40
    this.ui.scrollToTop()
41
  }
42
}

Prefetching Strategy


10 collapsed lines
1
class FeedPrefetcher {
2
  private readonly PREFETCH_THRESHOLD = 5 // Posts from bottom
3

4
  onScroll(visibleRange: { start: number; end: number }, totalPosts: number): void {
5
    const postsRemaining = totalPosts - visibleRange.end
6

7
    if (postsRemaining < this.PREFETCH_THRESHOLD && !this.loading) {
8
      this.prefetchNextPage()
9
    }
10
  }
11

12
  private async prefetchNextPage(): Promise<void> {
13
    this.loading = true
14

15
    try {
16
      const cursor = this.feedStore.getNextCursor()
17
      const response = await this.api.getFeed({ cursor, limit: 20 })
18

19
      // Add to store but don't re-render yet
20
      this.feedStore.appendPosts(response.posts)
21
      this.feedStore.setNextCursor(response.pagination.nextCursor)
22
    } finally {
23
      this.loading = false
24
    }
25
  }
26
}

Infrastructure

Cloud-Agnostic Components

Component	Purpose	Options
API Gateway	Auth, rate limiting	Kong, Envoy, Nginx
Graph Store	Social graph	Custom TAO-like, Neo4j, DGraph
KV Cache	Feed cache, sessions	Redis, KeyDB, Dragonfly
Message Queue	Fan-out, events	Kafka, Pulsar, NATS
Relational DB	Users, posts	MySQL, PostgreSQL, CockroachDB
Object Store	Media files	MinIO, Ceph, S3-compatible
ML Serving	Ranking models	TensorFlow Serving, Triton, Seldon
Feature Store	ML features	Feast, Tecton, custom

AWS Reference Architecture

Service configurations:

Service	Configuration	Rationale
Aggregator	c5.4xlarge (16 vCPU, 32GB)	CPU-bound ranking computation
Leaf	r5.8xlarge (32 vCPU, 256GB)	Memory-intensive index storage
Fan-out workers	Fargate Spot	Cost-effective async processing
ElastiCache Redis	r6g.2xlarge cluster mode	Sub-ms feed cache reads
Aurora MySQL	db.r6g.4xlarge Multi-AZ	Durability, read replicas for scale
DynamoDB	On-demand mode	TAO-like graph store, auto-scaling
SageMaker	p3.2xlarge (1 GPU)	Neural network inference

Multi-Region Deployment

Multi-region considerations:

Primary region for all writes (US-East)
Read replicas in each region for local read latency
Redis cross-region replication for feed cache
~50-100ms replication lag acceptable for social content
Failover to secondary region if primary unavailable

Conclusion

This design provides a social feed system with:

Sub-500ms feed generation via hybrid fan-out and pre-computed caches
Personalized ranking using multi-stage ML funnel (1000+ signals)
Infinite scalability through disaggregated architecture and sharding
Celebrity handling via pull-based merging avoiding write explosion
Eventual consistency acceptable for social content (strong for engagement)

Key architectural decisions:

Hybrid fan-out balances latency (push) with scalability (pull)
TAO-like graph store optimizes social relationship queries
Multi-stage ranking funnel enables complex ML without latency impact
Two-tier caching with leasing prevents thundering herd

Known limitations:

~1 minute eventual consistency for feed updates
Pull-based celebrity posts may have slightly higher latency
ML model staleness between hourly retraining windows
Complex operational model with multiple specialized tiers

Future enhancements:

Real-time ML model serving for instant personalization
Federated learning for privacy-preserving ranking
Graph neural networks for improved content understanding
Edge-based feed generation for reduced latency

Appendix

Prerequisites

Distributed systems fundamentals (caching, sharding, replication)
ML basics (training vs. serving, feature engineering)
Graph database concepts
Message queue patterns (pub/sub, fan-out)

Terminology

Term	Definition
Fan-out	Distributing a post to multiple follower feeds
TAO	Facebook’s graph store (The Associations and Objects)
Aggregator	Service that queries and combines data from multiple sources
Leaf server	Memory-intensive server storing indexed data
Candidate retrieval	First stage of ranking - gathering potential items
Affinity	Strength of relationship between user and content/author
Leasing	Cache coordination pattern to prevent thundering herd

Summary

Hybrid fan-out (push for regular users, pull for celebrities) balances write amplification with read latency
TAO-like graph storage with objects and associations models social data naturally and enables efficient graph queries
Multi-stage ML ranking (billions → hundreds → 50) enables sophisticated personalization without latency impact
Two-tier regional caching with followers and leaders maintains consistency while providing sub-ms reads
Eventual consistency (~1 minute) is acceptable for social content; engagement counts use write-behind pattern
Scale to 700K RPS feed loads with disaggregated aggregator/leaf architecture

References

Real-World Implementations:

Serving Facebook Multifeed: Efficiency, Performance Gains Through Redesign - Disaggregated architecture achieving 40% efficiency gains
TAO: The Power of the Graph - Facebook’s distributed graph store handling 1B+ reads/second
How Machine Learning Powers Facebook’s News Feed Ranking - ML-based ranking at scale
Cache Made Consistent - Achieving 10 nines cache consistency at Meta
Journey to 1000 Models: Scaling Instagram’s Recommendation System - Instagram’s ML platform
How Twitter Uses Redis to Scale - Twitter’s 105TB Redis deployment

Academic Papers:

TAO: Facebook’s Distributed Data Store for the Social Graph - USENIX ATC 2013
Scaling Memcache at Facebook - NSDI 2013
LiRank: Industrial Large Scale Ranking Models - LinkedIn’s ranking framework

Related Articles:

Design Real-Time Chat and Messaging - WebSocket-based messaging
Design a Notification System - Multi-channel notification delivery