TikTok’s Edge Caching Strategy for Low Latency Video Delivery
Tech · 6 min read
TikTok relies on a multi-tiered edge CDN that blends proactive replication with real-time popularity prediction. Videos are chunked into short segments and annotated with compact metadata enabling rapid client-side prefetching. Edge nodes prioritize hot content based on regional trending scores computed in seconds, while colder content is fetched from mid-tier caches or origin storage.
A key innovation is TikTok’s lightweight popularity forecast which runs at the edge: micro-models predict near-term spike probability using early engagement velocity, creator reach, and local social graph signals. Popular items are proactively replicated to more nodes; uncertain items get guarded caching with ephemeral replication. This reduces tail latency during sudden viral events without bloating global storage.
Operationally, the system accepts some cache misses during the warm-up window but minimizes user-perceived stuttering through adaptive bitrate and client prebuffering. The teardown shows how recommender-driven demand creates unique CDN design challenges: capacity planning must account for algorithmic amplification, and caching policies must be tightly coupled with prediction systems to meet strict latency SLAs.