Open In App

How a Cache Stampede Caused One of Facebook’s Biggest Outages

On September 23, 2010, Facebook experienced one of its most severe outages to date, affecting hundreds of thousands of users internationally. The website online remained inaccessible for over 4 hours, causing extensive disruption and frustration.

The root reason for the outage changed into a cache stampede, a phenomena that happens when a large number of users try to access a cached resource at the same time, overloading the information supply and causing a series of failure.



A cache stampede, also known as cache thrashing or dog-piling, occurs when a massive range of requests try, and get entry to a specific resource that isn’t present inside the cache. This usually takes place when the cached object expires or is invalidated, and multiple requests attempt to repopulate the cache concurrently.

How a cache stampede possibly cause problems

What happened in the case of Facebook?

In the case of facebook, there was a problem because someone made a mistake in how things were set up. This mistake caused a lot of stored information i.e cache to become useless all at once, and many people tried to get the new information from the main server all at the same time. The main server couldn’t handle so many requests at once, and it crashed, causing the whole system to stop working.

The cache stampede got worse because of a computerized system that made mistakes in handling errors. It wrongly thought that a wrong cache entry was a real error in the information. This caused a loop of confusion, where the wrong access to the cache made users question the main server. The main server then couldn’t handle the overload, making more wrong cache entries and causing more requests. This repeating cycle continued until the whole system broke down.

The Facebook outage highlighted the vital position of caching in current net applications and the potential risks related to cache stampedes.

To prevent thses occurrences again in the future, Facebook applied numerous measures, which includes:

The Facebook outage served as a precious lesson for the enterprise, emphasizing the importance of designing and implementing caching structures with attention for ability disasters and imposing suitable safeguards to save you cache stampedes.

Article Tags :