Skip the navigation

Twitter blames two-hour failure on dual data-center crashes

Two parallel, redundant servers failed at about the same time, the company says

By Stephen Lawson
July 26, 2012 07:44 PM ET

IDG News Service - A Twitter outage on Thursday that lasted as long as two hours for some users was caused by separate data centers failing at nearly the same time, the company said in an apologetic blog post.

Twitter went down between about 8:20 a.m. and 9 a.m. Pacific Time on Thursday and was back in action by about 10:25 a.m., wrote Mazen Rawashdeh, vice president of engineering. Though some users suspected an overload of Tweets related to the Olympic Games, which opens on Friday in London, that was not the cause of the outage.

Instead, two data centers that operate in parallel for redundancy both failed, in what Rawashdeh called an "infrastructural double whammy."

"What was noteworthy about today's outage was the coincidental failure of two parallel systems at nearly the same time," Rawashdeh wrote. "We are investing aggressively in our systems to avoid this situation in the future."

It was Twitter's second outage in about six weeks. On June 21, the microblogging service went down about 9 a.m. Pacific and started to come back just after 10 a.m., only to fail again before full recovery began after 11 a.m. The company blamed that outage on a cascading bug, a type of problem that spreads from one software element to others.

Stephen Lawson covers mobile, storage and networking technologies for The IDG News Service. Follow Stephen on Twitter at @sdlawsonmedia. Stephen's e-mail address is stephen_lawson@idg.com

Reprinted with permission from IDG.net. Story copyright 2014 International Data Group. All rights reserved.
Our Commenting Policies
Consumerization of IT: Be in the know
consumer tech

Our new weekly Consumerization of IT newsletter covers a wide range of trends including BYOD, smartphones, tablets, MDM, cloud, social and what it all means for IT. Subscribe now and stay up to date!