Gmail outage caused by overloaded servers
IDG News Service - A worldwide outage of Google's Gmail online e-mail system on Tuesday was caused by a traffic jam on its servers, according to Google's official Gmail blog.
The problem was that some recent changes designed to improve traffic flow on request routers, servers designed to direct Web queries to the appropriate Gmail server, overloaded the system after workers took some Gmail servers offline to perform routine upgrades.
"As we now know, we had slightly underestimated the load which some recent changes placed on the request routers," Ben Treynor, site reliability Czar wrote on the Gmail blog. "At about 12:30 p.m. Pacific a few of the request routers became overloaded and in effect told the rest of the system "stop sending us traffic, we're too slow!". This transferred the load onto the remaining request routers, causing a few more of them also to become overloaded, and within minutes nearly all of the request routers were overloaded."
The overload resulted in people around the world being unable to access Gmail for about 100 minutes, Treynor said, though he noted that IMAP/POP access and mail processing continued to work normally.
Gmail engineers were alerted to the problem within seconds of the failures and after figuring out what the problem was, brought additional request routers online. Now, Gmail is more than 99.9 percent available to users, he said.
"We've turned our full attention to helping ensure this kind of event doesn't happen again," he wrote.
One fix the company plans to make is to ensure request routers will work better by having them slow down when overloaded instead of refusing to accept traffic. Treynor said the request routers need to have sufficient failure isolation so that a problem in one data center doesn't affect servers in another data center.
The company will work over the next few weeks to make these changes and further improve reliability, he said.
- 12 iPhones Apps That Will Make You a Networking Star
- 10 Careers Robots Are Taking From You
- Big Data Gold Isn't Always Where You Would Expect It
- 6 Tips to Build Your Social Media Strategy
- A walking tour: 33 questions to ask about your company's security
- 15 social media scams
- The 7 elements of a successful security awareness program
- IT Certification Study Tips
- Register for this Computerworld Insider Study Tip guide and gain access to hundreds of premium content articles, cheat sheets, product reviews and more.
- Seven Contact Center Trends You Can't Ignore Rapid changes are underway in the world of traditional contact centers. It starts with the disruptive nature of social media and mobile apps,...
- Top Ten Reasons Customers Choose Siemens Enterprise Communications to Help Transform their Business Trusted by over 75% of the Fortune 500, Siemens Enterprise Communications is the only vendor to provide the complete range of Voice, UCC...
- Amplify collective effort. Dramatically improve performance. Discover why now is the time to revisit the untapped potential of team performance and leverage team collaboration as a vital corporate asset.
- The Untapped Potential of Virtual Teams The results from a recent global research study show that while the vast majority of organizations rely on remote, distributed and mobile team...
- Modernizing Wireless Infrastructure for Today's Mobile and Data Driven Enterprise Find out some of the compelling drivers and unique challenges that the Georgia Dome had to address to prepare the stadium for a...
- 5 Ways to Keep the Heart of Your IT Beating Strong in 2013 Your IT investments should bring you some combination of results, relief, and reward. So how do you make sure your ongoing data center... All Networking White Papers | Webcasts
The old PacBell building at 140 New Montgomery Street, San Francisco, (@140nm) was wired for connectivity long before the needs of a tenant like Yelp would make 21st century demands. But even this telecom landmark needs some major infrastructure improvements to support the companies it expects to move in soon. more