More outages hit Amazon's S3 storage service
Cloud storage service down for eight hours over the weekend
Network World - Amazon.com Inc.'s S3 cloud storage service suffered eight hours of downtime and elevated error rates in the U.S. and Europe Sunday.
The outage lasted several hours longer than a similar problem that hit the service in February, disrupting Web sites that rely on the online Simple Storage Service. The social networking site Twitter was disrupted during both outages.
"Today has been a bad day for many Web sites and start-ups across the Internet," a Geek Zone blogger wrote during this week's outage. "The reason? An Amazon S3 outage. ... One of the most high-profile victims of the current S3 outage is Twitter: Images, such as avatars of users, are currently not being served, because they are all stored on S3."
Amazon described its attempts to fix the problem Sunday on a "service health dashboard" page that the company uses to keep the public up to date on the status of its Web services. In addition to S3, service interruptions were reported with Amazon's Simple Queue Service (SQS), a tool that helps developers move data among distributed components of applications.
Amazon's online storage service is often used in conjunction with its Elastic Compute Cloud, which gives customers access to processing power via the Web. The Elastic Compute Cloud itself did not suffer any downtime on Sunday, but Amazon said the S3 problems prevented registering of new virtual machines on the Compute Cloud, and that some virtual machines could not be launched. Running instances on the Compute Cloud were not affected.
Amazon reported "elevated error rates" with S3 beginning at 9:05 a.m. PST Sunday, and later described the problem as "an issue with the communication between several Amazon S3 internal components." Amazon reported making "incremental progress" at 1:17 p.m., and then two hours later said "no data has been lost during this incident."
By 3:23 p.m., Amazon said service in Europe had been fully restored, but that the U.S. would take longer because it contains a larger number of storage systems. At 5:12 p.m., Amazon said service in the U.S. had been fully restored.
"We will provide more detail on this event once we have completed a full investigation," Amazon said.
Amazon has said the February outage was caused by elevated numbers of authentication requests, and that in response it has added "significant" amounts of capacity to its authentication service and improved the system that monitors the proportion of requests that are authenticated.
Amazon said there was no data loss during that incident, either, because the company stores multiple copies of every object in multiple locations.

