More outages hit Amazon's S3 storage service
Cloud storage service down for eight hours over the weekend
Network World - Amazon.com Inc.'s S3 cloud storage service suffered eight hours of downtime and elevated error rates in the U.S. and Europe Sunday.
The outage lasted several hours longer than a similar problem that hit the service in February, disrupting Web sites that rely on the online Simple Storage Service. The social networking site Twitter was disrupted during both outages.
"Today has been a bad day for many Web sites and start-ups across the Internet," a Geek Zone blogger wrote during this week's outage. "The reason? An Amazon S3 outage. ... One of the most high-profile victims of the current S3 outage is Twitter: Images, such as avatars of users, are currently not being served, because they are all stored on S3."
Amazon described its attempts to fix the problem Sunday on a "service health dashboard" page that the company uses to keep the public up to date on the status of its Web services. In addition to S3, service interruptions were reported with Amazon's Simple Queue Service (SQS), a tool that helps developers move data among distributed components of applications.
Amazon's online storage service is often used in conjunction with its Elastic Compute Cloud, which gives customers access to processing power via the Web. The Elastic Compute Cloud itself did not suffer any downtime on Sunday, but Amazon said the S3 problems prevented the registration of new virtual machines on the Compute Cloud and kept some virtual machines from being launched. Instances already running on the Compute Cloud were not affected.
Amazon reported "elevated error rates" with S3 beginning at 9:05 a.m. PST Sunday, and later described the problem as "an issue with the communication between several Amazon S3 internal components." Amazon reported making "incremental progress" at 1:17 p.m., and then two hours later said "no data has been lost during this incident."
By 3:23 p.m., Amazon said service in Europe had been fully restored, but that the U.S. would take longer because it contains a larger number of storage systems. At 5:12 p.m., Amazon said service in the U.S. had been fully restored.
"We will provide more detail on this event once we have completed a full investigation," Amazon said.
Amazon has said the February outage was caused by an elevated number of authentication requests, and that in response it has added "significant" amounts of capacity to its authentication service and improved the system that monitors the proportion of requests that are authenticated.
Amazon said there was no data loss during that incident, either, because the company stores multiple copies of every object in multiple locations.