Who gets blame for Amazon outage?
Reliability of cloud services makes customers complacent; many don't plan for worst-case scenarios
Computerworld - Amazon.com has promised to provide a "detailed post-mortem" on the root causes of the prolonged outage of its cloud services in recent days. Users of the Amazon services, meanwhile, may also have to explain how they got caught up in the outage.
The ensuing conversations may be uncomfortable for both Amazon and its cloud customers -- perhaps even more so for users of the services.
Cloud services overall have been remarkably reliable, which may be fostering a dangerous complacency among customers who are putting too must trust in them. This is another old and familiar story of technology hubris, one that was famously illustrated by another tech marvel, the unsinkable Titanic.
In this case, it is IT managers who will have to explain to their users -- and to their companies' executives -- why they didn't have a lifeboat.
Amazon's partial outage, which began Thursday and seemed largely resolved today, was an exceptional event.
Based on data compiled by AppNeta, the uptime reliability of 40 of the largest providers of cloud-based services, including Amazon, Google, Azure and Salesforce.com, shows how well cloud providers are delivering uninterrupted services. The performance management and network monitoring firm, known as Apparent Networks until this week, captures minute-by-minute uptime and other data from cloud providers used by its customers.
The overall industry yearly average of uptime for all the cloud services providers monitored by AppNeta is 99.948%, which is equal to 273 minutes of unavailability per year.
The worst providers clock in at 99.92%, or 420 minutes of unavailability each year.
The best providers are at 99.9994%, or three minutes of unavailability each year.
The takeaway for cloud users looking at the AppNeta data is that the risk of an outage is generally very low.
But that's not how the world works.
For example, Ken Brill, founder of the Uptime Institute, which researches data center issues, points to Japan's Fukushima Nuclear Power Plant. For 40 years, there were no problems at the plant. Then an earthquake and tsunami that hit in March disabled the facility with catastrophic consequences.
Brill expects that a post-mortem on the nuclear plant will show at least 10 things that could have been done to help avoid that failure and reduce the magnitude of damage and would have made it easier or faster to recover from.
The Amazon post-mortem will likely show something similar, said Brill.
Despite the redundancies and backups built into the Amazon cloud, "you hit a combination of events for which the backups don't work," he said.
Users see the promise of cloud technology as a way to reduce costs and be greener, but "that [also] means concentrating processing in fewer, bigger places," said Brill. Thus, when something goes wrong, "it has a bigger impact."
Cloud Watch
- Microsoft pitches SkyDrive over iCloud to Mac Office users
- Can Dropbox, other cloud providers survive Google Drive?
- Google Drive could be a boon -- and a headache -- for IT
- With Google Drive, 'personal cloud' will soon overshadow the PC
- What to consider before signing up for Google Drive
- Amazon cloud accessed daily by a third of all 'Net users
- HP offers its view of cloud's future
- Moving to the cloud in 2012? Look out for these pitfalls
- Microsoft, HP unveil joint cloud offering
- Feds launch cloud security standards program


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Finding the right cloud solutions for your organization
- HP is driving the evolution of what we call the Instant-On Enterprise. It is an enterprise that embeds technology into everything it does...
- Seven Priorities for Integrated Network Management - How HP Intelligent Management Center Delivers an Enterprise-class Solution
- This white paper describes the major requirements for network management solutions to help the organizations become more profitable, efficient and reliable.
Intel and the... - Building Cloud-Optimized Data Center Networks white paper
- Enterprises are turning to the Cloud to improve business agility, reduce expenses and accelerate business innovation. Cloud computing redefines the way IT assets...
- Converged Storage: Utility Storage - The Ideal Platform for Virtual and Cloud Computing
- Server virtualization has transformed corporate IT -- companies have enjoyed major cost savings and have gained flexibility and efficiency. But this has also...
- The Best Way to Build a Cloud -- HP CloudSystem Matrix and HP 3PAR Utility Storage provide solid, flexible foundation
- Learn how HP CloudSystem Matrix and HP 3PAR Utility Storage provide a solid, flexible foundation for your cloud environment.
Intel and the Intel logo...
All Cloud Computing White Papers
- Unlock the Value of Cloud Computing with Workload Automation
- Learn how to get the most from your cloud investment in our on-demand webinar from BMC and InformationWeek. You'll hear how integrating the...
- Get the Most from Your Cloud Investment
- Learn how to get the most from your cloud investment in our on-demand webinar from BMC and InformationWeek. You'll hear how integrating the...
- Must have Tools and Techniques to Optimize the Sales Pipeline and Win more Deals
- In this webcast, Vantage Point Performance's Michelle Vazzana will reveal how to coach your reps to better performing pipelines.
- Sales Effectiveness in the New Sales Paradigm - A Webcast Featuring the Latest Forrester Research Study
- In this webcast produced by the Sales Management Association (SMA), Forrester's Scott Santucci will explore the new sales paradigm and discuss how businesses...
- Virtualization 101: Launching into Cloud Computing for SMBs
- In the next year at least half of all small to mid- businesses will move to virtualization. Will yours be among them? The... All Cloud Computing Webcasts
