Microsoft blames Azure outage on OS upgrade

It promises to handle future problems 'quickly and gracefully'

By Elizabeth Montalbano
March 18, 2009 12:00 PM ET

IDG News Service - Microsoft is blaming a routine operating system upgrade for an outage that hit its Windows Azure cloud computing infrastructure over the weekend.

In a post on its Windows Azure blog, Microsoft said that, after the upgrade, "the deployment service within Windows Azure began to slow down due to networking issues" on Friday. "This caused a large number of servers to time out and fail," which brought some applications down, according to the post.

Microsoft said the Fabric Controller, a feature built into Azure that manages network resources and performs functions such as load balancing, automatically began taking steps to recover applications that were affected and move them to different servers. Only applications that were running in a single instance on the network went down during the outage, the company said.

"Very few applications running multiple instances went down, although some were degraded due to one instance being down," according to the post.

The ability to perform management tasks from Azure's Web portal was also unavailable for many applications during the outage, which lasted from about 10:30 p.m. Pacific time on Friday to 8:30 p.m. Pacific time on Saturday.

Microsoft said in the post that it is refining and tuning Azure's recovery algorithm so that any future malfunctions will be handled "quickly and gracefully."

The company also recommends that people running applications in Azure deploy them in multiple instances, and it said in the post that it will make two instances the "default in our project templates and samples."

"We will not count the second instance against quota limits, so [users] can feel comfortable running two instances of each application role," Microsoft said.

Microsoft confirmed late Monday that Azure users suffered an overnight outage over the weekend during which their applications weren't available.

Currently only a test release of Azure is available, with some early adopters running applications on it. Users can't expect an early test release of a product to run smoothly without hiccups, and Azure is a proving ground for how well Microsoft can support the development and deployment of hosted enterprise applications, for which even a short amount of downtime can pose a big problem.

Moreover, last week both Google and Microsoft had outages on their Gmail and Hotmail e-mail services. Outages raise questions about the ability of these companies and other online service providers to maintain a consistent quality of service for end users over the long term.

Microsoft unveiled Azure at its Professional Developers Conference (PDC) in Los Angeles in October, and according to public comments made by CEO Steve Ballmer last month, it plans to make the infrastructure generally available by November at this year's PDC.

Reprinted with permission from IDG.net. Story copyright 2010 International Data Group. All rights reserved.