Microsoft admits communications and tech problems during Office 365 outages
Lync Online and Exchange Online went down for hours in separate incidents earlier this week
IDG News Service - A "unique" breakdown coupled with a previously unknown flaw in Exchange Online caused Tuesday's extensive outage, and to make matters worse, the service disruption alert system also malfunctioned, leaving some affected customers in the dark.
So said Rajesh Jha, corporate vice president of Office 365 engineering, in an incident report posted to the Office 365 support forum in which he also addressed another separate, prolonged Lync Online outage from Monday.
"I want to apologize on behalf of the Office 365 team for the impact and inconvenience this has caused. Email and real-time communications are critical to your business, and my team and I fully recognize our accountability and responsibility as your partner and service provider," he wrote.
For customers on U.S. Eastern time, the Exchange Online outage covered virtually the entire workday.
The main selling point from Microsoft, Google, Amazon and other providers of cloud software and computing services is that their customers don't need to worry about maintaining on-premises servers, patching applications and rebooting systems that crash.
While no one expects even these mighty technology companies to be perfect, an email outage that lasts for almost nine hours during a workday is sure to plant the seeds of doubt on business managers about the wisdom of turning off their on-premises email servers and trusting this essential communications service to a cloud provider.
The second-guessing is bound to be even more intense when the email breakdown happens the day after a significant outage affecting Lync Online, which Office 365 customers use for instant messaging, presence, audio communications, video conferencing, Web meetings and, in some cases, IP telephony.
Many were IT professionals who were fielding complaints from their frazzled users, while having no control over the problem and little information from Microsoft about its cause and estimated time of resolution.
Jha addressed this breakdown in communications, saying that during the Exchange Online incident "we also experienced a problem with our Service Health Dashboard (SHD) publishing process, meaning not all impacted customers were notified in a timely way which we realize was frustrating and this has since been addressed."
For Microsoft, back-to-back outages of this magnitude are poisonous, embroiled as it is in a vicious fight with Google in the cloud email and collaboration suite market.
Jha said the outages affected Office 365 data centers in North America, but he didn't come close to clarifying how many customers were hit, which hurts Microsoft's attempts at transparency. Asked for this information twice this week by the IDG News Service, Microsoft declined to provide it. Customers will receive a formal, detailed report on the incidents later, so maybe it will include details about the scope of the outages.
- Scaling SaaS Delivery for Long-Term Success This paper provides recommendations for SaaS executives on using the cloud to increase market share and retaining their customers amid a growing number...
- Infographic: The Cloud Skills Gap Crisis Get the facts and stats on how the skills IT departments need and those they currently have do not always match, and what...
- Whitepaper: 10 Critical Requirements of a Successful Cloud Application Growing interest in cloud computing has prompted almost every enterprise software vendor to claim it's "in the cloud." However, in the industry's rush...
- Whitepaper: Real CIOs of the Cloud As cloud computing becomes the dominant technology of this decade, today's CIOs are the catalysts for change. This whitepaper published by Techweb features...
- EMC perspective on hybrid cloud Listen to the EMC Perspective on Hybrid Cloud: To Deliver ITaaS, you need Hybrid Cloud. Brian Gracely, Senior Director, Cloud Solutions, delivers EMC's...