
RIM global outage caused by core switch failure; fix under way

Backup system also didn't work, RIM explains

October 11, 2011 06:30 PM ET

Computerworld - BlackBerry service delays experienced by users around the world on Tuesday were caused by a core switch failure within the infrastructure of Research In Motion (RIM), the company said late Tuesday.

A RIM spokesman said service began returning to normal around 2 p.m. Eastern time, although there would be further delays as data backlogs were cleared. It was the second outage -- or "delay," as RIM put it -- in two days affecting users in numerous countries.

RIM's system is designed to fail over to a backup switch, but the failover system "did not function as previously tested," according to a statement issued by RIM at 5 p.m. Eastern time.

When the failover did not function, a backlog of data was generated. The company is working to clear that backlog.

"RIM has failed again at what plagued them in past outages, which is to provide a comprehensive disaster recovery solution," Ken Dulaney, an analyst at Gartner, said after the cause of the outage had been made public.

While it's true that switches can fail, "there should be automatic ways in which the system recovers from this type of event," Dulaney said. "Any vendor who runs this type of mission-critical service must constantly be reviewing disaster recovery solutions."

The latest problems occurred in two phases, with a 12-hour outage Monday morning affecting some BlackBerry users in Europe, the Middle East and Africa, according to RIM. The company said that problem was fixed, but it didn't explain the cause.

Then at about 10 a.m. Eastern time Tuesday, wireless carriers in the U.K. and Egypt reported outages that continued for hours.

RIM said an hour later that the delays affected some customers in South America, Europe, the Middle East, Africa and India, but it didn't immediately offer an update about the underlying problem.

Tweets and other reports blamed a server outage in Slough, England, where RIM operates a data center, but the company would not comment on those reports. The Slough data center would serve much of Europe and the Middle East, analysts said. RIM also runs a data center near its headquarters in Waterloo, Ontario.

But a data center outage in the U.K. or Canada probably wouldn't explain service problems in South American countries, such as Brazil, Chile and Argentina, analysts noted.

RIM doesn't usually explain the cause of its outages and disruptions. In the past, those outages have lasted one or two days and have only affected a certain region of a country or a portion of a continent, not several continents as happened Monday and Tuesday.

In March 2010, there was an outage in both North America and the U.K. on Wi-Fi-ready BlackBerry devices that were not connected to Wi-Fi. A more severe December 2009 outage in North America was related to a BlackBerry Messenger update.

Wireless carriers, including T-Mobile UK and Vodafone Egypt, had informed their customers via Twitter at 10 a.m. Eastern time Tuesday that RIM was working on the problems. T-Mobile UK said the question of whether customers were owed refunds would be a matter for RIM to address.

Users on BlackBerryForums.com also took note of the problems Monday. One contributor, MrTuck, reported that "50% of the population with BlackBerries across Europe, Middle East and Africa are unable use their Internet, BlackBery Messenger, Facebook Twitter, Email and other applications." He noted that calls and texts were working normally at the time.

Some U.K. reports said the Monday problem seemed to be related to BlackBerry Internet Service customers, who are mostly individual users and small businesses.

Dulaney said the problem Tuesday seemed to be related to both BIS and BlackBerry Enterprise Server (BES), which larger businesses use to route email and other functions through a server placed inside the corporate firewall for added management and security. An IDG News Service editor based in Paris who uses a BES in Boston reported that he was not affected by the Monday outage. However, he said that he was affected by the Tuesday problems and could not receive email.

RIM has not said whether BIS, BES or both were affected in either Monday's or Tuesday's delays. RIM also didn't explain what it meant by a "delay," since some users posted comments indicating that they could not receive certain services at all.

Matt Hamblen covers mobile and wireless, smartphones and other handhelds, and wireless networking for Computerworld. Follow Matt on Twitter at @matthamblen, or subscribe to Matt's RSS feed. His email address is mhamblen@computerworld.com.



