Skip the navigation

Giving Bugs the Boot

Micro-reboot your IT troubles away.

By Todd R. Weiss
November 8, 2004 12:00 PM ET

Computerworld - It's been an IT mantra for years: "When all else fails, reboot." Rebooting often works, but isn't there a better approach to the problem of buggy software that crashes your computer and takes your valuable data with it?
That idea has been the focus of researchers at Stanford University and the University of California, Berkeley, who have been working feverishly to find better ways to bring computers back from the brink of disaster.
The researchers are seeking a fresh alternative to rebooting. Thinking backward, they reasoned that it might be a good idea to give up on the impossible job of making bug-free software and instead look for ways to recover from failures without losing data or time.
That's the concept behind "recovery-oriented computing," a 180-degree turn from traditional thinking. The idea is that since software can't be created without crash-causing flaws, it should be built to reboot much faster, allowing users to get back to work almost instantly.
"The idea is pretty simple: If availability is the fraction of time that you're up, then recovering fast is more critical than reducing the number of times that crashes happen," says David Patterson, a computer science professor at UC Berkeley.
"In the dawn of computing, people thought software bugs would go away, and they haven't, so now we need ways to co-exist with them," he says. "I think it's a fact to live with rather than a problem to be solved."
One way to do that is through an evolving technique called micro-rebooting, which quickly reboots just enough of the program processes to get the system stabilized and back on track for the user.

Giving Bugs The Boot
Image Credit: Melinda Beck
A micro-reboot is specific to one problematic area of the software's code and doesn't affect other parts of the application, so data in the processing pipeline is unaffected by the reboot.
Led by Patterson and Armando Fox, an assistant professor of computer science at Stanford, the project began in late 2000. Patterson, Fox and a team of graduate students had seen evidence that systems dependability could be improved. Some IT systems for use in avionics, spacecraft and health care were ultradependable because they had to be, but they were costly and complex, and that kind of reliability was impractical for typical IT use. Another way had to be found.
Heading Off a Crash
The researchers are experimenting with algorithms that watch over system processes and sense when something has gone awry, and a crash is imminent. The algorithms focus on determining the normal baseline


What is Tech Briefcase?
TechBriefcase is a new, free service where IT Professionals can Search, Store and Share IT white papers and content like this. Learn more
Bookmark content
Speed up your research efforts with content across the web.
Search and Store
Find the white papers you need. Create folders for any topic.
View Anywhere
Open your briefcase on your iPhone, tablet or desktop. Share with colleagues.
Don't have an account yet?
Additional Resources
Security KnowledgeVault
WHITE PAPER
Security is not an option. This KnowledgeVault Series offers professional advice how to be proactive in the fight against cybercrimes and multi-layered security threats; how to adopt a holistic approach to protecting and managing data; and how to hire a qualified security assessor. Make security your Number 1 priority.

Read now.

Cut Communications Costs Once and for All
WHITE PAPER
New IP-based communications systems are being deployed by small and midsized businesses at a rapid rate. Learn how these organizations are enabling faster responsiveness, creating better customer experiences, speeding office or mobile interactions, and dramatically reducing existing communications costs.

Read now.

App Development White Papers
Workload Automation Challenges and Opportunities
This Executive Brief discusses IDC's perspective on how enterprise workload management requirements are changing and highlights the ways that workload automation solutions can...
Enabling Remote Employees with High Quality Video
In this paper, we analyze the delivery of live and on-demand mobile video content. It focuses on specific ways in which organizations can...
Traditional Backup is Dead - Are You Prepared?
Conventional backup and recovery approaches are not robust enough to meet today's data and information management challenges, let alone those of tomorrow. A...
Redefining Backup & Recovery: A call to CIOs
Re-evaluate your data management strategy and embrace new ways to store, access and protect your data through virtualization and cloud computing - all...
CIO Guide to Virtual Server Data Protection
Server virtualization is changing the face of the modern data center. CIOs are looking for ways to virtualize more applications, faster across the...
All App Development White Papers
App Development Webcasts
Redefine Expectations in the Data Center
Need to do more with less? Watch this video to learn how HP ProLiant Gen8 servers can help your business deploy servers three...
BMC Control-M - Single Point of Control Demo
With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
Operational Analytics - Changing the Competitive Dynamics of the Business
Date/Time: June 5, 2012, 11:00 a.m., EDT, 4:00 p.m. BST / 3:00 p.m. UTC

Please join us for this webcast, as Dr. Barry...
A Geek's Guide to Presenting to Business People
Live Webcast: Wednesday, June 20th at 1:00 PM EDT

Join this live webinar with Paul Glen, author of Leading Geeks, to learn how to...
Today's NAS: A Solution Beyond Old Limits
Date: Tuesday, July 17, 2012 2:00 PM EDT

Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management...
All App Development Webcasts
Newsletter Sign-Up

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all newsletters | Privacy Policy
IT Jobs