Giving Bugs the Boot
Micro-reboot your IT troubles away.
Computerworld - It's been an IT mantra for years: "When all else fails, reboot." Rebooting often works, but isn't there a better approach to the problem of buggy software that crashes your computer and takes your valuable data with it?
That question has been the focus of researchers at Stanford University and the University of California, Berkeley, who have been working feverishly to find better ways to bring computers back from the brink of disaster.
The researchers are seeking a fresh alternative to rebooting. Thinking backward, they reasoned that it might be a good idea to give up on the impossible job of making bug-free software and instead look for ways to recover from failures without losing data or time.
That's the concept behind "recovery-oriented computing," a 180-degree turn from traditional thinking. The idea is that since software can't be created without crash-causing flaws, it should be built to reboot much faster, allowing users to get back to work almost instantly.
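A rough way to see why faster recovery pays off is to treat availability as uptime divided by total time. The sketch below assumes the common steady-state approximation MTTF / (MTTF + MTTR), where MTTF is the mean time between crashes and MTTR is the mean time to recover. The article does not give that formula; the function name and every number are illustrative.

```python
def availability(mttf_hours: float, mttr_hours: float) -> float:
    """Fraction of time the system is up, assuming MTTF / (MTTF + MTTR)."""
    return mttf_hours / (mttf_hours + mttr_hours)


# Hypothetical system: crashes every 1,000 hours, takes 1 hour to recover.
baseline = availability(mttf_hours=1000.0, mttr_hours=1.0)

# Option A: make the software crash ten times less often.
fewer_crashes = availability(mttf_hours=10000.0, mttr_hours=1.0)

# Option B: keep the same bugs but recover ten times faster.
faster_recovery = availability(mttf_hours=1000.0, mttr_hours=0.1)

for label, value in [
    ("baseline", baseline),
    ("fewer crashes", fewer_crashes),
    ("faster recovery", faster_recovery),
]:
    print(f"{label:>16}: {value:.6f}")
```

Under this assumed formula only the ratio of recovery time to time between crashes matters, so the two options yield the same availability. The recovery-oriented bet is that cutting recovery time is the more attainable of the two.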
"The idea is pretty simple: If availability is the fraction of time that you're up, then recovering fast is more critical than reducing the number of times that crashes happen," says David Patterson, a computer science professor at UC Berkeley.
"In the dawn of computing, people thought software bugs would go away, and they haven't, so now we need ways to co-exist with them," he says. "I think it's a fact to live with rather than a problem to be solved."
One way to do that is through an evolving technique called micro-rebooting, which quickly reboots just enough of the program processes to get the system stabilized and back on track for the user.
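The general shape of a micro-reboot can be sketched in a few lines. This is a toy illustration only, not the researchers' implementation: the `Component` and `Supervisor` classes, their methods and the fault simulation are all hypothetical names invented for the example.

```python
from typing import Dict, List


class Component:
    """A restartable piece of a larger application (hypothetical)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.healthy = True
        self.restarts = 0

    def handle(self, request: str) -> str:
        if not self.healthy:
            raise RuntimeError(f"{self.name} has failed")
        return f"{self.name} handled {request}"

    def restart(self) -> None:
        # Reinitialize only this component's state.
        self.healthy = True
        self.restarts += 1


class Supervisor:
    """Routes requests and micro-reboots a component when it fails."""

    def __init__(self, components: List[Component]) -> None:
        self.components: Dict[str, Component] = {c.name: c for c in components}

    def dispatch(self, name: str, request: str) -> str:
        component = self.components[name]
        try:
            return component.handle(request)
        except RuntimeError:
            # Restart just the failed component, then retry once.
            component.restart()
            return component.handle(request)


cart = Component("cart")
search = Component("search")
supervisor = Supervisor([cart, search])

cart.healthy = False  # simulate a fault in one component
result = supervisor.dispatch("cart", "add item")
print(result, "| cart restarts:", cart.restarts, "| search restarts:", search.restarts)
```

In this sketch the failure of one component never touches the other, which is the point of restarting "just enough" of the program rather than the whole system.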

Image Credit: Melinda Beck
Led by Patterson and Armando Fox, an assistant professor of computer science at Stanford, the project began in late 2000. Patterson, Fox and a team of graduate students had seen evidence that systems dependability could be improved. Some IT systems for use in avionics, spacecraft and health care were ultradependable because they had to be, but they were costly and complex, and that kind of reliability was impractical for typical IT use. Another way had to be found.
Heading Off a Crash
The researchers are experimenting with algorithms that watch over system processes and sense when something has gone awry and a crash is imminent. The algorithms focus on determining the normal baseline


