
Hard Data

February 26, 2007 12:00 PM ET

Computerworld - No theory is ever as good as lots of real-world data. So here, based on lots of real-world data, is what you should do to minimize problems with hard disk drives: a) burn them in rigorously; b) replace them as soon as they start throwing errors, especially scan errors; and c) retire them before they turn three years old. Oh, and d) remember that none of those measures is a substitute for regular backups.

That’s the gist of a pair of amazing studies presented at the FAST ’07 storage conference this month. Two separate research groups each collected data on about 100,000 disk drives, some of which failed. Then they crunched the numbers to identify how the drives failed, what they (mainly) failed from and what they (mostly) didn’t fail from.

And ho boy, do they ever fail. Hard drives are the most commonly replaced hardware item in many data centers, and they account for 16% of all hardware-related outages. Anything that tells us how to keep them from dropping dead is money in the bank for IT shops.

One of the studies, from Carnegie Mellon University, got its statistics from a wide range of sites, including the Los Alamos National Laboratory, the Pittsburgh Supercomputing Center and various Internet service providers. (You can find that study online at www.usenix.org/events/fast07/tech/schroeder.html.)

The other study sifted through data from Google’s automated system for tracking performance of drives in its own huge storage farms. That one’s at http://labs.google.com/papers/disk_failures.pdf.

If those two populations sound very much alike — well, listen harder. High-performance computing centers tend to buy gear with high-performance specs. Google, on the other hand, is notoriously cheap when it comes to hardware — it buys garden-variety hard drives in large lots from whoever is offering the best deal that particular week.

But it turns out that high-end and consumer drives have a lot in common. For one thing, they typically don’t last the five years that drive vendors say they should, at least not in server-farm settings. Drive failures at Google take a big jump once drives get to be more than two years old. And according to the Carnegie Mellon team, those rising failure rates never level off — they just keep going up as drives get older.

Think using a drive a lot will make it much more likely to fail? Nope, say the guys from Google. Low-utilization drives fail at almost exactly the same rate as high-utilization drives.

Think RAID is a guarantee against a storage catastrophe? Don’t believe it, say the Carnegie Mellon folks. According to their real-world data, in RAID 5 arrays, when one drive fails, another drive failure will often happen much sooner than it theoretically should — maybe even before you’ve replaced the bad drive and rebuilt the data set on the RAID array.
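For a sense of what "theoretically should" means here, the sketch below works out the textbook estimate: the chance that a second drive dies during the rebuild window if drive failures are independent and exponentially distributed. That model is an assumption for illustration, not something taken from either study, and the array size, the MTTF figure and the rebuild time are hypothetical inputs. The Carnegie Mellon finding is that real arrays do worse than this kind of estimate suggests.

```python
import math


def second_failure_probability(n_drives, mttf_hours, rebuild_hours):
    """Chance that at least one surviving drive fails during the rebuild.

    Assumes independent, exponentially distributed drive lifetimes
    (the idealized textbook model, not the behavior the studies observed).
    """
    if n_drives < 3:
        raise ValueError("RAID 5 needs at least three drives")
    survivors = n_drives - 1  # one drive has already failed
    return 1.0 - math.exp(-survivors * rebuild_hours / mttf_hours)


# Hypothetical inputs: 8-drive array, 1,000,000-hour MTTF, 24-hour rebuild.
p = second_failure_probability(n_drives=8, mttf_hours=1_000_000, rebuild_hours=24)
print(f"Idealized chance of a second failure during rebuild: {p:.6f}")
```

Under those assumptions the number comes out tiny, which is exactly why the real-world data is unsettling: if failures cluster, the independence assumption behind the estimate breaks down and the true risk is higher.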


