Disk systems will repair themselves or can be left unrepaired for years.
Computerworld - You can fly a two-engine plane with one engine, but how many passengers would want to be on it?
That's the idea behind "bulletproof storage," a concept that IBM has been developing for two years and plans to begin unveiling incrementally over the next one to three years.
"I think the basic idea we're going after is we really want the storage system to be something the customer just doesn't worry about," says Jai Menon, an IBM fellow and chief technology officer of storage systems.
IBM's technology initiative deals with fault tolerance in every part of a storage system: disk, controller, network cards, power supplies and software. By building more-robust storage systems that can defer replacement of failed parts for up to three years because of redundant components, IBM believes it can also eliminate many human errors that happen when failing components are replaced.
A Matter of Time
According to Stanley Zaffos, an analyst at Gartner Inc. in Stamford, Conn., the bulletproof storage concept still has another five to 10 years before it's broadly embraced by users. But once it is, storage systems will require less maintenance and, therefore, cost less to maintain.
"We know how to build very reliable code. We use appliances every day that have software built into them that work forever: your automobile, your calculator, the disk drive in your PC, your telephone," Zaffos says.
But IBM is looking to attack far more complex systems than telephones or calculators.
Under its bulletproof initiative, IBM is addressing disk-sector failures that grow along with disk capacity. While disk capacities double every 12 to 18 months, uncorrectable read/write error rates haven't improved, nor has the probability of an uncorrectable error occurring on a disk read decreased. There are more sectors on today's disks and, therefore, a greater chance of an uncorrectable error.
The answer, Menon says, is to create self-healing capabilities for storage management software and more-robust RAID configurations.
IBM says that in about a year it will release storage systems that can support three simultaneous disk-drive failures in a single array by introducing additional parity disks into RAID configurations, offering many times the resiliency of a RAID configuration with two parity disks. Today, standard systems allow for only two disk failures.
But Zaffos argues that 80% of downtime today is caused by user error and software failures, not hardware failures. He says that the failures resulting from software are created by complexity and that there is an almost infinite number of failures that can occur in a complex system.
IBM is addressing those code failures with a software project called N-Version Programming, where two pieces of code in the same application save data and then compare the data to ensure that there are no errors.
- 15 Non-Certified IT Skills Growing in Demand
- How 19 Tech Titans Target Healthcare
- Twitter Suffering From Growing Pains (and Facebook Comparisons)
- Agile Comes to Data Integration
- Slideshow: 7 security mistakes people make with their mobile device
- iOS vs. Android: Which is more secure?
- 11 sure signs you've been hacked
- Using VM Archiving to Solve VM Sprawl This CommVault whitepaper discusses how archiving virtual machines can mitigate VM sprawl with a comprehensive approach to VM lifecycle management.
- Keep Your Network Available, Efficient and Secure Make the most of your network by working with experts who "get it." CDW and F5 have partnered to keep networks highly optimized....
- VCE Converged Infrastructure Enables Continuous Operation for Swiss Power Plant Read how Vblock™ Systems, running in active-active mode, enabled KKL to transform its twin data centers in just two months, enable continuous operations,...
- The Future of IT: A Customer First Approach Explore how customer-first policies can make use of social, mobile and cloud technologies to give workers the freedom and flexibility they desire to...
- Make or Break: New Auto Products Must Go To Market On Time This Webcast quantifies the value of time to market for the auto industry and highlights how Primavera Enterprise Portfolio Management can help organizations.
- IBM Flash Webcast: Optimizing your Datacenter for Efficient Storage & ROI Register for this webcast to learn the benefits of flash storage from IBM Customer, Leonardo Irastorza of Royal Caribbean Cruise Ltd and Storage... All Data Storage White Papers | Webcasts