The Story So Far: The History of RAID
Redundant Arrays of Inexpensive Disks turned out to be expensivebut dependable.
Computerworld - David A. Patterson led the team at the University of California, Berkeley, that developed the idea of RAID storage. In an interview with Frank Hayes, Patterson recalled the beginnings of his RAID project in 1987.
"We had just been working on RISC processors, and we consciously said, 'Processors are going to start getting fast, improving faster than they have in the past. So what are we going to do about I/O?' That was one motivation.
"The other one was that Randy Katz [one of Patterson's colleagues at Berkeley] got a Macintosh, and it had a hard disk in a separate box next to it. And he said, 'That's kind of interesting; here's a much smaller disk than I'm used to. What could we do with that as a building block?'
"So we held a graduate course where we started off with some rough ideas, and then we and the graduate students—Garth Gibson, Pete Chen, Ed Lee, Ann Chevernak, Ethan Miller—met and talked and read papers, and the ideas evolved from there.
"But when we tried to tell people our ideas, they couldn't understand. They'd say, 'Oh yeah, that's the same thing that IBM's been doing forever in terms of mirroring.' Or, 'Oh yeah, Thinking Machines, they've got a product in this area.' And so when we tried to explain things, they assumed what we'd done had already been subsumed by other work.
"That motivated us to write a paper ['The Case for Redundant Arrays of Inexpensive Disks']. It advocated that we should be replacing these big disks by lots of small disks. Basically, a big, relatively thick disk that has to spin fast is much less efficient than lots of small disks, and we get all these benefits in terms of volume and footprint and power. We submitted the paper to the database conference SIGMOD, and Garth Gibson [the lead graduate student on the project] and I went to a short course that was given at Santa Clara University by Al Hoagland, who was kind of the godfather of the disk industry. We came with 20 or 30 copies of our report and handed it out at that meeting, and that was a good thing to do. The paper just clicked. It was a good time, I guess, for that set of arguments.
"We built the RAID I [in 1989] to try the ideas in software. For RAID II [in 1993], we said, 'Let's try to build a high-performance I/O system that connects over a network.' Then at the end of the project, we had a little demo where we pulled the disk out and the thing kept working.
"We were still performance-oriented, thinking RAID was for performance, so we were shocked to see somebody write this up in Byte magazine. The PC community was obviously not so performance-oriented as it was dependability-oriented, and they thought, Hey, less-expensive dependable computing.
"It really just took off after that. EMC decided to build mainframe storage out of PC disks. Compaq had RAID early, and Data General. And of course IBM had its own. We didn't know IBM had its own RAID 5 set of ideas in the AS/400 line. IBM had completely independently done the same RAID part of the ideas but used large disks.
"One of the surprises about RAID was it was so expensive. The I in the name when we coined the term was for inexpensive disks. But the system was so expensive, that was kind of awkward for marketing people. So Randy blessed the change to independent for I. Since the RAID boxes weren't cheap, that was probably a better name.
"The current project I'm working on is ROC, Recovery-Oriented Computing. With the RAID stuff, we were always thinking performance, but obviously, dependability is the reason people are doing it. People get mad if their program crashes, but they just go berserk if they lose data. The ROC philosophy is recovering fast when outages happen. That's a different engineering ethic. Hardware will break, software has bugs, people will make mistakes. And if you believe that, then it makes sense to recover fast, rather than just try to make things that never break."
David Patterson's current work: the Recovery-Oriented Computing (ROC) Project
"Self-Repairing Computers," Scientific American, June 2003
David Patterson, An Oral History
- Editor's Note: The New Rules of Storage
- The Story So Far: The History of RAID
- Regulated Storage
- The Slow Move to Information Life-cycle Management
- IP Storage: Keeping a Safe Distance May Make Sense for Data Recovery
- Unpleasant Success
- iSCSI's early adopters
- The Almanac: Storage Briefs
- Serial vs. Parallel Storage
- Storage Careers: Thinking Outside the Box
- The Next Chapter: Predictions About Storage
- Storage Regulations Quiz
- Negotiating a storage deal? Improve the odds of success with these tips
- Data destruction: What they can't find can get you 20 years
- Readers share their stories
- Best iPhone, iPad Business Apps for 2014
- 14 Tech Conventions You Should Attend in 2014
- 10 Desktop Apps to Power Your Windows PC
- How to Add New Job Skills Without Going Back to School
- Slideshow: 7 security mistakes people make with their mobile device
- iOS vs. Android: Which is more secure?
- 11 sure signs you've been hacked
- OpenStack Hype vs. Reality: CIO Quick Pulse Open-source architecture can enable IT departments to build infrastructure-as-a-service (IaaS) clouds running on standard hardware.
- OpenStack and Red Hat: IDC White paper Most OpenStack deployments are by public cloud providers that are early adopters of technology and use OpenStack in a do-it-yourself deployment and support...
- Red Hat Enterprise Linux OpenStack Platform Datasheet Seamlessly transition to the cloud. Red Hat Enterprise Linux OpenStack Platform delivers an integrated foundation to create, deploy, and scale a secure and...
- Pay-as-you-Grow Data Protection: IBM Tivoli's Full-featured Data Protection Suite for Small to Medium Businesses IBM Tivoli Storage Manager Suite for Unified Recovery gives small and medium businesses the opportunity to start out with only the individual solutions...
- Make or Break: New Auto Products Must Go To Market On Time This Webcast quantifies the value of time to market for the auto industry and highlights how Primavera Enterprise Portfolio Management can help organizations.
- IBM Flash Webcast: Optimizing your Datacenter for Efficient Storage & ROI Register for this webcast to learn the benefits of flash storage from IBM Customer, Leonardo Irastorza of Royal Caribbean Cruise Ltd and Storage... All Data Storage White Papers | Webcasts
- OpenStack Hype vs. Reality: CIO Quick Pulse
- Red Hat Enterprise Linux OpenStack Platform Datasheet
- Streamline Data Protection with IBM Tivoli Storage Manager Operations Center
- Keep Your Network Available, Efficient and Secure
- The Future of IT: A Customer First Approach
- MyIT- Consumer Cool for Business Apps
- Get the Facts: LTO Tape Reliability Saves
- LTFS Hits the Mark in Media Entertainment: An In-Depth Introduction to LTFS for Digital Media
- Addressing the Broken State of Backup with a New Category of Disk-Based Backup Solutions
- Customer Analytics: The Role of Integrated Systems
- OpenStack and Red Hat: IDC White paper
- Pay-as-you-Grow Data Protection: IBM Tivoli's Full-featured Data Protection Suite for Small to Medium Businesses
- Using VM Archiving to Solve VM Sprawl
- VCE Converged Infrastructure Enables Continuous Operation for Swiss Power Plant
- MyIT- Consumer Cool for Business Apps
- Revisiting the Search for Long-Term Storage -- A TCO Analysis of Tape and Disk
- The Evolving Role of Disk and Tape
- Tape Fallacies Exposed -- The Future of Tape Is Still Bright
- The hidden costs of storage
- Cost/Benefit Case for IBM PureData System for Analytics