Berkeley Developing Worldwide Storage System
Computerworld - The University of California at Berkeley is looking to create a data storage network that encompasses the planet.
OceanStore is a research project at the university that would use software to break data into many tiny, encrypted parts and store them across a vast array of Web servers owned by Internet service providers around the world.
A vast redundant storage network such as OceanStore would afford easy access to data from anywhere and unprecedented levels of disaster recovery, according to its inventor, John Kubiatowicz. Even if several computers or servers were to crash at once, OceanStore would be able to rebuild the information from pieces stored in clusters on other servers.
OceanStore would track documents by assigning each one a globally unique identification (GUID) tag before it's split into fragments and sent over the Internet to be stored randomly throughout the network.
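The tagging and scattering step can be pictured with a short Python sketch. It is only an illustration under stated assumptions: the article does not say how OceanStore generates its GUIDs, so a random UUID stands in here; the `Fragment`, `split_document`, `place_fragments` and `reassemble` names are hypothetical; and the encryption and redundancy the project describes are left out.

```python
import random
import uuid
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    guid: str       # tag shared by every fragment of one document
    index: int      # position of this fragment within the document
    payload: bytes  # this fragment's slice of the document


def split_document(data, fragment_count):
    """Tag a document with a GUID, then cut it into fragment_count slices."""
    guid = str(uuid.uuid4())  # assumption: a random UUID stands in for the GUID
    size = max(1, -(-len(data) // fragment_count))  # ceiling division
    return [
        Fragment(guid, i, data[i * size:(i + 1) * size])
        for i in range(fragment_count)
    ]


def place_fragments(fragments, servers, rng):
    """Assign each fragment to a randomly chosen server."""
    placement = {}
    for fragment in fragments:
        placement.setdefault(rng.choice(servers), []).append(fragment)
    return placement


def reassemble(placement, guid):
    """Collect every fragment carrying the GUID and join them in order."""
    found = [f for held in placement.values() for f in held if f.guid == guid]
    return b"".join(f.payload for f in sorted(found, key=lambda f: f.index))


if __name__ == "__main__":
    document = b"1989 tax return " * 20
    fragments = split_document(document, 8)
    servers = ["isp-a", "isp-b", "isp-c", "isp-d"]  # hypothetical server names
    placement = place_fragments(fragments, servers, random.Random())
    print(reassemble(placement, fragments[0].guid) == document)
```

In this simplified version every fragment is needed to rebuild the document; the redundancy Kubiatowicz describes would require an encoding step that the sketch omits.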
"You would maybe spread 64 fragments of a document around, and maybe 16 of those can be used to reconstruct [the original document]," Kubiatowicz said. "We're assuming a system the scale of OceanStore will have pieces of it broken all the time."
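The arithmetic behind those figures can be worked through in a few lines of Python. This is a back-of-the-envelope sketch, not OceanStore's own model: it assumes each fragment is one-sixteenth the size of the document, that fragments fail independently of one another, and that the per-fragment failure rates tried below are hypothetical values chosen for illustration.

```python
from math import comb


def storage_overhead(total_fragments, needed_fragments):
    """Space used relative to the original, if each fragment is 1/needed of it."""
    return total_fragments / needed_fragments


def recoverable_probability(total_fragments, needed_fragments, p_unavailable):
    """Chance that at least needed_fragments of total_fragments are reachable.

    Assumes each fragment is unreachable independently with probability
    p_unavailable, so the number of reachable fragments is binomial.
    """
    p_ok = 1.0 - p_unavailable
    return sum(
        comb(total_fragments, i)
        * p_ok ** i
        * p_unavailable ** (total_fragments - i)
        for i in range(needed_fragments, total_fragments + 1)
    )


if __name__ == "__main__":
    print("overhead:", storage_overhead(64, 16))
    for p in (0.1, 0.3, 0.5):  # hypothetical failure rates
        print(p, recoverable_probability(64, 16, p))
```

Under these assumptions the document survives as long as no more than 48 of its 64 fragments are unreachable at once, at the cost of storing four times the original data.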
For example, to retrieve a chopped-up 1989 tax return, OceanStore would send intelligent agents onto the Internet looking for the document's GUID tag, he said. As the messenger agents search, they would leave behind trails of digital bread crumbs so that agents could find the data more quickly the next time.
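The bread-crumb idea can be sketched as a cache of location hints left along the search path. The Python below is a toy model under loud assumptions: the servers form a simple chain, each `Node` knows only one neighbor, and the `Node` and `lookup` names are hypothetical. The article does not describe OceanStore's actual routing.

```python
class Node:
    """One server in a hypothetical chain of servers."""

    def __init__(self, name, neighbor=None):
        self.name = name
        self.neighbor = neighbor  # next server to ask when nothing better is known
        self.stored = set()       # GUIDs whose data this server holds
        self.crumbs = {}          # GUID -> server known to hold that data


def lookup(start, guid):
    """Search for guid starting at start; return (holder, hops taken)."""
    visited = []
    node, hops = start, 0
    while node is not None:
        if guid in node.stored:
            # Leave a bread crumb on every server passed on the way here.
            for passed in visited:
                passed.crumbs[guid] = node
            return node, hops
        visited.append(node)
        # Follow a crumb if one exists, otherwise ask the next neighbor.
        node = node.crumbs.get(guid, node.neighbor)
        hops += 1
    return None, hops


if __name__ == "__main__":
    d = Node("d")
    c = Node("c", d)
    b = Node("b", c)
    a = Node("a", b)
    d.stored.add("tax-return-1989")  # hypothetical GUID
    print(lookup(a, "tax-return-1989")[1])  # hops on the first search
    print(lookup(a, "tax-return-1989")[1])  # hops once crumbs are in place
```

The second search follows the crumb left at the first server and skips the intermediate hops, which is the speedup the article describes.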
More frequently used data would be stored on nearby servers to cut down on latency.
Consumers who want to save their documents on OceanStore would pay a monthly fee to an Internet service provider, which would then arrange to redundantly store the data on another Internet provider's server for a small fee.
Neal Goldman, a research analyst at The Yankee Group in Boston, said there are inherent problems associated with OceanStore. For example, enterprises probably wouldn't store mission-critical data on a Web-based system.
"You don't know what the bandwidth is between you and the piece of data you want, so there are some real issues as far as performance," Goldman said.
The project has received about $500,000 in seed funding, Kubiatowicz said. The money came from vendors including IBM, Nortel Networks Corp. in Brampton, Ontario, and EMC Corp. in Hopkinton, Mass., and from federal agencies such as the National Science Foundation and the Defense Advanced Research Projects Agency, both in Arlington, Va.