Berkeley Developing Worldwide Storage System
Computerworld - The University of California at Berkeley is looking to create a data storage network that encompasses the planet.
OceanStore is a research project at the university that would use software to break data into many tiny, encrypted parts and store them across a vast array of Web servers owned by Internet service providers around the world.
A vast redundant storage network such as OceanStore would afford easy access to data from anywhere and unprecedented levels of disaster recovery, according to its inventor, John Kubiatowicz. If more than one computer or server were to crash, OceanStore would be able to rebuild the information using pieces stored in multiple clusters on other servers.
OceanStore would track documents by assigning each one a globally unique identification (GUID) tag before it's split into fragments and sent over the Internet to be stored randomly throughout the network.
"You would maybe spread 64 fragments of a document around, and maybe 16 of those can be used to reconstruct [the original document]," Kubiatowicz said. "We're assuming a system the scale of OceanStore will have pieces of it broken all the time."
For example, to retrieve a chopped-up 1989 tax return, OceanStore would send intelligent agents onto the Internet looking for a GUID tag, he said. As the messenger agents search, they would leave behind trails of digital bread crumbs so that the next time, the agents could find the data more quickly.
More frequently used data would be stored on nearby servers to cut down on latency.
Consumers who want to save their documents on OceanStore would pay a monthly fee to an Internet service provider, which would then arrange to redundantly store the data on another Internet provider's server for a small fee.
Neal Goldman, a research analyst at The Yankee Group in Boston, said there are inherent problems associated with OceanStore. For example, enterprises probably wouldn't store mission-critical data on a Web-based system.
"You don't know what the bandwidth is between you and the piece of data you want, so there are some real issues as far as performance," Goldman said.
The project has received about $500,000 in seed funding from vendors including IBM, Nortel Networks Corp. in Brampton, Ontario, EMC Corp. in Hopkinton, Mass., and federal agencies such as the National Science Foundation and the Defense Advanced Research Projects Agency, both in Arlington, Va., Kubiatowicz said.
Read more about Data Storage in Computerworld's Data Storage Topic Center.
- Mobile Content, Collaboration & IDC's 3rd IT Platform: The Next Frontier for the Mobile Enterprise IDC focuses this article on talks about the new IT platform. This 3rd IT Platform will be the new wave for about the...
- 2014 Magic Quadrant for Enterprise Backup Software and Integrated Appliances Enterprise backup is among the oldest, most-performed tasks for IT professionals. Gartner provides analysis and evaluation of the leading providers that offer a...
- Data Warehousing: modern ecosystems for big data & analytics Five years ago, IBM observed that the planet was becoming more instrumented, interconnected and intelligent. Some 20,000 engagements later, here's what we've learned...
- IBM PureData System for Analytics compared with Teradata This report from ITG compares the cost and time to value of IBM PureData System for Analytics with that of Teradata DW Appliance...
- The Key to Happiness: Throw out Your Data Warehouse In this webinar, Kerry Reitnauer, Director, Solution Architect at FairPoint Communications will discuss the challenges the data warehouse brought, how they migrated to...
- The Foundation You Need to Build a Better Storage Infrastructure Watch this webcast to hear how you can maximize the economics of your data center by modifying your storage footprint and power usage... All Data Storage White Papers | Webcasts