Ads by TechWords

See your link here
Receive the latest technology news and information.
Storage
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Stanford researchers may have world's largest database

April 17, 2002 12:00 PM ET

Computerworld - Experts at the Stanford Linear Accelerator Center (SLAC) at Stanford University said they believe they may have the biggest database in the world.
The database recently passed the 500TB mark, and "as far as I can see," that makes it the largest such repository in the world by far, said database manager Jacek Becla. The database began storing data in 1999.
The 500TB of data in the BaBar database, if printed out, would fill 1 billion books, according to a statement released by SLAC. That's nearly 60 times the number of books in the Library of Congress, the largest library in the world.
The database, which collects information about subatomic particle collisions, is used by 600 physicists from nine nations taking part in the BaBar research project, Becla said yesterday. BaBar's goal is to understand the difference between matter and antimatter and how it shaped the universe. The project has adopted Babar, the elephant from the popular children's stories, as its mascot, although the name really comes from B-bar, a type of particle some of the scientists study.
Becla said each collision generates about 30KB of raw data. Not all of the collisions are recorded -- "only the interesting ones," he added.
Becla said that caring for all the data has posed a disaster recovery challenge. But, he said, the key to solving that problem is simple: back up everything.
Most of the data is stored as read-only, he said. The data is also backed up on tape almost as soon as it's stored.
BaBar's elephant-like memory is further aided by the various research groups around the world that are taking part in the project. Becla said each group backs up its own data, so if anything happened to BaBar, it could be reconstructed.
Becla said most of the database runs on CPUs from Sun Microsystems Inc., but SLAC has recently begun to invest in a number of Linux boxes. The project has used more than 100 servers spread over a number of different server farms.
The center and the BaBar project are funded by the U.S. Department of Energy.
Related story:



Jump to comments

Disaster Recovery

Additional Resources

Xerox
By using solid ink technology only from Xerox, you could save up to 65% by printing color for the cost of black and white. Enter for a chance to WIN a PhaserTM 8860 network color printer!
Microsoft
Save time and mitigate security risk. Deploy it now.
Sybase
In this white paper, IDC analyzes the role of next-generation mobile enterprise platforms as organizations seek a more strategic deployment of mobile solutions.

Learn the important issues you must consider before starting your next mobility initiative. Get your mobility white paper from IDC now, compliments of Sybase.

White Papers & Webcasts

Connecting to the Cloud with F5 and VMware VMotion
F5 and VMware partner to enable live application and storage migrations between datacenters and clouds, over short or long distances.  

Data Protection is not an insurance policy -you cannot buy-back lost data
Find out why you need to maintain access to critical information to run your business and remain competitive.

SiliconFS - The BlueArc Filesystem
Learn the power of the BlueArc family of products to enterprise storage management features, providing real value for its customers.  

Strategic ECM Webinar
Learn what new strategic business benefits can be realized through ECM!

Enabling Enterprise Class Features for the Mid-Range
Learn how BlueArc's new storage platform, BlueArc Mercury™, scales in fixed increments that make it easy to install and deploy, scales up to...  

Tabor Research: NFS Evolution Changes the Landscape of HPC Data Management
A hybrid file system combining the benefits of standard NFS and the performance and scale of parallel file systems.  

5 Architecture Issues that Impact BES performance
Register to attend this LIVE Webinar to learn 5 Architecture Issues that Impact BES performance!

Intelligent Tiered Storage: BlueArc's Implementation
This ESG White Paper discusses the importance of tiered storage, examines BlueArc's approach to intelligent tiering, and shows how it creates operational value...  

Four Principles for Reducing Storage TCO
View cost reduction strategies in this video! Provided by Hitachi Data Systems.