Stanford researchers may have world's largest database

April 17, 2002 12:00 PM ET

Computerworld - Experts at the Stanford Linear Accelerator Center (SLAC) at Stanford University say they may have the biggest database in the world.
The database, which began storing data in 1999, recently passed the 500TB mark. "As far as I can see," that makes it the largest such repository in the world by far, said database manager Jacek Becla.
The 500TB of data in the BaBar database, if printed out, would fill 1 billion books, according to a statement released by SLAC. That's nearly 60 times the number of books in the Library of Congress, the largest library in the world.
The database, which collects information about subatomic particle collisions, is used by 600 physicists from nine nations taking part in the BaBar research project, Becla said yesterday. BaBar's goal is to understand the difference between matter and antimatter and how it shaped the universe. The project has adopted Babar, the elephant from the popular children's stories, as its mascot, although the name really comes from B-bar, a type of particle some of the scientists study.
Becla said each collision generates about 30KB of raw data. Not all of the collisions are recorded -- "only the interesting ones," he added.
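As a rough sanity check on the figures quoted above, the short Python sketch below works through the arithmetic. It assumes decimal units (1KB = 1,000 bytes, 1TB = 10^12 bytes), and the variable names are illustrative only. The event count is an upper bound that assumes the entire database were raw collision data, which the article does not claim.

```python
# Back-of-envelope arithmetic on the figures quoted in the article.
# Assumption: decimal units (1 KB = 1,000 bytes, 1 TB = 10**12 bytes).
KB = 10**3
TB = 10**12

database_bytes = 500 * TB    # size of the BaBar database
books = 1_000_000_000        # "1 billion books" from the SLAC statement
raw_event_bytes = 30 * KB    # raw data per recorded collision

# Implied size of one printed "book".
bytes_per_book = database_bytes // books

# "Nearly 60 times" the Library of Congress implies roughly this many books there.
implied_loc_books = books / 60

# Upper bound on recorded collisions, ASSUMING every byte were raw event data.
max_events = database_bytes // raw_event_bytes

print(f"Bytes per book: {bytes_per_book:,}")
print(f"Implied Library of Congress holdings: about {implied_loc_books:,.0f} books")
print(f"Upper bound on recorded collisions: {max_events:,}")
```

Under these assumptions, each "book" works out to about 500KB of text, and the database could hold at most roughly 16.7 billion raw collision records.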
Becla said that caring for all the data has posed a disaster recovery challenge. But, he said, the key to solving that problem is simple: back up everything.
Most of the data is stored as read-only, he said. The data is also backed up on tape almost as soon as it's stored.
BaBar's elephant-like memory is further aided by the various research groups around the world that are taking part in the project. Becla said each group backs up its own data, so if anything happened to the BaBar database, it could be reconstructed.
Becla said most of the database runs on CPUs from Sun Microsystems Inc., but SLAC has recently begun to invest in a number of Linux boxes. The project has used more than 100 servers spread over a number of different server farms.
The center and the BaBar project are funded by the U.S. Department of Energy.