Stanford researchers may have world's largest database
Computerworld - Experts at the Stanford Linear Accelerator Center (SLAC) at Stanford University said they believe they may have the biggest database in the world.
The database recently passed the 500TB mark, and "as far as I can see," that makes it the largest such repository in the world by far, said database manager Jacek Becla. The database began storing data in 1999.
The 500TB of data in the BaBar database, if printed out, would fill 1 billion books, according to a statement released by SLAC. That's nearly 60 times the number of books in the Library of Congress, the largest library in the world.
The database, which collects information about subatomic particle collisions, is used by 600 physicists from nine nations taking part in the BaBar research project, Becla said yesterday. BaBar's goal is to understand the difference between matter and antimatter and how it shaped the universe. The project has adopted Babar, the elephant from the popular children's stories, as its mascot, although the name really comes from B-bar, a type of particle some of the scientists study.
Becla said each collision generates about 30KB of raw data. Not all of the collisions are recorded -- "only the interesting ones," he added.
Becla said that caring for all the data has posed a disaster recovery challenge. But, he said, the key to solving that problem is simple: back up everything.
Most of the data is stored as read-only, he said. The data is also backed up on tape almost as soon as it's stored.
BaBar's elephant-like memory is further aided by the various research groups around the world that are taking part in the project. Becla said each group backs up its own data, so if anything happened to BaBar, it could be reconstructed.
Becla said most of the database runs on CPUs from Sun Microsystems Inc., but SLAC has recently begun to invest in a number of Linux boxes. The project has used more than 100 servers spread over a number of different server farms.
The center and the BaBar project are funded by the U.S. Department of Energy.
Related story:
- Taming data chaos, April 15, 2002
Read more about Business Continuity in Computerworld's Business Continuity Topic Center.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- An Interactive Guide: Bring Your Own Device
- BYOD presents significant security and management challenges to IT departments who want to take advantage of the trend, but still protect corporate assets....
- Malware Security Report: Protecting Your Business, Customers, and the Bottom Line
- Protect your business and customers by understanding the threat from malware and how it can impact your online business. This paper highlights how...
- Security Predictions for 2012
- With all of the crazy 2011 security breaches, exploits and notorious hacks, what can we expect for 2012? Last year's Websense Security Labs...
- Overcome Top 7 Admin Challenges of Active Directory
- As Active Directory's role in the enterprise has drastically increased, so has the need to secure the data. Gain insight on creating repeatable,...
- Insiders Can Ruin Your Company. Take Action.
- Did you know that 80 percent of threats to an organization come from the inside? The threat from insiders is often overlooked in... All Business Continuity White Papers
- Data Protection and Information Governance
- Today, legal hold and information governance are increasingly becoming drivers for data protection. However, few organizations knows what information they have, where to...
- Data Protection and Disaster Recovery with iSCSI and VMware
- Get this on demand webcast now
- Optimizing Networks for the Cloud
- Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
- Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
- Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
- Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
- Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and... All Business Continuity Webcasts