Open source and SaaS offerings rethink the DB
InfoWorld - The world of low-rent key-value storage silos is exploding. Here's a list of some of the more prominent new projects:
Thrift, a tool that Facebook.com built and donated to the Apache software project, isn't really a database. In fact, it's more of a pre-compiler that converts a file describing the data structures into a pile of code in your choice of languages. This code may need some extra libraries (Java comes with some methods that serialize the data to an output stream), but it's ready to include with your own code. The project includes formatters for many of the major languages (including C, Perl, Java, PHP, and Python), and it aims to honor the various idioms used by the programmers familiar with a language. You give it the schema and it does the rest.
[ In this brave, new world of databases, find out whether Amazon SimpleDB, Apache CouchDB, Google App Engine, or Persevere come out on top in InfoWorld Test Center's review. ]
ThruDB is a layer of glue that joins together the Lucene text indexing library with Thrift and a good set of tools for storing the data in the cloud. One of the more sophisticated options will store the data locally on a disk for fast storage while sending out a duplicate set of transactions to Amazon S3 (Simple Storage Service). There's no official release yet, so you'll need to grab the code from the Subversion repository.
MongoDB accepts JSON (JavaScript Object Notation) pairs of keys and values for storage across several nodes and then builds a full text index of the values, making it possible to search deep inside of data structures. 10gen, the company that launched the MongoDB open source project, is also spinning off an app server, Babble, that sits on top of it. Auto-sharding -- that is, shared nothing partitioning across many distributed servers -- is said to be arriving in the second quarter of 2009.
There are some tools designed to add horizontal partitioning to traditional databases. Hibernate Shards and HiveDB can work together to direct queries to the right server in a cluster of databases. They work with any database with a Hibernate interface.
Harry Potter readers may hate the name, but Project Voldemort is a "big, distributed, persistent, fault-tolerant hash table." You put in key-value pairs and they get serialized to disk somewhere in a cloud. There are no queries. You can't ask for fields within a certain range. You just ask for a key and get the result. When data is stored to two different nodes at once, as might happen in an ACID-free environment like this, Voldemort uses a vector clock to choose a winner. It's very, very lightweight. You might even try to make a joke about the real Voldemort's shadow-like presence.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- X-Ray of the PCI Process-4 Proactive Steps
- This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
- Forrester: Economic Impact of Switching to Google Apps
- Content provided by Google
Read this Forrester report on the "total economic impact" of Google Apps, and learn how switching to Google Apps creates... - Intelligent Systems: Unlocking Hidden Business Value with Data
- An intelligent system enables data to flow across an enterprise infrastructure, spanning the devices where valuable data is gathered from employees and customers,...
- Concepts of NonStop SQL/MX
- For DBAs and developers who are familiar with Oracle solutions and want to learn about NonStop SQL/MX, this whitepaper provides an overview of...
- HP Advanced Information Services for SAP In-Memory Appliance (SAP HANA)
- Organizations are eager to connect the vast amounts of data available within and outside their businesses to compete more effectively and make better... All BI and Analytics White Papers
- Quantifying the Business Value of VMware View - Webcast
- Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price...
- Good to Great - How to Take Business Analytics to the Next Level
- By attending this webcast you will learn how you can implement an effective BA strategy that will deliver maximum strategic value to your...
- Supporting Mobile Productivity With A Limited IT Budget
- Join us and hear from Kaseya mobile IT management experts as we discuss core strategies for supporting the mobile revolution on a shoestring...
- User Experience Monitoring
- In this webinar, you will learn hints & tips for improving end-user response times from Forrester Research analyst, Jean-Pierre Garbani.
- Hints & Tips Cisco
- Overwhelmed by tracking your Vblock, Flexpod or Cisco UCS performance? Spend one hour with Nimsoft to learn how you can eliminate the overhead... All BI and Analytics Webcasts