Skip the navigation
)
News

Microsoft researchers: NoSQL needs standardization

By Joab Jackson
April 5, 2011 04:49 PM ET

IDG News Service - The ever-growing number of non-relational, or NoSQL, databases needs standardization in order to thrive, two Microsoft researchers argue in the new issue of the Association for Computing Machinery's flagship publication, Communications.

"The nascent NoSQL market is extremely fragmented, with many competing vendors and technologies. Programming, deploying, and managing NoSQL solutions requires specialized and low-level knowledge that does not easily carry over from one vendor's product to another," the two researchers, Erik Meijer and Gavin Bierman, write in a paper published in the April issue of Communications.

The pair of researchers offer a mathematical data model and standardized query language that could be used to unify NoSQL and SQL data models, work they call "coSQL."

"There is little to disagree with in this paper," said James Phillips, a co-founder and vice president of products for NoSQL database vendor Couchbase, who had no involvement in the work. "I firmly support the conclusion that a standardized data manipulation language would accelerate market adoption of NoSQL database technologies by eliminating developer-impacting fragmentation."

Over the past few years, a variety of non-relational databases has emerged, including CouchDB, Cassandra and MongoDB. Administrators have found these new data stores more suitable than relational databases for tasks such as storing large amounts of data across multiple servers, or for easily storing information that does not need to be indexed for complex querying.

Meijer and Bierman compare this current flourish of non-relational databases to the proliferation of relational databases in the early 1970s. At that time, developers would have to understand the peculiarities of each database, as well as how to interact with the underlying hardware. What unified this industry was the widespread adoption of SQL (Structured English Query Language), the researchers argue.

SQL was an implementation of Edgar F. Codd's relational model, which provided an algebraic basis for modeling databases. The mathematical model assured that all SQL databases would return the same results to the same queries, given the same data. And because most of the database vendors such as IBM adopted the model, programmers could just learn SQL, rather than a new language for each database.

Meijer and Bierman claim that NoSQL could benefit from the same standardization. "Just as Codd's discovery of relational algebra as a formal basis for SQL ... propelled a billion-dollar industry around SQL, we believe that our categorical data-model formalization and monadic query language will allow the same economic growth to occur for coSQL key-value stores," they write.

The researchers also cast doubt on the widely held assumption that NoSQL databases are uniquely suited to tasks of storing large amounts of data, or Big Data as it is known. "It is possible to scale SQL databases by careful partitioning," they write.

"Despite common wisdom, SQL and coSQL are not diabolically opposed, but instead deeply connected via beautiful mathematical theory," they write.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

Reprinted with permission from IDG.net. Story copyright 2012 International Data Group. All rights reserved.
What is Tech Briefcase?
TechBriefcase is a new, free service where IT Professionals can Search, Store and Share IT white papers and content like this. Learn more
Bookmark content
Speed up your research efforts with content across the web.
Search and Store
Find the white papers you need. Create folders for any topic.
View Anywhere
Open your briefcase on your iPhone, tablet or desktop. Share with colleagues.
Don't have an account yet?
Additional Resources
Security KnowledgeVault
WHITE PAPER
Security is not an option. This KnowledgeVault Series offers professional advice how to be proactive in the fight against cybercrimes and multi-layered security threats; how to adopt a holistic approach to protecting and managing data; and how to hire a qualified security assessor. Make security your Number 1 priority.

Read now.

Cut Communications Costs Once and for All
WHITE PAPER
New IP-based communications systems are being deployed by small and midsized businesses at a rapid rate. Learn how these organizations are enabling faster responsiveness, creating better customer experiences, speeding office or mobile interactions, and dramatically reducing existing communications costs.

Read now.

Databases White Papers
Measuring the Business Value of CI in the Data Center
One of the key strategies that IT teams are pursuing to reduce capital costs while boosting asset utilization and employee productivity is the...
The Different Types of UPS Systems
There is much confusion in the marketplace about the different types of UPS systems and their characteristics. Each of these UPS types is...
SAS High Performance Analytics
This paper explains how you can shrink decision times from days to seconds to quickly respond to changing business conditions.
Drive Your Business with Predictive Analytics
Predictive analytics has the power to significantly improve the bottom line. From better targeting and risk assessment to streamlining operations and optimizing business...
The Analytical SMB: More Data, More Users, Less Time
This Aberdeen Research Brief examines the key trends in business analytics and the tangible business impact effective analytics can have for SMBs.
All Databases White Papers
Databases Webcasts
Oracle Database Appliance Best Practices
Business users increasingly demand 24x7 availability of their data while IT departments face the challenge of ensuring maximum availability while operating with limited...
Accelerate Document Processing and Wow Your Customers
Learn how intelligent imaging and BPM solutions, coupled with pragmatic best practices and methodology, can improve productivity, lower cost, increase accuracy, reduce cycle...
Distributed Database Security with Real-time Monitoring
View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
InfoSphere Warehouse Packs Demo
These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
Delivery Management -- Extending Lifecycle Management
Date: Wednesday, June 20, 2012, 1:00 PM EDT

Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,...
All Databases Webcasts
Newsletter Sign-Up

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all newsletters | Privacy Policy
IT Jobs