Relational database pioneer says technology is obsolete

Michael Stonebraker blogs not to praise RDBMSes but to bury them

September 6, 2007 12:00 PM ET

Computerworld - As a researcher at the University of California, Berkeley, starting in the early 1970s, Michael Stonebraker co-created Ingres and, later, Postgres, the technology that underlies many leading relational databases today: Microsoft Corp.'s SQL Server, Sybase Inc.'s Adaptive Server Enterprise, Ingres Corp.'s eponymous product, IBM's Informix, and others.

But Stonebraker now argues that relational databases, also known as RDBMSes, are "long in the tooth" and "should be considered legacy technology."

In an entry Tuesday at a new blog, The Database Column, Stonebraker also argued that today's relational databases lag badly in performance behind a new wave of databases that flip database tables 90 degrees.

Column-oriented databases -- such as the one built by Stonebraker's latest start-up, Andover, Mass.-based Vertica Systems Inc. -- store data vertically in table columns rather than in successive rows.

By putting similar data together, column-oriented databases minimize the time spent reading from disk, a saving that adds up when executing large-scale calculations such as those typically done in a data warehouse.
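The difference between the two layouts can be illustrated with a small Python sketch. This is not Vertica's or any vendor's implementation; the three-row sales table and the function names are hypothetical, and in-memory lists stand in for data on disk. It only shows why an aggregate over one column touches fewer values when that column is stored contiguously.

```python
# Hypothetical three-row sales table, used only for illustration.
rows = [
    {"order_id": 1, "region": "East", "amount": 120.0},
    {"order_id": 2, "region": "West", "amount": 75.5},
    {"order_id": 3, "region": "East", "amount": 210.0},
]

# Row layout: the values of one record sit next to each other.
row_store = [tuple(r.values()) for r in rows]

# Column layout: the values of one column sit next to each other.
column_store = {name: [r[name] for r in rows] for name in rows[0]}


def total_amount_row_store(store):
    """Sum 'amount' by reading every field of every record."""
    values_touched = 0
    total = 0.0
    for record in store:
        values_touched += len(record)  # the whole record is read
        total += record[2]             # 'amount' is the third field
    return total, values_touched


def total_amount_column_store(store):
    """Sum 'amount' by reading only the 'amount' column."""
    column = store["amount"]
    return sum(column), len(column)


print(total_amount_row_store(row_store))
print(total_amount_column_store(column_store))
```

Both functions return the same total, but the row-oriented version reads every field of every record while the column-oriented version reads only the values it needs.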

Column databases "will take over the warehouse market over time, completely displacing row stores," Stonebraker wrote. "Since many warehouse users are in considerable pain (can't load in the available load window, can't support ad-hoc queries, can't get better performance without a 'fork-lift' upgrade), I expect this transition to column stores will occur fairly quickly."

Column-oriented database systems are not new. Sybase has successfully sold its column-based IQ database for years as a high-performance business intelligence solution.

More recently, BigTable, the database that Google Inc. built to handle a number of its applications, stores data in columns.

But column-oriented databases remain a niche offering. By contrast, the leading players in the mainstream database market, which is estimated at $15 billion annually worldwide, all rely on systems using row-based tables.

Organizing data by rows does have its advantages. Writing data to disk in row format is faster than doing so by columns. That is key for high-transaction database applications where data is constantly being read and written to the database, though markedly less important for data warehouses, where data is typically written just once and accessed many times after that.
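A rough way to see the write-side trade-off is to count how many separate places an insert has to touch. The Python sketch below is a simplified illustration under that assumption, with hypothetical function names; real systems buffer and batch writes, so actual costs vary.

```python
def insert_row_store(store, record):
    """Append the whole record in one place; returns the number of writes."""
    store.append(tuple(record))
    return 1


def insert_column_store(store, record):
    """Append each field to its own column; returns the number of writes."""
    writes = 0
    for column, value in zip(store, record):
        column.append(value)
        writes += 1
    return writes


row_store = []
column_store = [[], [], []]  # one list per column: order_id, region, amount

record = (1, "East", 120.0)
print(insert_row_store(row_store, record))
print(insert_column_store(column_store, record))
```

In this simplified model, a single insert costs one write in the row layout and one write per column in the column layout, which is why write-heavy transactional workloads tend to favor rows.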

Stonebraker, who is a co-founder and chief technology officer of Vertica, claims that his latest start-up has other performance-boosting features, such as very aggressive data compression and a query executor that "runs against compressed data."
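The article does not say which compression scheme Vertica uses. As an illustration of what running a query against compressed data can mean, the Python sketch below assumes run-length encoding, a common choice for columns with long runs of repeated values, and counts matching rows without decompressing. The column contents and function names are hypothetical.

```python
from itertools import groupby


def rle_encode(values):
    """Compress a column into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def count_matching(encoded, target):
    """Count rows equal to target by reading only the run lengths."""
    return sum(length for value, length in encoded if value == target)


# Hypothetical 'region' column; sorted or clustered data compresses best.
region_column = ["East", "East", "East", "West", "West", "East"]

encoded = rle_encode(region_column)
print(encoded)
print(count_matching(encoded, "East"))
```

Here six stored values shrink to three pairs, and the count is computed from those pairs alone, so the query does less work as the compression ratio improves.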

As a result, "Vertica beats all row stores on the planet -- typically by a factor of 50," he wrote. "The only engines that come closer are other column stores, which Vertica typically beats by around a factor of 10."

Stonebraker says other firms similar to Vertica can do just as well.


