Hadoop finds niche alongside conventional database systems
Computerworld - The growing need for companies to manage surging volumes of structured and unstructured data is continuing to propel enterprise use of open-source Apache Hadoop software.
But instead of replacing existing technologies, Hadoop appears to be working alongside conventional relational database management systems (RDBMS), according to a Ventana Research report released late last month.
Hadoop is designed to help companies manage and process petabytes of data. The technology's appeal lies in its ability to break up very large data sets into smaller data blocks that are then distributed across a cluster of commodity hardware for faster processing.
Early adopters, including Facebook, Amazon, eBay and Yahoo, use Hadoop to analyze petabytes of unstructured data that conventional RDBMS setups couldn't handle easily. Ventana's report, based on a survey of more than 160 companies, shows that a growing number of businesses have begun putting Hadoop to use for similar purposes.
The survey found that most of those companies are using Hadoop to collect and analyze huge volumes of unstructured and machine-generated information, such as log and event data, search-engine results and content from social media sites, said David Menninger, author of the Ventana report.
"In two-thirds of the cases, we found that people are using Hadoop for advanced analytics and for types of analysis that they were not doing before," he said.
The technology is much less likely to be used for analyzing conventional structured data such as transaction data, customer information and call records, where traditional RDBMS tools still appear to have an edge, Menninger said.
Despite Hadoop's early promise, the study said, enterprises that use it still face challenges related to issues such as security, clustering and a shortage of people with Hadoop skills.
This version of this story was originally published in Computerworld's print edition. It was adapted from an article that appeared earlier on Computerworld.com.
Read more about Applications in Computerworld's Applications Topic Center.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Establishing a Strategy for Database Security is No Longer Optional
- The options for securing increasingly valuable databases are very broad and deep, and can be confusing. This research provides an overview of three...
- Driving Secure Enterprise File Sharing and Syncing in the Enterprise
- GroupLogic's new activEcho is the industry's only secure Enterprise File Sharing and Synching solution that balances the need for simplicity for the end...
- The Enterprise File Sharing Option
- Enterprises and IT departments need to address several critical security issues when considering file sharing and syncing products. Many of today's solutions do...
- Activities Streams Base An Integrated Social Layer
- The enterprise social software market is exploding thanks to converging trends of consumerization, cloud, and mobile. In this must-read report, "The Forrester Wave:...
- Converged Infrastructure for Dummies
- As you know, everything is mobile, connected, interactive, and immediate. This is exactly why organizations need a highly agile IT infrastructure in order... All Applications White Papers
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- Operational Analytics - Changing the Competitive Dynamics of the Business
- Date/Time: June 5, 2012, 11:00 a.m., EDT, 4:00 p.m. BST / 3:00 p.m. UTC
Please join us for this webcast, as Dr. Barry... - Oracle Database Appliance Best Practices
- Business users increasingly demand 24x7 availability of their data while IT departments face the challenge of ensuring maximum availability while operating with limited... All Applications Webcasts