Oracle's Hadoop move could push rivals toward big data bundles
Analysts say Oracle's delivery of a Hadoop appliance could force IBM, HP and SAP to follow suit.
Computerworld - The shipping of Oracle's Big Data Appliance earlier this month could pressure major rivals like IBM, Hewlett-Packard and SAP to come up with Hadoop offerings that tightly bundle hardware, software and other tools, analysts say.
On the day the appliance shipped, Oracle announced that its new offering will run Cloudera's Apache Hadoop implementation and management software.
The tightly bundled Big Data Appliance includes Sun server hardware -- 18 Linux-based x86 Sun servers with 216 processor cores, 864GB of working memory and 648TB of raw disk storage -- along with an Oracle NoSQL database, an open-source distribution of R statistical software and a copy of Oracle's Java HotSpot Virtual Machine.
The appliance provides 40Gbps InfiniBand connectivity among the nodes, a rarity in Hadoop deployments, many of which use Ethernet to connect the nodes. Multiple racks can be tethered together in a cluster configuration, noted George Lumpkin, vice president of data warehousing product management at Oracle.
Lumpkin added that the company this month also started shipping Big Data Connector drivers for exchanging data between the Big Data Appliance and several other Oracle products, including Oracle Database 11g, the Exadata data warehousing and online transaction processing appliance and the new Exalytics business intelligence appliance.
Oracle unveiled the Big Data Appliance, along with the NoSQL database, at its OpenWorld user conference last fall.
David Menninger, an analyst with Ventana Research, said the offering will likely prompt IT managers to look more closely at bundled systems that could be "one-stop shops" for corporate data needs.
The Oracle system is designed to manage and analyze data sets, such as telemetry data, click-stream data or other log data, that are too large or are otherwise unsuitable for keeping in databases, Menninger said.
James Kobielus, an analyst at Forrester Research, said if IT managers do in fact take to the bundled Hadoop system, the other top tech vendors, many of which have already spent considerable sums on big data research and development, could be pushed to follow suit. "[Multiple bundled systems would be] good for the market. It gives customers a choice of commercially available products," he added.
Most enterprise Hadoop deployments have so far been custom built by internal IT engineers -- companies had to buy the hardware, license the software and integrate it on their own, Kobielus noted.
Kobielus said the growing popularity of the open-source Apache Hadoop framework should in time help to further increase the number of packaged big data options available to IT managers.
Menninger said Oracle's unexpected decision to use the Cloudera implementation of Hadoop could prove beneficial to corporate users -- and Oracle. When Oracle first announced the appliance, it was expected to run an Oracle distribution of Hadoop, he said.
The selection of an established implementation means that the vendor won't have to "fight to establish another distribution," Menninger added. "If Oracle had created its own distribution, it would have given the competitors a potential weakness to exploit."
Jackson is a reporter for the IDG News Service. Chris Kanaracus of the IDG News Service contributed to this story.
This version of this story was originally published in Computerworld's print edition. It was adapted from an article that appeared earlier on Computerworld.com.
Read more about Data Center in Computerworld's Data Center Topic Center.
- Hadoop for Dummies Today, organizations in every industry are being showered with imposing quantities of new information. Along with traditional sources, many more data channels and...
- The Top Five Ways to Get Started with Big Data Despite the increased focus on big data over the past few years, most organizations are still talking about what big data is rather...
- Data Warehouse Augmentation: The Queryable Data Store While organizations have, to date, been busy exploring and experimenting, they are now beginning to focus on using big data technologies to solve...
- The IBM Big Data Platform IBM is unique in having developed an enterprise class big data platform that allows you to address the full spectrum of big data...
- Live Webcast Best Practices: How to Improve Business Continuity with Virtualization VMware solutions include a range of business continuity capabilities to help ensure availability for applications across your virtualized environment. Learn More>>
- Cloud Knowledge Vault Learn how your organization can benefit from the scalability, flexibility, and performance that the cloud offers through the short videos and other resources...
- Endpoint Data Management: Protecting the Perimeter of the Internet of Things Not surprisingly, "Internet of Things" (IoT) and Big Data present new challenges AND opportunities for enterprise IT. Teams need to harness, secure and... All Data Center White Papers | Webcasts