Intel releases Hadoop software primed for its own chips
Hadoop software makes the most of capabilities in Intel Xeon processors, vendor says
IDG News Service - Intel has released its own Hadoop distribution in a move intended to accelerate adoption of the big data platform while ensuring more of those workloads run on Intel's own Xeon processors.
The Intel Distribution for Apache Hadoop includes core pieces of the data analysis platform that Intel is releasing as open-source software, as well as deployment and tuning tools that Intel developed itself and which are not open source.
Organizations will be more willing to expand their investments in Hadoop if they know there's a consistent distribution backed by a big, stable vendor like Intel, said Boyd Davis, general manager of Intel's data center software division, at a launch event in San Francisco Tuesday.
Intel has been upping its investments in software for several years, to help ensure its processors are widely used beyond their traditional stronghold in client/server computing. It said it has worked with customers over the past few years to develop its Hadoop distribution, and that this is actually its third release of the software.
Still, it's a significant announcement that moves Intel deeper into the software industry. Like many other open-source providers, Intel will now sell support and maintenance services for its distribution, Boyd said.
Hadoop includes a dozen or so open-source projects that work together to make it easier for users to store, manage and analyze large amounts of data. It's become the go-to software platform for companies mining Web logs, transaction histories and other data in search of added value.
Intel's distribution includes versions of the Hadoop Distributed File System, the Hadoop Processing Framework, Hive and Hbase. Intel has tweaked those programs to take advantage of capabilities in its own Xeon chips, such as its processor instructions for accelerating AES encryption.
"By incorporating silicon-based encryption support of the Hadoop Distributed File System, organizations can now more securely analyze their data sets without compromising performance," it said in a news release.
But Intel says the core components of its distribution remain open and compatible with other implementations of Hadoop. If customers choose Intel's distribution, "they're not getting locked into a technology," Boyd said.
At the same time, Intel has developed some of its own tools that will not be released as open source. They include a deployment and configuration tool called Intel Manager for Apache Hadoop, and a tool for tuning cluster performance, called Active Tuner for Apache Hadoop.
Customers who run Intel's Hadoop distribution on servers loaded with Intel hardware, including its processors, solid-state drives and 10 Gigabit Ethernet cards, will see a 40% performance boost over users who don't go with an all-Intel platform, according to Boyd.
- The Benefits of IBM: The Savings of Open Source Download Now
- Path Selection Infographic Path Selection Infographic
- Hyperconvergence Infographic A wide range of observers agree that data centers are now entering an era of "hyperconvergence" that will raise network traffic levels faster...
- Preparing Your Infrastructure for the Hyperconvergence Era From cloud computing and virtualization to mobility and unified communications, an array of innovative technologies is transforming today's data centers.
- LIVE EVENT: 5/7, The End of Data Protection As We Know It. Introducing a Next Generation Data Protection Architecture. Traditional backup is going away, but where does this leave end-users?
- On-demand webinar: "Mobility Mayhem: Balancing BYOD with Enterprise Security" Check out this on-demand webinar to hear Sophos senior security expert John Shier deep dive into how BYOD impacts your enterprise security strategy... All Open Source White Papers | Webcasts