Hortonworks brings Hadoop to Windows
Hortonworks expects its Windows version of Hadoop will feature full feature parity with the Linux version
IDG News Service - Hortonworks is bringing the popular open-source Apache Hadoop data processing platform to Microsoft shops.
The company has released a beta version of its Hortonworks Data Platform (HDP) Hadoop distribution for Windows and expects to release the final, enterprise-ready version in the months to come.
HDP is "the first and only distribution of Hadoop available on both Linux and Windows," said David McJannet, Hortonworks vice president of marketing.
According to McJannet, Hortonworks heard a lot of demand from potential customers for a Hadoop distribution that would run on the Microsoft platform.
"The real catalyst is, frankly, market demand. The significant majority of the servers running in the enterprise today are running Windows Server," McJannet said. "We've seen significant interest from our customers towards using Hadoop on the platform that they rely on for their critical applications."
Hortonworks and Microsoft have been porting the software to Windows over the past 18 months, as well as testing the software for enterprise use, McJannet said. The HDP distribution consists of a set of different software programs -- including HDFS, MapReduce, Hive, Pig and others. Like the Linux version, the Windows HDP will be available as open source "so others can benefit and extend the work that we have done," McJannet said.
Going forward, Hortonworks will release new versions of the HDP in both Linux and Windows. This first Windows beta version is based on the HDP 1.1 codebase.
Initially, the Windows beta does not have feature parity with the Linux version, though it does have all the "core components" to run Hadoop, McJannet said. But it does not include the Ambari set of management tools. Over time, however, Hortonworks does plan to duplicate all the features on the Windows version.
Hortonworks expects that the kind of workloads run on the Windows platform will be similar to those run on Linux, in terms of size and scope. "We fully anticipate some of the largest deployments of Hadoop could well be on Windows," McJannet said.
The distribution does not support running a mixture of Windows nodes and Linux nodes in the same deployment. Deployments should be all in one OS or another. "In practice, we'd expect homogeneity across the infrastructure, though we'd have to wait and see how that pattern emerges," McJannet said.
Over time, Microsoft will provide more support in other software products, most notably System Center, for organizations that want to move Windows Hadoop workloads in between their own data centers and a Microsoft Azure cloud service, said Herain Oberoi, Microsoft director of product marketing in the company's server and tools division.
As of press time, Hortonworks hasn't finalized the versions of Windows Servers upon which HDP will run, though the beta will run on Windows Server 2008 and Windows Server 2012. The product will not run on Windows desktop versions.
Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
- 10 Hot Big Data Startups to Watch
- 11 Unique Uses for Google Glass, Demonstrated by Celebs
- How to Export Your Google Reader Account
- How to Better Engage Millennials (and Why They Aren't Really so Different)
- Telltale signs of ATM skimming
- 20 security and privacy apps for Androids and iPhones
- Big screen con artists: 7 great movies about social engineering
- IT Certification Study Tips
- Register for this Computerworld Insider Study Tip guide and gain access to hundreds of premium content articles, cheat sheets, product reviews and more.
- Big Data, Big Demands While the concept of big data is nothing new, the tools and technology are in place for companies to take full advantage. Enterprises...
- Big Data Transforms Business eBook Businesses that exploit Big Data to improve their strategy and execution can distance themselves from competitors by using new insights from data that...
- Harness IT -- An Introduction to Business Intelligence Solutions Learn the key selection criteria required to provide your organization with the capability to address structured data, unstructured data and mobile demands so...
- Business Intelligence Shows its Smarts Today's Business Intelligence (BI) tools provide a new way to think about data with self-service capabilities and user-friendly analytics that can be used...
- Content Analytics: Big Data Conquered, Customer Service Elevated For organizations looking to start a content analytics program or improve their existing capabilities, Aberdeen Group and IBM will lay out several recommendations...
- Virtustream (Vayence) video taking a 3000-Seat SAP Environment to the Cloud How can public cloud services help your organization reduce costs and increase security for your mission All Big Data White Papers | Webcasts
Get started with this popular programming language for data visualization and analysis. Read more....