Hortonworks brings Hadoop to Windows
Hortonworks expects its Windows version of Hadoop to reach full feature parity with the Linux version
IDG News Service - Hortonworks is bringing the popular open-source Apache Hadoop data processing platform to Microsoft shops.
The company has released a beta version of its Hortonworks Data Platform (HDP) Hadoop distribution for Windows and expects to release the final, enterprise-ready version in the months to come.
HDP is "the first and only distribution of Hadoop available on both Linux and Windows," said David McJannet, Hortonworks vice president of marketing.
According to McJannet, Hortonworks saw strong demand from potential customers for a Hadoop distribution that would run on the Microsoft platform.
"The real catalyst is, frankly, market demand. The significant majority of the servers running in the enterprise today are running Windows Server," McJannet said. "We've seen significant interest from our customers towards using Hadoop on the platform that they rely on for their critical applications."
Hortonworks and Microsoft have been porting the software to Windows over the past 18 months, as well as testing the software for enterprise use, McJannet said. The HDP distribution consists of a set of different software programs -- including HDFS, MapReduce, Hive, Pig and others. Like the Linux version, the Windows HDP will be available as open source "so others can benefit and extend the work that we have done," McJannet said.
Going forward, Hortonworks will release new versions of HDP for both Linux and Windows. This first Windows beta version is based on the HDP 1.1 codebase.
The Windows beta does not yet have feature parity with the Linux version, though it does have all the "core components" needed to run Hadoop, McJannet said. It does not include the Ambari set of management tools. Over time, however, Hortonworks plans to bring all of the Linux version's features to the Windows version.
Hortonworks expects that the kind of workloads run on the Windows platform will be similar to those run on Linux, in terms of size and scope. "We fully anticipate some of the largest deployments of Hadoop could well be on Windows," McJannet said.
The distribution does not support running a mixture of Windows nodes and Linux nodes in the same deployment. Each deployment should run entirely on one OS or the other. "In practice, we'd expect homogeneity across the infrastructure, though we'd have to wait and see how that pattern emerges," McJannet said.
Over time, Microsoft will provide more support in other software products, most notably System Center, for organizations that want to move Windows Hadoop workloads between their own data centers and a Microsoft Azure cloud service, said Herain Oberoi, Microsoft director of product marketing in the company's server and tools division.
As of press time, Hortonworks had not finalized the versions of Windows Server on which HDP will run, though the beta will run on Windows Server 2008 and Windows Server 2012. The product will not run on Windows desktop versions.