Microsoft climbs onto Hadoop bandwagon
Joint Microsoft-Hortonworks effort aims to deliver a Hadoop distribution for Windows Server and Windows Azure
Computerworld - Microsoft is the latest of the world's top IT vendors to climb aboard the Hadoop 'big data' bandwagon.
The company Wednesday announced it will collaborate with Yahoo spin-off Hortonworks to develop an Apache Hadoop implementation for its Windows Server and Windows Azure platforms.
Under the strategic partnership, Hortonworks will lend its domain expertise to help Microsoft integrate Hadoop into its Windows technology.
Microsoft said it expects to have a preview of a Hadoop-based service for Windows Azure by the end of this year, and one for Windows Server sometime in 2012. The Windows Server Hadoop implementation will work with existing Microsoft BI tools, Microsoft said in a statement.
Microsoft made the announcement at the PASS Summit, a SQL Server user conference held in Seattle.
The move will help Microsoft customers better manage their 'big data' requirements, said Microsoft corporate vice president Ted Kummert in a statement. "The next frontier is all about uniting the power of the cloud with the power of data to gain insights that simply weren't possible even just a few years ago."
Microsoft's move comes barely a week after Oracle unveiled a Hadoop-based big data appliance, along with a new Oracle NoSQL database and an open source distribution of the R programming language for statistical analysis.
Like Microsoft, Oracle said its Hadoop offering aims to tap a growing enterprise interest in big data analytics.
Just yesterday, IBM announced plans to acquire Platform Computing, a Toronto-based maker of software for managing the large computing clusters on which Hadoop typically runs.
Hadoop, an open-source software framework that supports big data applications, is increasingly attracting the attention of top IT executives for its ability to handle massive volumes of unstructured data like email content, weblogs, clickstream data, audio and video files, and sensor data.
A growing number of companies are looking to collect and analyze such unstructured data to glean new business insights. But to date they have been somewhat hampered by the inherent scalability limitations of conventional relational database management products, which are designed mostly to handle structured, relational data.
Early adopters of Hadoop such as Yahoo, AOL, Google and others have been using Hadoop to store and analyze petabytes of unstructured data. Other enterprise data warehouse technologies have not been able to easily handle such tasks.
Gartner analyst Merv Adrian said Microsoft's alliance with Hortonworks is not surprising.
"Every leading database vendor needs to ensure that customers who want to exploit Big Data don't move any more of their share of wallet away than necessary," he said. "[The] only question was whether they would go it alone or align with someone."

