Splunk woos Hadoop users
Unveils Hunk tool set that promises to help businesses interact more easily with data in Hadoop
Computerworld - Corporate IT operations' need for easier interaction with massive -- and fast-growing -- data sets in Hadoop environments is driving a flurry of vendor activity.
For one, Splunk last week rolled out a beta version of an analytics tool that it claims can be used to access, search, analyze and use data in Hadoop environments more efficiently than current technologies like MapReduce, Pig and Hive.
The product, called Hunk, lets companies gain insight into Hadoop data assets without the need for custom development, data migration, batch processing and data modeling, said Clint Sharp, principal product manager for big data at Splunk.
The Hunk tool set lets enterprises explore, query and analyze Hadoop data where it resides, Sharp said.
The technology supports ad hoc querying of Hadoop data and enables users to analyze and correlate petabytes of structured and unstructured data in a distributed Hadoop environment, he added.
Businesses can use Hunk to build graphs, visualize data and create custom dashboards in Hadoop. It also allows them to more easily share insights gathered from Hadoop data with others in the enterprise, the company says.
Hunk is the company's first major foray into the Hadoop business beyond a connector product for sharing data between Splunk and Hadoop and another one for monitoring the health of a Hadoop environment.
Hunk taps into growing enterprise interest in Hadoop technologies and the need for easier-to-use products than are available today, Sharp said.
"It's not that hard getting data into Hadoop. But getting value from the data is incredibly hard," he said.
The open source Hadoop software, distributed by the Apache Software Foundation, and some of the technologies that have grown up around it are mostly optimized for batch processing tasks and do not allow the sort of interactive, ad hoc querying of data that companies are increasingly looking for, Sharp said. "Our goal is to give you a user interface for Hadoop that is easy to use," and allows such interaction, he said.
Splunk has done a good job so far of helping companies tap machine log data for useful information, said Merv Adrian, an analyst with Gartner Inc. "With Hunk, they are taking what they learned with machine data and moving it over to more general purpose data in Hadoop," he said.
Hunk is one of a small but emerging set of tools that enable direct interactive analytics against the Hadoop Distributed File System, Adrian said. "It is part of a new wave" of products, along with Cloudera's Impala, EMC Greenplum's Pivotal HD and the open source Apache Drill project. "The first wave was brute force batch processing of files in Hadoop."
Hunk comes at a time when enterprise interest in Hadoop appears to be gradually picking up.
A Gartner survey of 687 large enterprises earlier this year showed that 30% had invested in big data technologies over the past year and another 19% said they planned to this year, Adrian said.
The numbers are modestly higher than last year, when Gartner found that 27% of companies said they had invested in big data products and 15% planned to, Adrian said.
"Adoption has increased steadily but not dramatically," he said. But new capabilities such as those introduced by Splunk and other vendors could drive faster growth, he said.
Jaikumar Vijayan covers data security and privacy issues, financial services security and e-voting for Computerworld. Follow Jaikumar on Twitter at @jaivijayan, or subscribe to Jaikumar's RSS feed. His email address is firstname.lastname@example.org.