Hortonworks adds a Hadoop sandbox
Hadoop newbies can learn how to use the data processing platform on an easy-to-run virtual machine
IDG News Service - Those who want to try the much-hyped Hadoop but haven't got a cluster or two to spare can now test the data processing platform on their desktops, thanks to a new release from Hadoop distributor Hortonworks.
Hortonworks Sandbox is a single-node implementation of Hadoop, based on the Hortonworks Data Platform. Packaged in a virtual machine, it bundles the typical components found in a Hadoop deployment, including the HCatalog storage management subsystem, the Hive data warehouse and the Pig set of data analysis tools.
The package also offers a number of tutorials that show users how to execute Hadoop data analysis tasks, according to Cheryle Custer, Hortonworks' director of services marketing. It ships with three tutorials, and more will be made available for download in the coming months. It also includes videos and online datasets that can be used to test features.
While widely used, Hadoop can be a challenge for new users to learn, particularly for data scientists and others who aren't system administrators. The software requires a considerable amount of work to set up and run. In addition to installing the software, and a Java Virtual Machine (JVM) if one is not already on the system, the user must also install a file system. The software also requires its own user account, which could pose a security risk.
The Hortonworks Sandbox eliminates all that installation work, requiring only that the user download and run a virtual machine. The virtual machine package, which is built on the CentOS Linux distribution, will run in either VMware or Oracle VirtualBox environments.
In addition to building a Hadoop sandbox, Hortonworks engineers have also been busy working on the company's flagship enterprise Hadoop distribution. The Hortonworks Data Platform version 1.2, released last week, offers new management and security tools.
Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com