Pervasive pairs parallel development API with Hadoop MapReduce
InfoWorld - Pervasive Software is unveiling on Wednesday version 5.0 of its DataRush parallel application software, which now works with the popular Hadoop MapReduce framework for processing large volumes of data in parallel.
Functioning with the JVM (Java Virtual Machine), DataRush helps developers build parallel applications without requiring expertise in parallel development, the company said. "The idea is to take a pure programmer off the street and enable him to write multithreaded apps," said Davin Potts, Pervasive director of product management.
[ Also on InfoWorld.com: Apple has quietly joined the ranks of Hadoop users | Keep up with the latest developer news with InfoWorld's Developer World newsletter. ]
"DataRush is an API you would use in your normal application development. It's just another library that you access," Potts said. MapReduce backing helps developers get more performance out of their MapReduce cluster. "You can get the same [query] answer in less time. You're being more efficient in how you use your cluster," Potts said.
DataRush scales across clusters, with the ability to accelerate every node in a Hadoop cluster. At data marketplace Infochimps, a DataRush user site, the company is using the software in a pilot effort to run Hadoop programs. "DataRush will coordinate shuttling the data around and gets you the concurrency," said Infochimps CTO Flip Kromer.
"Computer scientists [have] done a terrible job of letting us use multicore programs efficiently," Kromer said. "Programming concurrency is really hard. DataRush lets you bring all that performance out," while keeping programs simple, he said, though he also noted that developers still must adhere to DataRush primitives, tracking back to the DataRush data flow language.
Also featured in DataRush 5.0 is backing for newer languages on the JVM, including JRuby, Python, and Scala; users of these languages get parallel development capabilities. DataRush also can access data in data warehouses, databases, and flat files.
Pricing for DataRush 5.0 is based on factors such as use of perpetual or subscription licenses, contract terms and number of machines in a cluster, Pervasive said. Free trial downloads of DataRush 5.0 can be accessed at the Pervasive website.
This article, "Pervasive's parallel development API paired with Hadoop MapReduce," was originally published at InfoWorld.com. Follow the latest developments in business technology news and get a digest of the key stories each day in the InfoWorld Daily newsletter. For the latest business technology news, follow InfoWorld.com on Twitter.
Read more about developer world in InfoWorld's Developer World Channel.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Thinking Outside The Data Warehouse
- This high level, business problem focused eBook uses 5 customer scenarios to show how people and organizations are tackling real issues using IBM...
- Using BD for Smarter Decision Making
- This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.
- Measuring the Business Value of CI in the Data Center
- One of the key strategies that IT teams are pursuing to reduce capital costs while boosting asset utilization and employee productivity is the...
- Switching Schedulers - Not As Complicated As You Think
- Changing or consolidating job schedulers may seem daunting. However, the benefits of switching to enterprise workload automation outweigh the risks. Read how BMC...
- Capture-Enabled Business Process Management
- Organizations today must deal with a vast amount of incoming information from many different sources. Efficient, automated business processes are critical to managing... All BI and Analytics White Papers
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's... All BI and Analytics Webcasts