Oracle rolls out 'Big Data' appliance
The newest member of Oracle's appliances includes support for the open-source Hadoop and R frameworks
IDG News Service - Oracle unveiled the Big Data Appliance, the newest addition to its line of products that combine software and hardware, during the OpenWorld conference in San Francisco on Monday.
"Big data" is an industry buzzword that refers generally to the massive amounts of information generated by websites, sensors and other sources apart from traditional enterprise applications.
The new appliance includes a distribution of the open-source Hadoop programming framework, Oracle Data Integrator Application Adapter for Hadoop, Oracle Loader for Hadoop, a distribution of the R open-source statistical analysis software, and the Oracle NoSQL database, according to a statement.
"There's a lot of data, and a lot of it has very low business value. There's only a few nuggets that people want to find," Andy Mendelsohn, senior vice president of database server technologies, told press and analysts. Hadoop and other tools can distill that data down to something useful, and it can then be loaded into a data warehouse, particularly one powered by Oracle's Exadata appliance, for further analysis, he said.
NoSQL refers to a growing set of database technologies that can be defined by what they omit, such as "SQL, joins, strong analytic alternatives to those, and some forms of database integrity," analyst Curt Monash said recently. "If you leave all four out, and you have a strong scale-out story, you're in the NoSQL mainstream."
The Oracle NoSQL database is a "distributed, highly scalable, key-value database" that is "easy to install, configure and manage, supports a broad set of workloads and delivers enterprise-class reliability backed by enterprise-class Oracle support," according to an Oracle statement.
It is based on Oracle's Berkeley DB product. "Berkeley DB is probably the most popular key-value store out there on the Web," but it uses a single index, Mendelsohn said. For the NoSQL database, Oracle "turned it from a single index to a distributed implementation, where you could have maybe 100 indexes," he said.
Like Berkeley DB, the NoSQL database will be available in both open-source and commercial versions. The latter will probably gain premium features over time, Mendelsohn said.
Meanwhile, Oracle recognizes that administrators and developers may not be familiar with programming models like Hadoop, Mendelsohn said.
"Hadoop as it currently stands is a very niche technology," according to Mendelson. "Everybody's talking about it, but who in our enterprise installed base can use something like this?"
That's why tools like the data-integrator adapter and loader for Hadoop are so important, since they help bridge that skills gap, he said.
"Have we done enough [with Hadoop tooling]? I don't think we're there yet, but we've made some good steps," Mendelsohn added.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Thinking Outside The Data Warehouse
- This high level, business problem focused eBook uses 5 customer scenarios to show how people and organizations are tackling real issues using IBM...
- Using BD for Smarter Decision Making
- This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.
- Measuring the Business Value of CI in the Data Center
- One of the key strategies that IT teams are pursuing to reduce capital costs while boosting asset utilization and employee productivity is the...
- Switching Schedulers - Not As Complicated As You Think
- Changing or consolidating job schedulers may seem daunting. However, the benefits of switching to enterprise workload automation outweigh the risks. Read how BMC...
- Capture-Enabled Business Process Management
- Organizations today must deal with a vast amount of incoming information from many different sources. Efficient, automated business processes are critical to managing... All BI and Analytics White Papers
- Live Webcast
Data Privacy and Protection in Production Environments: New Research from Ponemon Institute - Date: Wednesday, June 13, 2012, 1:00 PM EDT / 10:00 AM PDT
In a recent study conducted by Ponemon Institute, fifty-five percent of respondents... - Live Webcast
A Geek's Guide to Presenting to Business People - Live Webcast: Wednesday, June 20th at 1:00 PM EDT
Join this live webinar with Paul Glen, author of Leading Geeks, to learn how to... - Live Webcast
Today's NAS: A Solution Beyond Old Limits - Date: Tuesday, July 17, 2012 2:00 PM EDT
Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management... - InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's... All BI and Analytics Webcasts