Skip the navigation

Big Data Topic Center

Big data news, in-depth articles and more

Big Data News

R programming language gains on traditional statistics packages

The R programming language is quickly gaining popularity over the traditional statistics packages such as SPSS, SAS and MATLAB, at least according to one data statistician who teaches the language.
Read more...

TPC helps take the measure of big data systems

Comparing commercial Hadoop big data-styled analysis systems might get a little easier, thanks to a new benchmark from the Transaction Processing Performance Council (TPC).

SAP makes it cheaper, easier for customers to use Hana in certain scenarios

SAP has relaxed hardware requirements for using its Hana in-memory database platform for development and testing, in a move that could make existing Hana customers' operations easier and less expensive while also generating more Hana sales.

Google's big-data tool, Mesa, holds petabytes of data across multiple servers

Google has found a way to stretch a data warehouse across multiple data centers, using an architecture its engineers developed that could pave the way for much larger, more reliable and more responsive cloud-based analysis systems.

Danish university transfers data at record-breaking 43Tbps over a single fiber

Using souped-up fiber, the Technical University of Denmark has set a new record for transmitting data over one channel: 43 terabits per second.

Oracle hits back at ex-employee's claims about in-memory database option

Oracle has responded to a former employee's claim that a new in-memory processing option is turned on by default with the latest release of Database 12c, insisting that the process of enabling it requires a series of deliberate steps.

Hortonworks and Pivotal team up to better manage Hadoop

In the hot market for big-data products and services, sometimes even competitors must work together for the common good.

HP invests in Hortonworks' Hadoop

Hewlett-Packard is betting $50 million that its customers will find value in the Hadoop data processing platform by investing in Hadoop distributor Hortonworks.

Oracle builds on BlueKai acquisition with Data Cloud

Oracle is combining its BlueKai consumer data aggregation platform with other parts of its catalog to create Oracle Data Cloud, a data-as-a-service offering aimed at companies that want to reach customers and prospects across multiple channels.

Teradata acquires assets of Hadapt, Revelytix for big data

Teradata has bought the assets of Revelytix and Hadapt in a bid to grow out its capabilities for the Hadoop big-data processing framework.

Big Data In Depth

Hadoop gets real

Its fast and robust data processing and storage power make Hadoop both wildly popular and wildly complex. Here's how four IT leaders have managed to bring Hadoop systems from the sandbox into production.

Apache Hive brings real-time queries to Hadoop

Hive's SQL-like query language and vastly improved speed on huge data sets make it the perfect partner for an enterprise data warehouse

The Apple/IBM deal: iOS claims the IoT

The partnership announced last week isn't just about selling more iPhones. It's part of a big push into the Internet of Things.

Julia King: We're all data scientists now

It's up to each one of us to figure out what in the daily surge of data is useful, what's crap and what's truly valuable.

Microsoft Azure ML -- big data modeling in Azure

Microsoft has jumped in with both feet with the release to Preview of a new Microsoft Azure-based tool that helps organizations do Machine Learning and predictive analysis all from a Web console.

Big data security analytics mantra: Collect and analyze everything

In a recent research survey, ESG asked security professionals to identify the most important type of data for use in malware detection and analysis (note: I am an employee of ESG). The responses were as follows:

R data manipulation tricks: PDF

Download this companion to our story, to more easily follow along tricks, tips and code.

Cassandra lowers the barriers to big data

Apache Cassandra is a free, open source NoSQL database designed to manage very large data sets (think petabytes) across large clusters of commodity servers. Among many distinguishing features, Cassandra excels at scaling writes as well as reads, and its "master-less" architecture makes creating and expanding clusters relatively straightforward. For organizations seeking a data store that can support rapid and massive growth, Cassandra should be high on the list of options to consider.

Government open data proves a treasure trove for savvy businesses

Ever since President Obama signed the Open Data Executive Order, government agencies have been making their vast data stores available to the public. These once-secret data sets are proving a valuable business resource, too.

Thornton May: The path to big data mastery

While the path-to-mastery pattern is conceptually simple, successfully executing it requires courage, perseverance and patience.

Internet of Things: Get the latest!
Internet of Things

Our new bimonthly Internet of Things newsletter helps you keep pace with the rapidly evolving technologies, trends and developments related to the IoT. Subscribe now and stay up to date!