Big data not just about the analytics, says Amazon CTO
Enterprises also need to consider how data is collected, stored, organized and shared
IDG News Service - Enterprises that are thinking about big data need to realize that it isn't just about analyzing vast amounts of data, but also how that information is stored, Amazon CTO Werner Vogels said during a keynote at the Cebit trade show.
Vogels' speech was entitled "Data without limits" and besides encouraging enterprises to think about the big picture, he also presented a blueprint for how Amazon's cloud can be used to ease some of the pain of implementing big data systems.
"Big data is not only about analytics, it's about the whole pipeline. So when you think about big data solutions you have to think about all the different steps: collect, store, organize, analyze and share," said Vogels.
To make full use of the growing amounts of data many enterprises collect and to gain a competitive advantage, innovation has to occur in all of these areas, not just analytics, according to Vogels.
Amazon itself has been doing big data and analytics for a long time to try to target customers and come up with relevant recommendations. What it has learned along the way is that bigger, in this case, is better, according to Vogels. When mistakes have been made, it's because there isn't enough data to back up a recommendation, for instance, he said.
But Amazon isn't just using big data itself, it is also helping drive demand for its cloud, which is the great enabler of this market, according to Vogels.
"It is really important that if you go into this big data world that you have limitless possibilities in your hand. You should not be restricted in the way you store things or the way you process it," said Vogler.
Amazon Web Services offers a number services that can help enterprises collect, store, organize, analyze and share their data.
For example, Direct Connect allows enterprises to establish a dedicated network connection from a customer's site to Amazon. For really large amounts of data there is also AWS Import/Export, which allows enterprises to send portable storage devices to Amazon, which are then uploaded to Amazon's cloud storage.
"You should not underestimate the bandwidth of a Fedex box," said Vogels.
Other services that are also a good fit for big data include Amazon's Simple Storage Service, the DynamoDB NoSQL database and the Apache Hadoop-based Elastic MapReduce, which can be used to perform data-intensive analytics tasks.
The purported advantages are the same as when using cloud services in other areas -- having to pay only for resources used, faster deployment times, less management, and the ability to add more computing power quickly.
Vogels also had some homework for his audience, recommending a book called "The Fourth Paradigm: Data-Intensive Scientific Discovery," which tells the origins of big data.
Send news tips and comments to mikael_ricknas@idg.com
- Google I/O 2013's Coolest Products and Services
- 10 Star Trek Technologies That are Almost Here
- 19 Generations of Computer Programmers
- 25 Must-Have Technologies for SMBs
- A walking tour: 33 questions to ask about your company's security
- 15 social media scams
- The 7 elements of a successful security awareness program
- IT Certification Study Tips
- Register for this Computerworld Insider Study Tip guide and gain access to hundreds of premium content articles, cheat sheets, product reviews and more.
- Case Study: Simplifying the Transition to Exchange 2010 with Email Management Solutions Read this case study to learn how a cloud-based email management solution greatly simplified the company's transition to Exchange 2010.
- Application Security eGuide In this eGuide, CIO and sister publications CSO and InfoWorld bring you news, opinions, research and advice regarding the risks that enterprises face...
- How Storage Resource Management Suite Meets Today's Storage Management Challenges This white paper outlines the common use cases Storage Resource Management Suite addresses including comprehensive monitoring, reporting, and analysis for heterogeneous block, file,...
- When Application Performance is Better, Business Works Better Poor application performance can cost more than you think. In fact, Enterprise Management Associates reports that it can exceed $1 million per hour...
- Live Webcast
Webinar: Create Competitive Advantage, Featuring Synchology - View Now!
- Webinar: Create Competitive Advantage, Featuring Synchology View Now!
- Software Asset Management - Program Considerations to Help Reduce Risk and Lower Costs SAM: A must have IT tool to help reduce costs and minimize business and legal risks. All Business Intelligence/Analytics White Papers | Webcasts