
Big data to drive a surveillance society

Analysis of huge quantities of data will enable companies to learn our habits, activities

March 24, 2011 01:23 PM ET

Computerworld - NEW YORK -- As real-time and batch analytics evolve using big data processing engines such as Hadoop, corporations will be able to track our activities, habits and locations with greater precision than we ever thought possible.

"It will change our existing notions of privacy. A surveillance society is not only inevitable, it's worse. It's irresistible," said Jeff Jonas, a distinguished engineer with IBM. Jonas spoke to a packed house of several hundred people Wednesday at the Structure Big Data 2011 conference here.

For businesses, the ability to determine where people are by using geo-locational data will help them personalize advertising and marketing materials disseminated via the Web. For example, if a company knows a customer is in Aruba, it won't bother showing him ads for restaurants in New York; it might market sunblock or scuba-diving excursions instead.

Knowing where people are will also enable companies to accurately determine which potential customer is which. For example, if there are five people in the U.S. who have the same name and the same date of birth but live in different cities, it would be possible to verify the identity of each individual by determining their locations at a given time.

"Just look at the last 10 years of address histories ... it is very telling if this is the same person or not," Jonas said. "Two different things cannot occupy the same space at the same time."
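The reasoning Jonas describes can be illustrated with a minimal sketch. It assumes each record carries an address history as dated city stays; the `Residence` type, the `could_be_same_person` function and the sample data are hypothetical and are not drawn from any IBM product. Two records that share a name and date of birth are treated as different people if their histories place them in different cities during overlapping periods. Real systems would also need to handle people with more than one home, so this is a simplification.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Residence:
    """One entry in a (hypothetical) address history."""
    city: str
    start: date
    end: date  # inclusive


def periods_overlap(a: Residence, b: Residence) -> bool:
    # Two date ranges overlap when each one starts before the other ends.
    return a.start <= b.end and b.start <= a.end


def could_be_same_person(history_a, history_b) -> bool:
    """Return False when the histories conflict, i.e. they place the
    record holder in two different cities at overlapping times."""
    for a in history_a:
        for b in history_b:
            if periods_overlap(a, b) and a.city != b.city:
                return False
    return True


# Illustrative data only: three records with the same name and birth date.
record_1 = [
    Residence("Austin", date(2001, 1, 1), date(2005, 12, 31)),
    Residence("Denver", date(2006, 1, 1), date(2010, 12, 31)),
]
record_2 = [Residence("Boston", date(2003, 1, 1), date(2008, 12, 31))]
record_3 = [Residence("Denver", date(2007, 1, 1), date(2009, 12, 31))]

same_1_2 = could_be_same_person(record_1, record_2)
same_1_3 = could_be_same_person(record_1, record_3)
```

Here `same_1_2` is `False` because the Boston stay overlaps both the Austin and Denver stays, while `same_1_3` is `True` because the two Denver stays are consistent with one another. A consistent history does not prove the records match; it only fails to rule the match out.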

Jonas said 600 billion electronic transactions are created in the U.S. every day, and many of those transactions come from geo-locational data generated by cell phones, whose signals let cellular towers triangulate a person's exact location at any time. Wireless providers have that data in real time.

By looking at data over a period of years, corporations can know how you spend your time, where you work, and who you typically spend time with.

"This is super food [for big data analytics]," Jonas said. "With 87% certainty, I can tell you where you'll be next Thursday at 5:35 p.m."

"Big data" -- an industry term that refers to large data warehouses -- includes machine- and human-generated data such as computer system log files, financial services electronic transactions, Web search streams, e-mail metadata, search engine queries and social networking activity. In 2010 alone, 1.5 zettabytes of that kind of data was created, most of it machine-generated. Companies filled their data center storage systems with about 16 exabytes of that data last year, according to Jason Hoffman, founder and chief scientist at cloud software provider Joyent.

Bill McColl, CEO of analytics engine vendor Cloudscale, said that up until now, big data analytics has been about offline queries or "MapReduce" algorithms, which were developed by Google. But 90% of corporate data warehouse users say they want to move forward into a world with real-time analytics.

"Companies know if they can extract more insight from data faster than their competitors, they're going to win," McColl said.

Jim Baum, founder and CEO of Netezza, maker of a massively parallel processing (MPP) data warehouse appliance, agreed with McColl. Baum argued that if a corporate user has to wait even three days to get an answer to an analytics query, the user won't bother asking a follow-up question that could be the key to unlocking the truly valuable insights the information has to offer.

"If I can get an answer in real time, I will ask the next question and the next question, and that'll be followed by another. Getting answers in near real time is critical. It's the enabler of what we can do with big data," said Baum, whose company was purchased by IBM last year. IBM's purchase of Netezza was among a flurry of big data analytics vendor acquisitions over the past year. Other deals included EMC's purchase of Greenplum, Hewlett-Packard's purchase of Vertica and Teradata's planned acquisition of Aster Data Systems.


