Skip the navigation

IBM unveils Web privacy work

By Ann Bednarz, Network World
May 31, 2002 12:00 PM ET

Network World - Researchers at IBM's Privacy Institute are working on software that automatically scrambles Web visitors' personal information so consumers won't feel compelled to lie to protect their privacy.
It's no secret that online visitors often provide false personal data to avoid any repercussions should the data be misused or shared with multiple sources. For merchants, that means the customer data they painstakingly track with customer relationship management software -- and often rely on when making product development and marketing decisions -- can be flawed from the start.
To help solve this problem, researchers Rakesh Agrawal and Ramakrishnan Srikant are developing what IBM calls "privacy-preserving data mining." The duo's research, which IBM announced yesterday, relies on the notion that a Web visitor's personal data can be protected if the information is scrambled, or randomized, before it gets to the merchant. Once the data is transferred to the merchant's systems, the IBM software applies algorithms to compensate for the data scrambling. With this technology, a retailer could still generate accurate data models and extract useful demographic information, but without ever seeing personal consumer data, IBM said.
"Our research institutionalizes the notion of fibbing on the Internet, and does so to preserve the overall reality behind the data," Agrawal said.
When a Web user submits a piece of personal data, such as age or salary, the IBM software immediately scrambles that number by adding to or subtracting from it a random value. This randomization step is performed independently for every user, IBM said. This means a 30-year-old's age may be changed to 42, while a 34-year-old's age may become 28.
The merchant determines the range of the randomization -- plus or minus 1 to 12 years, for example -- which then remains constant. Once all the scrambled data is collected for a large number of users, IBM's data mining software determines how the true data might have looked and uses the reconstruction to build a data-mining model, IBM said.
The greater the range of number scrambling that's allowed, the more consumers' private data is obscured. However, as randomization parameters increase, the accuracy of the post-scramble data mining results decreases. According to Agrawal, it's a trade-off. IBM said that in its experiments, after compensating for the data scrambling, it found only a 5% to 10% loss in accuracy, even with 100% randomization allowances.
The research project is under way at IBM's Privacy Institute in Almaden, Calif. Beta trials will begin soon. It's the first project announced by the group, which was formed in November 2001.
Internet privacy is a hot topic, most recently making headlines when U.S. Sen. Ernest "Fritz" Hollings (D-S.C.) introduced a controversial bill designed to safeguard Internet users' privacy, and which opponents suggest will hamper online commerce (see story).

For more coverage of privacy issues, see our privacy page.

Reprinted with permission from NetworkWorld.com. Story copyright 2010 Network World, Inc. All rights reserved.
Additional Resources
Forrester Consulting - Optimizing Users and Applications in a Mobile World
WHITE PAPER
Solving application issues over the WAN requires careful consideration. Based on their independent research, Forrester Consulting offers recommendations on how to tackle application performance issues, insufficient bandwidth and the inability to quickly restore users in a disaster.

Read now.

Security KnowledgeVault
WHITE PAPER
Security is not an option. This KnowledgeVault Series offers professional advice how to be proactive in the fight against cybercrimes and multi-layered security threats; how to adopt a holistic approach to protecting and managing data; and how to hire a qualified security assessor. Make security your Number 1 priority.

Read now.

Cut Communications Costs Once and for All
WHITE PAPER
New IP-based communications systems are being deployed by small and midsized businesses at a rapid rate. Learn how these organizations are enabling faster responsiveness, creating better customer experiences, speeding office or mobile interactions, and dramatically reducing existing communications costs.

Read now.

Privacy White Papers
Overcome Top 7 Admin Challenges of Active Directory
As Active Directory's role in the enterprise has drastically increased, so has the need to secure the data. Gain insight on creating repeatable,...
Insiders Can Ruin Your Company. Take Action.
Did you know that 80 percent of threats to an organization come from the inside? The threat from insiders is often overlooked in...
Top Solutions and Tools to Prevent Devastating Malware
Custom malware frequently goes undetected. According to Forrester Research, the best way to reduce risk of breach is to deploy file integrity monitoring...
Streamline Compliance and Increase ROI
Streamline, simplify, and automate compliance related activities; especially those that impact multiple business units. This white paper from NetIQ, outlines solutions that will...
X-Ray of the PCI Process-4 Proactive Steps
This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
All Privacy White Papers
Privacy Webcasts
A Road Map for Best Practice Social Media Acceptable Use Policy
Organizations around the world are racing to leverage the power of social media for business. Sites like Facebook are used for marketing, human...
Data Protection and Disaster Recovery with iSCSI and VMware
Get this on demand webcast now
Optimizing Networks for the Cloud
Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
All Privacy Webcasts
Newsletter Sign-Up

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all newsletters | Privacy Policy
IT Jobs