Privacy Algorithms
Technology-based protections could make personal data impersonal.
October 14, 2002 12:00 PM ETComputerworld -
In the ongoing debate over how to protect personal information, much of the attention has focused on whether - and to what degree - the government should limit the amount of personal information companies can ask for or share.
Recently however, a small group of computer scientists has been taking a different tack. They're building software tools that promise to keep names, addresses, health status and other information secret while allowing patterns to emerge within large data sets that can help predict broad social trends, buying behaviors or massive health or terrorist threats.
Some of this software has been patented and used by government agencies in the U.S.; other algorithms are several years from practical implementation. The tools may someday be used by health care providers, financial services firms and the government for collecting and using data gleaned from individuals.
Some of the existing tools enhance anonymity. For example, the Freedom browser from Zero-Knowledge Systems Inc. in Montreal prevents sending of personal information over an Internet connection without the user's consent.

![]()
Latanya Sweeney, a professor at Carnegie Mellon University ![]()
For example, researchers at the IBM Privacy Research Institute in San Jose are perfecting an approach that "randomizes" data before it's communicated. A Web business might use it to extract valuable demographic data without knowing the underlying personal data of the consumer.
A user would enter his age, salary or weight, and software would randomize it by adding or subtracting that number from a random value. The random value would differ for every user, while the range of randomization wouldn't change. The software would use the randomized values and the range of randomization to find a close approximation of the true distribution, IBM officials say. Experiments show a 5% to 10% loss in accuracy of data even when all values are randomized, says Rakesh Agrawal, an IBM researcher on the project.
Carnegie Mellon University in Pittsburgh is focusing on protecting personal information that's already public, such as voter registration information and hospital discharge data. "One of the biggest problems is that people think their data might be anonymous when it is not," says Latanya Sweeney, a computer science professor and director of the school's Laboratory for International Data Privacy.
Sweeney estimates that 87% of the U.S. population can be uniquely identified if only a date of birth, gender and five-digit ZIP code are known.
Additional Resources



Learn the important issues you must consider before starting your next mobility initiative. Get your mobility white paper from IDC now, compliments of Sybase.
White Papers & Webcasts
Accelerate SSL Encrypted Applications
The amount of SSL traffic is growing in the enterprise. Because it is encrypted, it cannot be properly controlled and accelerated. Blue Coat...
Data Protection and Disaster Recovery with iSCSI and VMware
Data protection and disaster recovery are top of mind for any IT manager, and the challenges of complexity and cost remain as obstacles....
ESG Lab Field Audit
Many companies have successfully implemented Riverbed WAN optimization solutions within their Cisco networks. This ESG Lab Field Audit document explores the success that...
Usability Is Everything
Learn what sets Workday's HR and Payroll solutions apart from the competition....
Shape Your Apps Strategy to Reflect New SaaS Licensing and Pricing Trends
Why are smart companies choosing software-as-a-service? Find out in the complimentary Forrester Research report...
The Value of Real SaaS at Workday
Cost savings, speed to value, and innovation brought to the enterprise by Workday's software-as-a-service solutions for HR and Payroll....
Natural User Interface for Enterprise Applications
Learn how a revolutionary user interface can make a complex enterprise application so intuitive even casual users can jump right in....
SaaS at Flextronics, Inc.
Dave Smoley, CIO of Flextronics, discusses the real value of software-as-a-service and why he chose Workday for his HR solution....
A Truly Global HCM System
Learn about a system built with advanced object-oriented technology that support multi-national requirements and costs less to implement, maintain and upgrade....
Why Compliance Pays
This OnDemand webcast explores the relationship that firms with best compliance records have higher revenue, greater customer retention, lower financial losses from data...
Subscribe to Computerworld
