News Analysis

In reversal, Yahoo will store user search data longer

Move to store IP numbers, search terms, cookies for 18 months instead of 90 days unlikely to win friends in privacy community

April 18, 2011 07:30 PM ET

Computerworld - In a move that is unlikely to win it any new friends in the privacy community, Yahoo has announced that it will retain consumer search data for a substantially longer period of time than it does today.

Starting sometime in mid-July, Yahoo will hold raw search log file data, including IP addresses, cookies and search-related information, for up to 18 months. It currently retains such data for 90 days.

Yahoo's chief trust officer, Anne Toth, said in a blog post that the change, announced on Friday, was designed to give consumers a more robust and personalized search experience while also bringing Yahoo into closer alignment with industry-wide data retention norms.

"We will hold raw search log files for 18 months and we will be closely examining what the right policy and time frame should be for other log file data," Toth wrote. "In announcing this change, we have gone back to the drawing board to ensure that our policies will support the innovative products we want to deliver for our consumers."

Toth's announcement marks an abrupt reversal of Yahoo's current data retention policy, which it put in place in 2008. Under its current policy, Yahoo stores most log file data for just 90 days, though in some cases the company holds raw data for as long as six months for what it calls fraud and security purposes, and to comply with legal requirements.

In contrast, Google stores search data for nine months, while Microsoft retains it for six months.

When Yahoo announced the policy in 2008, the company noted that its goal was to be able to minimize data retention while also being able to offer a highly personalized experience for users.

While the company's goals remain the same, the Internet has changed and so has Yahoo's business and its competition, Toth said. As a result of these changes "we are moving to align our log file data retention policy closer to the competitive norm across the industry," Toth said.

Yahoo's announcement comes just two months after the U.S. Department of Justice and an organization representing police chiefs from around the country called for legislation mandating that Internet service providers retain certain customer usage data for as long as two years.

Jim Harper, director of information policy studies at the Cato Institute, a libertarian-leaning public policy research organization, said there's nothing to indicate that Yahoo's immediate move was the result of any government pressure. Instead, the move appears to have been driven purely by competitive pressures.

He said it would be up to consumers to decide if it is something that is appropriate or not.

At the same time, companies such as Yahoo and Google are coming under increasing pressure from governments in the U.S. and the EU to store all sorts of data for law enforcement purposes, Harper noted.

"I am quite concerned about governments using online services of all kinds as data sources," Harper said. "I tend to worry about the lengthening of data periods."

"Once they are fully tapped into online services and start using them as an ongoing source of information," it will be near impossible to reverse the trend, he said.

Jaikumar Vijayan covers data security and privacy issues, financial services security and e-voting for Computerworld. Follow Jaikumar on Twitter at @jaivijayan or subscribe to Jaikumar's RSS feed. His e-mail address is jvijayan@computerworld.com.
