Google adds Caffeine for more up-to-date results
IDG News Service - Google has introduced a new Web indexing system to provide users with more up-to-date search results, the company said Tuesday.
The new system, called Caffeine, delivers results that are closer to "live" than Google's previous system, the company said.
Previously, Google would crawl a fraction of the Web each night, index it and push it out in its results. With Caffeine, as Google crawls the Web and finds new information, it indexes it immediately. "We process it immediately so we can serve it seconds later," said Matt Cutts, the head of Google's webspam team. He unveiled the news at the Search Marketing Expo in Seattle.
When Google started, it would update its index only every four months, he said. Around 2000, it started indexing every month in a process that took a week to 10 days. "The funny thing is, we didn't have enough capacity to update all our data centers at once," he said. That meant that people might get different results when searching for the same term if they were hitting different Google data centers.
Caffeine went live "in the last few days" and is now being used in all Google data centers, he said.
In addition to serving "fresher" results, Caffeine "massively increases our ability to scale up," Cutts said. The company will be able to index many more documents -- "on the order of 100 petabytes," he said.
Caffeine adds new information at a rate of hundreds of thousands of gigabytes per day, Google said in a blog post.
The progression in how Google does its indexing mirrors how people increasingly expect to find the very latest information online. Google noticed that after the Sept. 11 attacks on the U.S., when people were looking for the most up-to-the-minute information possible, Cutts said.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Finding the right cloud solutions for your organization
- HP is driving the evolution of what we call the Instant-On Enterprise. It is an enterprise that embeds technology into everything it does...
- Converged Infrastructure for Dummies
- As you know, everything is mobile, connected, interactive, and immediate. This is exactly why organizations need a highly agile IT infrastructure in order...
- Seven Priorities for Integrated Network Management - How HP Intelligent Management Center Delivers an Enterprise-class Solution
- This white paper describes the major requirements for network management solutions to help the organizations become more profitable, efficient and reliable.
Intel and the... - Building Cloud-Optimized Data Center Networks white paper
- Enterprises are turning to the Cloud to improve business agility, reduce expenses and accelerate business innovation. Cloud computing redefines the way IT assets...
- Gartner on the Network Infrastructure Market
- The network infrastructure market has evolved rapidly, from one in which most organizations adhered to a single-vendor architecture to a more business-driven network... All Networking White Papers
- The Higher-Bandwidth, Lower-Cost Connection of Choice: 10GBASE-T LAN on Motherboard
- Learn how Expedient, a cloud provider, is using 10 Gigabit Ethernet to boost its services and rein in costs.
- Distributed Database Security with Real-time Monitoring
- View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific...
All Networking Webcasts