Captchas: Computer Tests Can Defeat Spam
Ingenious computer tests may also advance machine vision and AI.
Computerworld - On the Internet, nobody knows you're a dog. Or a rogue robot program stealthily gathering personal information from chat rooms or registering for thousands of free e-mail accounts from which to blast out spam.
One way to stymie such bots is to use a captcha. Short for "completely automatic public Turing test to tell computers and humans apart," a captcha is a program that can generate and grade tests that are easy for humans to solve but very difficult for computers to crack.
Examples include words that have been precisely distorted by computers, images overlaid with other images or audio clips with background noise.
By including a captcha as part of the registration process for a free e-mail account, for instance, it would be relatively easy to establish whether the registrant is a human or a robot program.
"The human visual system and all of our experience in reading makes it possible to read images of text which computer vision systems at their best cannot do reliably," explains Henry Baird, a principal scientist at Palo Alto Research Center Inc. (PARC) in California.
The concept of using programs like captchas to deal with bots and spam on the Internet has been around since 1997. A team of researchers at what was then Digital Equipment Corp. was working on a way to deal with bots that were trying to influence the way certain sites were ranked on the company's AltaVista search engine. Researchers at the company developed and patented a character-recognition test that was used during the AltaVista registration process to weed out automated programs.
In September 2000, Pittsburgh-based Carnegie Mellon University's computer science department started developing similar programs in response to a request from Yahoo Inc.
Like AltaVista, Yahoo was grappling with rogue programs that were invading its chat rooms and illegally marketing products, stealing personal information and spamming users. "The idea was to create a computer program that could distinguish bots from humans. The program would have to serve as a sentry, but it couldn't itself pass the very test it gives," says Manuel Blum, a professor of computer science at Carnegie Mellon.
The result was Gimpy, a captcha containing seven words chosen at random from a dictionary of 850 words and then distorted and overlaid with clutter via software. Passing the test required identifying at least three of the distorted words correctly.
A simpler one-word version of Gimpy, called E-Z Gimpy, is currently used by Yahoo on its Web site to weed out humans from bots during the registration process.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Forrester Total Economic Impact (TEI) Case Study - Oracle
- In this paper, Forrester Consulting examines the total economic impact and potential return on investment (ROI) realized by three Enterprise organizations as they...
- The Hidden Truth About Virtualizing Business-Critical Applications
- This IDG whitepaper highlights key findings based on the Quickpoll Survey conducted with more than 300 Enterprise and Commercial IT decision makers worldwide...
- Top 10 Myths About Virtualizing Business-Critical Applications
- Even though virtualization has brought positive change to enterprise IT over the last decade, some skepticism remains about how valuable virtualization can be...
- Enterprise Java Applications on VMware: Unix to Linux Migration Guide
- This guide focuses on key considerations for IT Architects who are in the process of migrating Java applications from UNIX to Linux as...
- Virtualizing Tier 1 Applications: A Critical Step on the Journey Toward the Private Cloud
- This IDC white paper explains how much of the Enterprise IT community is at a crossroads in extending their journey to the private... All Applications White Papers
- Live Webcast
Banish Poor Application Performance: Eliminate Business Disruptions, Increase End User Productivity - End User Experience, 30-Min Webinar
Wed. Feb. 22nd ~ 11 AM ET
Are you ready to gain the proactive ability to rapidly respond... - Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
- Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
- Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
- Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
- Virtualize Business-Critical Applications with Confidence
- Virtualizing business-critical applications has become a key focus for organizations as they move along their virtualization journey. With the launch of VMware vSphere®...
- Discover the Benefits of Virtualization for Federal Applications
- Want to say goodbye to missed SLAs? VMware can help you virtualize mission-critical applications such as Oracle, MS Exchange and SharePoint to achieve...
- Reduce Application Lifecycle Management Costs with VMware ThinApp
- Traditional desktop application deployment and management is a time-consuming and costly endeavor for IT. From development to deployment, including help desk support, the... All Applications Webcasts