Captchas: Computer Tests Can Defeat Spam
Ingenious computer tests may also advance machine vision and AI.
Computerworld - On the Internet, nobody knows you're a dog. Or a rogue robot program stealthily gathering personal information from chat rooms or registering for thousands of free e-mail accounts from which to blast out spam.
One way to stymie such bots is to use a captcha. Short for "completely automatic public Turing test to tell computers and humans apart," a captcha is a program that can generate and grade tests that are easy for humans to solve but very difficult for computers to crack.
Examples include words that have been precisely distorted by computers, images overlaid with other images or audio clips with background noise.
By including a captcha as part of the registration process for a free e-mail account, for instance, it would be relatively easy to establish whether the registrant is a human or a robot program.
"The human visual system and all of our experience in reading makes it possible to read images of text which computer vision systems at their best cannot do reliably," explains Henry Baird, a principal scientist at Palo Alto Research Center Inc. (PARC) in California.
The concept of using programs like captchas to deal with bots and spam on the Internet has been around since 1997. A team of researchers at what was then Digital Equipment Corp. was working on a way to deal with bots that were trying to influence the way certain sites were ranked on the company's AltaVista search engine. Researchers at the company developed and patented a character-recognition test that was used during the AltaVista registration process to weed out automated programs.
In September 2000, Pittsburgh-based Carnegie Mellon University's computer science department started developing similar programs in response to a request from Yahoo Inc.
Like AltaVista, Yahoo was grappling with rogue programs that were invading its chat rooms and illegally marketing products, stealing personal information and spamming users. "The idea was to create a computer program that could distinguish bots from humans. The program would have to serve as a sentry, but it couldn't itself pass the very test it gives," says Manuel Blum, a professor of computer science at Carnegie Mellon.
The result was Gimpy, a captcha containing seven words chosen at random from a dictionary of 850 words and then distorted and overlaid with clutter via software. Passing the test required identifying at least three of the distorted words correctly.
A simpler one-word version of Gimpy, called E-Z Gimpy, is currently used by Yahoo on its Web site to weed out humans from bots during the registration process.
Meanwhile, researchers at the University of Hong Kong are working on a captcha that overlays audio clutter on top of a voice reading out random numbers and letters.
- 15 Non-Certified IT Skills Growing in Demand
- How 19 Tech Titans Target Healthcare
- Twitter Suffering From Growing Pains (and Facebook Comparisons)
- Agile Comes to Data Integration
- Slideshow: 7 security mistakes people make with their mobile device
- iOS vs. Android: Which is more secure?
- 11 sure signs you've been hacked
- What Datapipe customers need to know about the new PCI DSS 3.0 compliance standard This handy quick reference outlines what PCI DSS 3.0 is, who needs to be compliant and how Alert Logic solutions address the new...
- The 12 PCI DSS 3.0 requirements addressed by Peer 1 Hosting This handy quick reference outlines the 12 PCI DSS 3.0 requirements, who needs to be compliant and how Alert Logic solutions address the...
- Defense Throughout the Vulnerability Life Cycle This whitepaper provides insight into how to leverage threat and log management technologies to protect your IT assets throughout their vulnerability life cycle.
- The Critical Role of Support in Your Enterprise Mobility Management Strategy Most business leaders underestimate the importance of tech support when they choose an EMM solution. Here's what to put on your checklist.
- Live Webcast Best Practices for the Hyperconverged Enterprise Network To the Age of Constant Connectivity and Information overload
- Live Webcast Unmasking the Differences between Consumer and Enterprise File Sync & Share The consumerization of IT combined with the rapid pace of the modern mobile workplace is forcing enterprise IT teams to evaluate file sync...
- Live Webcast Government Agency Webifies Outdated COBOL Applications Let this CTO tell you how his agency converted 1980s-era green screens into an e-filing portal for the 100,000 cases handled each year...
- The New Way to Work Knowledge Vault This Knowledge Vault focuses on how, in today's increasingly virtual world, it's more important than ever to engage deeply with employees, suppliers, partners,...
- Getting Ready for BlackBerry Enterprise Service 10.2 Find out how BlackBerry® Enterprise Service 10 helps organizations address the full spectrum of EMM challenges, while balancing the needs of both the... All Applications White Papers | Webcasts