Open-source spam-blocker gets high marks at Cornell
Cornell's CIO said the antispam tool is 99% effective in blocking unwanted e-mail
Computerworld - When the academic year begins this fall, students at Cornell University's Johnson Graduate School of Management will be armed with what its CIO sees as a powerful new weapon to battle spam.
For the past two months, the school's IT organization has been beta-testing an open-source tool called the SpamBayes Outlook Plug-in and is preparing for a broad rollout.
The SpamBayes tool blocks spam using a unique form of statistical analysis that's far more efficient and customizable than any commercially available antispam product, according to Larry Fresinski, the school's CIO.
"It's been extraordinarily effective," he said. "It catches 99% of my spam." Fresinski said he has contacted 20 other business schools to inform them about the technology.
The university has been testing the SpamBayes Outlook Plug-in with Microsoft Corp.'s Outlook XP, Outlook 2003 Beta and an Exchange 2000 server. Cornell's management school is a beta tester of Outlook 2003, which, like other e-mail products, comes with its own antispam technology. As a tester of SpamBayes, the Ithaca, N.Y.-based school has recommended the approach to Microsoft, Fresinski said.
SpamBayes is the name of an open-source project working to develop an antispam filter based on Bayesian theory, a method of statistical analysis.
The approach is different from traditional antispam technologies that use predefined rules to look for specific features or words in mail headers and body text to identify unsolicited mail. Many of these technologies also use blacklists to block mail from certain addresses.
The problem with such approaches is that they rely on a predefined and general description of spam and not on a user-specific definition of the term, Fresinski said.
SpamBayes first analyzes a user's legitimate e-mail and spam mail for clues as to what makes each different. It then applies those clues to the headers, content and style of incoming messages to determine whether they are spam.
The greater the number of initial samples and the broader the variety, the more quickly Bayesian filters can be "trained" to recognize spam, said Brian Burton, president of Burton Computer Corp., a consultancy in LaVale, Md. The company has developed an open-source tool called SpamProbe, which uses similar techniques to block spam.
"That is one of the weaknesses of this approach," Burton said. "You've got to get it to a point where it can start making the right decisions."
Although SpamBayes won't prevent Cornell's mail servers from getting spammed, it will allow end users to weed out spam more effectively, Fresinski said. So far, there hasn't been one instance in which the software has stopped legitimate mail from getting through or failed to stop spam, he said.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Practice Management: Double Billing Rate and Improve Patient Services
- Would you like to double your billing rate and achieve faster payment for services?
Download this customer success story to see how One Health... - Mission Critical Data Explosion and Customer Case Study
- Would you like to double your tier 1 storage capacity while simultaneously reducing your storage footprint?
Download this customer success story to see how... - Protecting Against Database Attacks and Insider Threats: Top 5 Scenarios
- Read this new eBook to learn the top five scenarios and essential best practices for preventing database attacks and insider threats.
- Database Activity Monitoring Is Evolving
- Read the analyst report and learn how you can leverage the core capabilities of a DAP solution for better database security.
- Establishing a Strategy for Database Security is No Longer Optional
- The options for securing increasingly valuable databases are very broad and deep, and can be confusing. This research provides an overview of three... All Desktop Apps White Papers
- Distributed Database Security with Real-time Monitoring
- View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - Redefine Expectations in the Data Center
- Need to do more with less? Watch this video to learn how HP ProLiant Gen8 servers can help your business deploy servers three... All Desktop Apps Webcasts