QuickStudy: Bayesian Logic And Filters
Computerworld -
Some say that if you can't measure something, you're not doing science. Bayesian logic offers a way to measure things that were previously unmeasurable, allowing us to test hypotheses and predictions and thereby refine our conclusions and decisions. Bayesian filtering is a hot topic in the area of spam control today.
Basic probability is simple to calculate, because you're dealing with a limited number of factors and possibilities. Let's consider a horse race with 10 horses entered. If that's the only information we have on which to base a wager, then we could pick any horse on the basis that its chance of winning is 1 in 10, or 0.10. Take that kind of math to the track, however, and you'll quickly be separated from the contents of your wallet. The real world is far more complicated, and here's where Bayesian logic comes into the picture.
In fact, each of the 10 horses has already run at least a few races and therefore has a history. If Lightning has won every race he has entered, and Thunder has lost every one he has entered, then we've got a real evidential basis on which to bet on Lightning instead of on Thunder.
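One way to see how a racing record shifts the 1-in-10 starting figure is the rough sketch below. It assumes a Beta(1, 9) prior, chosen only because its mean is 0.10, and treats each past race as an independent win-or-lose trial. The function name and the five-race records for Lightning and Thunder are hypothetical, not taken from the article.

```python
from fractions import Fraction


def posterior_win_rate(wins: int, races: int,
                       prior_wins: int = 1, prior_losses: int = 9) -> Fraction:
    """Posterior mean win probability under a Beta(prior_wins, prior_losses) prior.

    With the default Beta(1, 9) prior, a horse with no history gets 1/10,
    matching the naive 1-in-10 estimate for a 10-horse field.
    """
    if races < 0 or not 0 <= wins <= races:
        raise ValueError("wins must be between 0 and races")
    return Fraction(wins + prior_wins, races + prior_wins + prior_losses)


# Hypothetical records: Lightning has won 5 of 5, Thunder has lost 5 of 5.
no_history = posterior_win_rate(wins=0, races=0)
lightning = posterior_win_rate(wins=5, races=5)
thunder = posterior_win_rate(wins=0, races=5)

print(f"No history: {no_history}")
print(f"Lightning:  {lightning}")
print(f"Thunder:    {thunder}")
```

The estimate for an unraced horse stays at the baseline, while each recorded win or loss pulls the figure up or down. A longer record would move it further from 0.10.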
There's also a lot more information available about every horse in the race. We know, or can easily find out, the following:
Lineage: Is this horse the offspring of a champion? How have his brothers and sisters performed?
Performance under different weather conditions: If it rains in the morning and the track is soft, how does that affect his speed?
Position on the track: Is our horse next to the rail or on the outside? And how does the horse react when he's in that position?
Length of time since last race: If the horse ran a long, hard race yesterday, how well is he likely to run today?
Distance of today's race: How has the horse fared at this distance in the past?
Other people's betting patterns also come into play. They don't affect how well a horse will perform, but they have a clear impact on the size of the payoff if he does win.
All of this information can help us make a better estimate of our horse's chance of winning than the simplistic 1 out of 10. Analyzing these factors is a Bayesian process: we start from the baseline estimate and revise it each time a new piece of evidence comes in.
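The revision step can be sketched with Bayes' rule: each horse's new probability is its prior multiplied by how well the evidence fits that horse winning, then rescaled so the field still sums to 1. The example below is only an illustration. The horse names other than Lightning and Thunder, the choice of evidence (a soft track) and every likelihood value are made-up assumptions.

```python
def bayes_update(prior: dict[str, float],
                 likelihood: dict[str, float]) -> dict[str, float]:
    """Return posterior P(horse wins | evidence) from priors and likelihoods.

    likelihood[h] is P(evidence | horse h wins); values here are hypothetical.
    """
    unnormalized = {horse: prior[horse] * likelihood[horse] for horse in prior}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("evidence is impossible under every hypothesis")
    return {horse: weight / total for horse, weight in unnormalized.items()}


# Ten horses, each starting at the naive 1-in-10 estimate.
horses = ["Lightning", "Thunder"] + [f"Horse {n}" for n in range(3, 11)]
prior = {horse: 0.10 for horse in horses}

# Hypothetical evidence: the track is soft. Assume Lightning runs well on a
# soft track, Thunder runs poorly, and the rest of the field is indifferent.
likelihood = {horse: 0.5 for horse in horses}
likelihood["Lightning"] = 0.9
likelihood["Thunder"] = 0.1

posterior = bayes_update(prior, likelihood)
for horse in ("Lightning", "Thunder", "Horse 3"):
    print(f"{horse}: {posterior[horse]:.2f}")
```

Further evidence, such as post position or days since the last race, could be folded in by calling `bayes_update` again with the previous posterior as the new prior. Doing so assumes the pieces of evidence are independent of one another, which is a simplification.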
Similar things are happening in the world of Major League Baseball -- ever the province of voluminous statistical records. Team owners and general managers are using Bayesian analysis when they study the way players perform under various conditions and in specific situations and factor that information into their decisions about the players they want to draft or seek in trades.