QuickStudy: Bayesian Logic And Filters
Computerworld - Listen to the Computerworld TechCast: Bayesian Logic
Some say that if you can't measure something, you're not doing science. Bayesian logic offers a way to measure things that were previously unmeasurable, allowing us to test hypotheses and predictions and thereby refine our conclusions and decisions. Bayesian filtering is a hot topic in the area of spam control today.
Basic probability is simple to calculate, because you're dealing with a limited number of factors and possibilities. Let's consider a horse race with 10 horses entered. If that's the only information we have on which to base a wager, then we could pick any horse on the basis that its chance of winning is 1 in 10, or 0.10. Take that kind of math to the track, however, and you'll quickly be separated from the contents of your wallet. The real world is far more complicated, and here's where Bayesian logic comes into the picture.
In fact, each of the 10 horses has already run at least a few races and therefore has a history. If Lightning has won every race he has entered, and Thunder has lost every one he has entered, then we've got a real evidential basis on which to bet on Lightning instead of on Thunder.
In fact, there's a lot more information available about every horse in the race. We know or can easily find out the following:
Lineage: Is this horse the offspring of a champion? How have his brothers and sisters performed?
Performance under different weather conditions: If it rains in the morning and the track is soft, how does that affect his speed?
Position on the track: Is our horse next to the rail or on the outside? And how does the horse react when he's in that position?
Length of time since last race: If the horse ran a long, hard race yesterday, how well is he likely to run today?
Distance of today's race: How has the horse fared at this distance in the past?
Other people's betting patterns also come into play. They don't affect how well a horse will perform, but they have a clear impact on the size of the payoff if he does win.
All of this information can help us make a better estimate of our horse's chance of winning than the simplistic 1 out of 10. Analyzing these factors is a Bayesian process.
Similar things are happening in the world of Major League Baseball -- ever the province of voluminous statistical records. Team owners and general managers are using Bayesian analysis when they study the way players perform under various conditions and in specific situations and factor that information into their decisions about the players they want to draft or seek in trades.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Overcome Top 7 Admin Challenges of Active Directory
- As Active Directory's role in the enterprise has drastically increased, so has the need to secure the data. Gain insight on creating repeatable,...
- Insiders Can Ruin Your Company. Take Action.
- Did you know that 80 percent of threats to an organization come from the inside? The threat from insiders is often overlooked in...
- Top Solutions and Tools to Prevent Devastating Malware
- Custom malware frequently goes undetected. According to Forrester Research, the best way to reduce risk of breach is to deploy file integrity monitoring...
- X-Ray of the PCI Process-4 Proactive Steps
- This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
- Identity Governance: The Business Imperatives
- This white paper describes the business challenges and opportunities that are driving interest in Identity Governance while discussing considerations your organization should make... All Security White Papers
- Live Webcast
Playing Defense: Staying on Top of Your Disaster Recovery Game - When it comes to disaster recovery, rapidly growing data volumes, distributed computing models, and new technologies all combine to present an ever-changing playing...
- Introduction to VMware vCenter Site Recovery Manager 5
- Traditional disaster recovery solutions are often too expensive, complex and unreliable to meet business requirements. As a result, IT departments are hesitant to...
- The Top Ten Secrets to Avoiding SAN Performance Problems
- Maintaining peak performance while simultaneously addressing the root cause of SAN errors is challenging. Learn the most common SAN problems and explore new...
- Deduplication Without Compromise
- Go inside Quantum's scalable, high-performance, multi-protocol new DXi deduplication appliances, designed to make backup much more effective. Discover how the new future-proof DXi6700...
- Director of Disk Products Discusses DXi6700
- Discover how the new DXi 6700 series of deduplication appliances provide investment protection and a future-proof feature set, all while delivering fast, scalable,...
- Playing Defense: Staying on Top of Your Disaster Recovery Game
- When it comes to disaster recovery, rapidly growing data volumes, distributed computing models, and new technologies all combine to present an ever-changing playing... All Security Webcasts