Computerworld - Definition: Predictive analytics is the branch of data mining concerned with forecasting probabilities. The technique uses variables that can be measured to predict the future behavior of a person or other entity. Multiple predictors are combined into a predictive model. In predictive modeling, data is collected to create a statistical model, which is tweaked as additional data becomes available.
Predictive analytics is a set of mathematical techniques applied to a data set for determining the probability that some scenario is likely to happen or be true. These techniques are applied to many research areas, including meteorology, genetics and marketing — areas in which there’s an abundance of data and a need to forecast the future.
Cross-selling, upselling, determining customer profitability and promoting customer loyalty are the best-known uses of this technology, according to a report by Forrester Research Inc. analyst Lou Agosta. But there are many other applications, he notes, including credit scoring, predicting machine failures and making the supply chain more efficient.
Plenty of high-level mathematics are involved, but stated simply, predictive analytics is used to ask which characteristics, called predictors, in a data set are clustered together. The technique is also used to determine whether, given a set of predictors, the value for some other characteristic is likely to fall within a desired range.
Though these two questions sound very similar, in practice, they’re quite different. The first one, the search for clustered characteristics, is like saying, “Look through my data??base of information and find something about my business that I overlooked or might not already know.” You might look through the history of people who have declared bankruptcy to find which characteristics are most tightly linked together: late payments, number of addresses within the past two years, recent divorce or health problems, for example.
The second question, determining whether a particular characteristic falls within a desired range, is like saying, “Given what I know about a customer, find out how likely it is that something else is true.” For example, you might want to analyze the characteristics of a person filing an insurance claim to determine the likelihood that the claim is false. The predictors could be how recently he filed his last claim, the dollar amount of that claim or how long the customer has had the policy.
The two approaches work together. Once linked characteristics have been identified, then the second question can be asked. After an insurance company has found which characteristics are most tightly linked to fraud, for example, it can create an equation that produces a number indicating how likely it is that a particular claim is fraudulent.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Thinking Outside The Data Warehouse
- This high level, business problem focused eBook uses 5 customer scenarios to show how people and organizations are tackling real issues using IBM...
- Using BD for Smarter Decision Making
- This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.
- Measuring the Business Value of CI in the Data Center
- One of the key strategies that IT teams are pursuing to reduce capital costs while boosting asset utilization and employee productivity is the...
- Switching Schedulers - Not As Complicated As You Think
- Changing or consolidating job schedulers may seem daunting. However, the benefits of switching to enterprise workload automation outweigh the risks. Read how BMC...
- Capture-Enabled Business Process Management
- Organizations today must deal with a vast amount of incoming information from many different sources. Efficient, automated business processes are critical to managing... All BI and Analytics White Papers
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's... All BI and Analytics Webcasts