QuickStudy: Predictive Analytics
Computerworld - Definition: Predictive analytics is the branch of data mining concerned with forecasting probabilities. The technique uses variables that can be measured to predict the future behavior of a person or other entity. Multiple predictors are combined into a predictive model. In predictive modeling, data is collected to create a statistical model, which is tweaked as additional data becomes available.
Predictive analytics is a set of mathematical techniques applied to a data set for determining the probability that some scenario is likely to happen or be true. These techniques are applied to many research areas, including meteorology, genetics and marketing — areas in which there’s an abundance of data and a need to forecast the future.
Cross-selling, upselling, determining customer profitability and promoting customer loyalty are the best-known uses of this technology, according to a report by Forrester Research Inc. analyst Lou Agosta. But there are many other applications, he notes, including credit scoring, predicting machine failures and making the supply chain more efficient.
Plenty of high-level mathematics are involved, but stated simply, predictive analytics is used to ask which characteristics, called predictors, in a data set are clustered together. The technique is also used to determine whether, given a set of predictors, the value for some other characteristic is likely to fall within a desired range.
Though these two questions sound very similar, in practice, they’re quite different. The first one, the search for clustered characteristics, is like saying, “Look through my data??base of information and find something about my business that I overlooked or might not already know.” You might look through the history of people who have declared bankruptcy to find which characteristics are most tightly linked together: late payments, number of addresses within the past two years, recent divorce or health problems, for example.
The second question, determining whether a particular characteristic falls within a desired range, is like saying, “Given what I know about a customer, find out how likely it is that something else is true.” For example, you might want to analyze the characteristics of a person filing an insurance claim to determine the likelihood that the claim is false. The predictors could be how recently he filed his last claim, the dollar amount of that claim or how long the customer has had the policy.
The two approaches work together. Once linked characteristics have been identified, then the second question can be asked. After an insurance company has found which characteristics are most tightly linked to fraud, for example, it can create an equation that produces a number indicating how likely it is that a particular claim is fraudulent.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- X-Ray of the PCI Process-4 Proactive Steps
- This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
- Forrester: Economic Impact of Switching to Google Apps
- Content provided by Google
Read this Forrester report on the "total economic impact" of Google Apps, and learn how switching to Google Apps creates... - Intelligent Systems: Unlocking Hidden Business Value with Data
- An intelligent system enables data to flow across an enterprise infrastructure, spanning the devices where valuable data is gathered from employees and customers,...
- Concepts of NonStop SQL/MX
- For DBAs and developers who are familiar with Oracle solutions and want to learn about NonStop SQL/MX, this whitepaper provides an overview of...
- HP Advanced Information Services for SAP In-Memory Appliance (SAP HANA)
- Organizations are eager to connect the vast amounts of data available within and outside their businesses to compete more effectively and make better... All BI and Analytics White Papers
- Quantifying the Business Value of VMware View - Webcast
- Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price...
- Good to Great - How to Take Business Analytics to the Next Level
- By attending this webcast you will learn how you can implement an effective BA strategy that will deliver maximum strategic value to your...
- Supporting Mobile Productivity With A Limited IT Budget
- Join us and hear from Kaseya mobile IT management experts as we discuss core strategies for supporting the mobile revolution on a shoestring...
- User Experience Monitoring
- In this webinar, you will learn hints & tips for improving end-user response times from Forrester Research analyst, Jean-Pierre Garbani.
- Hints & Tips Cisco
- Overwhelmed by tracking your Vblock, Flexpod or Cisco UCS performance? Spend one hour with Nimsoft to learn how you can eliminate the overhead... All BI and Analytics Webcasts