Sidebar: Text Mining Glossary
Computerworld - Text miners use a variety of approaches to extract and present relevant information. Below are definitions of common methods:
Categorization - Presenting search results in categories rather than as an undifferentiated mass.
Clustering - Grouping similar documents based on their content.
Extraction - Pulling relevant information out of a document; for example, identifying all the company names in a data set.
Keyword search - Searching documents for the occurrence of a particular word or set of words.
Natural-language processing - Determining the meaning of written words, taking into account their context, grammar, colloquialisms and so on.
Taxonomy - Categorization of data according to a predefined framework, either industry-standard or customized. Some tools can automatically generate a taxonomy based on analysis of the data store.
Visualization - Graphically presenting the mined data so relationships are easier to spot and understand.
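To make two of the definitions above concrete, here is a minimal Python sketch of keyword search and extraction. It is an illustration only, not how any particular text-mining product works: the sample documents, the function names and the company-name rule (capitalized words followed by "Corp." or "Inc.") are all assumptions made for this example.

```python
import re

# Hypothetical sample documents, invented for illustration only.
DOCS = [
    "Acme Corp. reported higher sales of mining tools.",
    "Text mining helps analysts sift through reports.",
    "Globex Inc. and Acme Corp. signed a data deal.",
]


def keyword_search(docs, words):
    """Return the indexes of documents that contain every given word.

    Matching is case-insensitive and on whole words only.
    """
    hits = []
    for i, doc in enumerate(docs):
        tokens = set(re.findall(r"[a-z0-9]+", doc.lower()))
        if all(w.lower() in tokens for w in words):
            hits.append(i)
    return hits


# Naive extraction rule (an assumption): one or more capitalized words
# followed by "Corp." or "Inc." is treated as a company name.
COMPANY = re.compile(r"\b((?:[A-Z][a-z]+ )+(?:Corp|Inc)\.)")


def extract_companies(docs):
    """Return the sorted, de-duplicated company names found in the documents."""
    found = set()
    for doc in docs:
        found.update(COMPANY.findall(doc))
    return sorted(found)


if __name__ == "__main__":
    print(keyword_search(DOCS, ["mining"]))
    print(extract_companies(DOCS))
```

A pattern this simple will miss most real company names and will match some strings that are not companies; production extraction tools typically rely on dictionaries or natural-language processing rather than a single regular expression.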