Computerworld

IBM releases unstructured data framework code as open-source

The company hopes to spur wider compatibility for complex text analytics apps
 


January 23, 2006 (Computerworld) -- IBM today released the source code for its Unstructured Information Management Architecture (UIMA) to encourage independent software vendors (ISVs) to use the framework for the creation of complex, enterprise-ready text analytics applications on a standards-based platform.
In an announcement today, IBM said the UIMA code is now available as an open-source development project on SourceForge.net.
While traditional content and knowledge management applications today allow users to search for terms, they don't allow searches for concepts or relationships between words in documents, Web sites or other text, said Marc Andrews, a spokesman for content discovery strategy and business development at IBM. Complex text analytics applications from various vendors do provide that kind of analysis, but plugging them into existing search applications can be difficult because of code compatibility issues, he said.
The idea behind UIMA is to give developers a standards-based platform for creating specialized text analysis applications, which users can then tie into the search applications of their choice. UIMA defines a common, standard interface that enables text analytics components from multiple vendors to work together.
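The plug-and-play idea can be illustrated with a minimal sketch. This is not UIMA's actual API; the names Document, Annotation, Annotator, KeywordAnnotator and run_pipeline are hypothetical and chosen only for illustration. The assumption is that each vendor's component implements one shared process() method and writes its findings to a shared document object, so components can be chained without custom integration code.

```python
from dataclasses import dataclass, field
from typing import List, Protocol


@dataclass
class Annotation:
    kind: str   # label assigned by a component, e.g. "drug"
    begin: int  # start offset in the text
    end: int    # end offset (exclusive)


@dataclass
class Document:
    # Shared container that every component reads from and writes to.
    text: str
    annotations: List[Annotation] = field(default_factory=list)


class Annotator(Protocol):
    # Hypothetical common interface; any vendor's component would implement it.
    def process(self, doc: Document) -> None: ...


class KeywordAnnotator:
    """Toy component: labels every occurrence of a fixed set of terms."""

    def __init__(self, kind: str, terms: List[str]) -> None:
        self.kind = kind
        self.terms = [t.lower() for t in terms]

    def process(self, doc: Document) -> None:
        lowered = doc.text.lower()
        for term in self.terms:
            start = lowered.find(term)
            while start != -1:
                doc.annotations.append(Annotation(self.kind, start, start + len(term)))
                start = lowered.find(term, start + 1)


def run_pipeline(doc: Document, annotators: List[Annotator]) -> Document:
    # Components run in order; none needs to know about the others.
    for annotator in annotators:
        annotator.process(doc)
    return doc


doc = run_pipeline(
    Document("Aspirin was given for migraine."),
    [KeywordAnnotator("drug", ["aspirin"]), KeywordAnnotator("disease", ["migraine"])],
)
found = [(a.kind, doc.text[a.begin:a.end]) for a in doc.annotations]
```

In this sketch a search application would only need to read doc.annotations, regardless of which component produced each entry.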
"Customers [have] had to do the integrations themselves because there are no interfaces" between proprietary text analysis applications and search products, Andrews said. "They've had to custom-tie them together," which is often difficult and costly. "UIMA enables them to tie these things together more easily, providing plug-and-play in a common language."
Last August, IBM announced that more than 15 ISVs, including SAS Institute Inc., Cognos Inc., ClearForest Corp. and Attensity Corp., had pledged to support UIMA in their text analytics and search products (see "IBM releases open analytics interface"). IBM also introduced its own offering, IBM WebSphere Information Integrator OmniFind Edition, which is based on UIMA.
Text analytics can comb through documents, comment and note fields, problem reports, e-mail, Web sites and other text-based information sources, according to IBM, which worked on the development of UIMA for more than four years.
Several medical institutions are using UIMA to help organize huge amounts of unstructured data that could be useful in medical research, according to IBM.
The Mayo Clinic is using it to help extract and collect data from some 20 million clinical notes in medical records that will be used for research and to improve patient treatments. The Memorial Sloan-Kettering Cancer Center is extracting data on cancer treatments from its records to search for new cancer treatments.
In addition, the International Federation of Pharmaceutical Manufacturers and Associations, a worldwide industry body that represents pharmaceutical companies, recently deployed a portal of clinical trial information that uses the UIMA framework with IBM's OmniFind application to identify medical terms and concepts. That allows doctors, pharmacists, researchers and others to search by disease area or medicine names. The tool even recognizes synonyms across multiple languages. The portal will be used to bring together content from a number of existing clinical trial registries and databases, allowing doctors and patients to review summarized results and find trials they can join, according to IBM.



