
IBM releases unstructured data framework code as open source

The company hopes to spur wider compatibility for complex text analytics apps

January 23, 2006 12:00 PM ET

Computerworld - IBM today released the source code for its Unstructured Information Management Architecture (UIMA) to encourage independent software vendors (ISVs) to use the framework to create complex, enterprise-ready text analytics applications on a standards-based platform.
In an announcement today, IBM said the UIMA code is now available as an open-source development project on SourceForge.net.
While traditional content and knowledge management applications today allow users to search for terms, they don't allow searches for concepts or relationships between words in documents, Web sites or other text, said Marc Andrews, a spokesman for content discovery strategy and business development at IBM. Complex text analytics applications from various vendors do provide that kind of analysis, but plugging them into existing search applications can be difficult because of code compatibility issues, he said.
The idea behind UIMA is to give developers a standards-based platform for creating specialized text analysis applications, which users can then tie into the search applications of their choice. UIMA defines a common, standard interface that enables text analytics components from multiple vendors to work together.
"Customers [have] had to do the integrations themselves because there are no interfaces" between proprietary text analysis applications and search products, Andrews said. "They've had to custom-tie them together," which is often difficult and costly. "UIMA enables them to tie these things together more easily, providing plug-and-play in a common language."
Last August, IBM announced that more than 15 ISVs, including SAS Institute Inc., Cognos Inc., ClearForest Corp. and Attensity Corp., had pledged to support UIMA in their text analytics and search products (see "IBM releases open analytics interface"). IBM also introduced its own offering, IBM WebSphere Information Integrator OmniFind Edition, which is based on UIMA.
Text analytics can comb through documents, comment and note fields, problem reports, e-mail, Web sites and other text-based information sources, according to IBM, which worked on the development of UIMA for more than four years.
Several medical institutions are using UIMA to help organize huge amounts of unstructured data that could be useful in medical research, according to IBM.
The Mayo Clinic is using it to help extract and collect data from some 20 million clinical notes in medical records that will be used for research and to improve patient treatments. The Memorial Sloan-Kettering Cancer Center is extracting data on cancer treatments from its records to search for new cancer treatments.
In addition, the International Federation of Pharmaceutical Manufacturers and Associations, a worldwide industry body that represents pharmaceutical companies, recently deployed a portal of clinical trial information that uses the UIMA framework with IBM's OmniFind application to identify medical terms and concepts. That allows doctors, pharmacists, researchers and others to search by disease area or medicine name. The tool even recognizes synonyms across multiple languages. The portal will be used to bring together content from a number of existing clinical trial registries and databases, allowing doctors and patients to review summarized results and find trials they can join, according to IBM.


