
Subscribe to
Computerworld
or
Other Business Intelligence Stories
|
October 14, 2002 (Computerworld) -- After just two months, a new software tool enabled Aventis Pharmaceuticals Inc. to discover a promising candidate for a new drug to treat asthma, arthritis or even perhaps cancer; it's a chemical compound that might well have been overlooked using traditional IT tools.
Aventis is using DiscoveryLink, a feature of IBM's DB2 database management system that can propel a single SQL query out to multiple, heterogeneous data sources and bring information back to the user in one coherent view.
"Using this integrated framework, scientists were able to pull data from many different sources around the world, visualize it in a new way that they could never do before," says Peter Loupos, vice president for drug innovation and approval information systems at the Bridgewater, N.J.-based company.
IBM calls the Aventis approach to information integration "database federation." To get at federated data, DB2 uses IBM "wrapper" software called DataJoiner and Relational Connect. There's one wrapper for each type of data sourcewhether it's from Oracle Corp. or Sybase Inc. systems, Microsoft Corp. SQL Server or flat fileswith each one mapping the source data model to the DB2 data model. A single Aventis query may be sent against heterogeneous relational databases, unstructured documents and in-house expertise culled from e-mail and other sources.
Sending a SQL query against remote, heterogeneous databases is just one of several ways to integrate data. Others include the following:
Regardless of the approach taken, data integration can be difficult, expensive and error-prone. In particular, great care must be taken to build interfaces between applications and databases that ensure accuracy and timeliness of information and that answer the needs of disparate communities of end users. Below, we look at how two organizations tackled their data integration problems, and you can read two more case studies online .
Case 1: Standardization Is Key at Affina
While it supports all of the various methods of integrating data, Oracle favors loading data into centralized databases and data warehouses. And that's the approach taken by Oracle customer Affina, a contact-center service provider in Peoria, Ill.
Affina, which has about 3TB of data in Oracle databases, runs call centers for about 100 clients. That requires taking data from telephone calls, e-mail, Web chat and various back-end processing systems and feeding it into an "operational data store," a data warehouse and several data marts, says Tom Asp, vice president for IT. Affina also pushes informationsuch as call analysisback out to customers. For efficiency and security, data coming in and going out is staged in customer-specific tables that can be accessed by customers.
Affina uses Oracle Warehouse Builder, Oracle's SQL Loader utility, custom PL/SQL procedures and .Net assemblies in ETL processes to pull in and reformat customer data from Lotus Notes e-mail databases, call records from telephone switches, flat files from interactive voice response units and client data from a system that emulates mainframe CICS. "Any way we could get data, we get it," Asp says.
Affina also uses Oracle8i replication utilities to move data in real time from a big call center in El Paso, Texas, to its data warehouse in Peoria. The Peoria copy is used for reporting, but the production database remains in El Paso for performance reasons.
Automation and standardization are the keys to making such a complex operation run smoothly, Asp says. File transfers, schedulers and loaders are automated, and strict standards are adhered to so new processes can be added easily and without fear of bugs. He says Affina is increasingly moving to the exchange of information in XML format because "it provides a common, flexible file format so that ETL procedures can be created more efficiently."
Case 2: Database Dilemma at Terracon
Terracon Inc., an engineering consulting firm in Lenexa, Kan., uses an Oracle enterprise resource planning (ERP) system and an Oracle data warehouse. A two- to three-hour weekly process uses scripts that Terracon wrote in PL/SQL to move data from the operational ERP data to the warehouse, where it is used for reporting. Not all informationcustomer and employee names, for exampleis replicated to the warehouse. Terracon was unable to use Oracle's replication product, Streams, because the two repositories used different versions of Oracle, according to CIO Frank Milano.
Milano says he'd like to convert the data warehouse to Microsoft's SQL Server database. He says that with his existing Microsoft enterprise agreement he could do that for an additional $2,000 to $3,000 and save $30,000 to $50,000 in Oracle license fees. The ERP system would remain Oracle, however, because it's the best fit for Terracon's project accounting and financial management needs, he says.
And Milano says total cost of ownership with SQL Server would be lower than with Oracle. "Trying to maintain an Oracle database is brutal," he says. "It's complex, not for the faint of heart." But, he concedes, Oracle is better suited for large, complex, dynamic database applications.
But Milano says he worries that interfacing the Oracle and SQL Server databases may be tricky. In addition to the weekly transfer of production data from the ERP system to the warehouse, users must be able to initiate queries against the warehouse and have them go to the ERP database to pick up customer and employee names, and queries against the ERP system must be able to go to the warehouse for information that has already been calculated and summarized. "That's a great concern for us," he says. "I don't have a solution for it."
But Microsoft claims it does. According to Tom Rizzo, group product manager for SQL Server, Terracon could move data from the Oracle ERP system to the SQL Server data warehousedoing any needed editing or massaging en routeusing the Data Transformation Services tools that ship with SQL Server. And to access both databases in a single query, Terracon could use the Distributed Query engine in SQL Server, Rizzo says.
|
|
|
Print this Story |
|
Send Us Feedback |
|
E-mail this Story |
|
Digg this Story |
|
Slashdot this Story |
|
|
|
|
|
|
|
|
All Zones Application Performance Zone Business Continuity Zone Data Center Management Zone Enterprise-Class Security Zone The File Data Management Zone Grid Computing on Windows Zone Security Management Zone ITIL Best Practices Zone The SAS Zone Storage Virtualization Zone Business Intelligence and Analytics Zone |
|
|
| ||||||||
| ||||||||
| ||||||||
|


SAS Business and Analytics ZoneNo matter where your organization is on the path toward evolving your IT infrastructure, SAS can adapt to your situation to meet your long-term enterprise intelligence needs. We can help you drive intelligence evolution to the next level, while leveraging and extending the value of your existing IT investments. Learn more in the Business and Analytics Zone See All Zones
|
Intercept Spam & Viruses With MessageLabs MessageLabs is offering a complimentary 30 day trial of its managed Anti-virus and Anti-spam security solutions. MessageLabs guarantees complete protection against all know and unknown email threats. By providing 24 hour support, your business can increase productivity and decrease risk. Register for a complimentary trial and receive a free datasheet.Download this white paper now!
|

| XenServer FREE trial Citrix XenServer is the simplest and most effective way to virtualize and provision servers. XenServer combines comprehensive server virtualization capabilities with unparalleled scalability, performance, economics, and ease-of-use. Based on the open source Xen hypervisor, XenServer delivers fast performance, easy management, and advanced features such as live migration. |

Since You AskedA weekly storage column from storage analyst, Steve Duplessie of the Enterprise Strategy Group |
|
SAS Information Management Kit
SAS is the leader in business intelligence and analytical software and services. Only SAS offers leading data integration, storage, analytics and business intelligence applications within a comprehensive enterprise intelligence platform. SAS gives 97 of the top 100 companies in the 2007 Fortune 500 THE POWER TO KNOW®. |
Webcast: The Information Management Roadmap
Imagine high-quality data, cleansed, analyzed and delivered throughout your organization. Join Computerworld, IT visionary Thornton May and a panel of experts to learn how SAS® can help you make it happen. View this webcast
|
Research Report: Information Management Initiatives at Midsize and Large Organizations
See the top-line results of this Computerworld sponsored survey to see how IT and business leaders are handling information management implementation.
Download this report
|
White Paper: Information Management: Better Information for Winning Decisions.
This white paper explains how the SAS Information Evolution Model aids companies in assessing how they use this information to make strategic decisions and drive business.
Download this white paper
|