Ads by TechWords

See your link here
Receive the latest technology news and information.
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Database Horizons

A new wave of technology promises to simplify administration and bring together heterogeneous information.

August 5, 2002 12:00 PM ET

Computerworld - The modern database era began in 1970, when E.F. Codd published his paper "A Relational Model of Data for Large Shared Data Banks." His ideas enabled the logical manipulation of data to be independent of its physical location, greatly simplifying the work of application developers.


Now we are poised for another leap forward. Databases will scale to gargantuan proportions, span multiple locations and maintain information in heterogeneous formats. And they will be autonomous and self-tuning. The major database vendors are pursuing these goals in different ways.


Thirty years ago, IBM researcher Pat Selinger invented "cost-based" query optimization, by which searches against relational databases such as IBM's DB2 minimized computer resources by finding the most efficient access methods and paths. Now Selinger, vice president of data management architecture and technology, is leading an effort at IBM called Leo—for Learning Optimizer—that she says will push DB2 optimization into a new realm.












IBM researcher Pat Selinger
IBM researcher Pat Selinger

Rather then optimizing a query once, when it's compiled, Leo will watch production queries as they run and fine-tune them as it learns about data relationships and user needs. "It empirically derives interesting things about the data," Selinger says. For example, Leo would come to realize that a ZIP code can be associated with only one state, or that a Camry is made only by Toyota, even if those rules aren't specified in advance.


Selinger says Leo will be most helpful in large and complex databases, and in databases where interdata relationships exist but aren't explicitly declared by database designers. Leo is likely to be included in commercial releases of DB2 in about three years, she says.


Microsoft Corp. says users will never be persuaded to dump everything—e-mail, documents, audio/video, pictures, spreadsheets and so on—into one gigantic database. Therefore, the software vendor is developing technology that will allow a user to seamlessly reach across multiple, heterogeneous data stores with a single query.












Jennifer Widom, a computer science professor at Stanford
Jennifer Widom, a computer science professor at Stanford

Microsoft's Unified Data project involves three steps, says Stan Sorensen, director of SQL Server. First, the company will devise "schema" based on XML that define data types. Then it will develop methods for relating different data types to each other and finally develop a common query mechanism for distributed databases. For example, Sorensen says, "Suppose I search for a document that references Microsoft, and the document 'tells' the query that there's also a media file in another place that references Microsoft."


The technology will appear in 18 months in SQL Server. It will be added to other Microsoft products in ensuing years.



Jump to comments

Databases

Additional Resources

WHITE PAPER
Approximately 60 percent of data migration projects overrun time or budget, while some fail completely. Download this white paper, "Enhancing Your Chance for Successful Data Migration," to learn the critical steps you need to take to execute a data migration project with minimum cost and risk to your business.
WHITE PAPER
Read the Gartner research note to learn why the TCO of a server-based computing deployment used to deliver all applications to users is around 50% lower than that of an unmanaged desktop deployment.
WHITE PAPER
Economic downturns have a tendency to accelerate emerging technologies, boost the adoption of effective solutions, and punish solutions that are not cost competitive or that are out of synch with industry trends. This IDC White Paper presents the results of an IDC survey of 330 companies in Western Europe, Asia/Pacific and the Americas that measures the receptiveness to Linux and takes into consideration changing views driven by the disruptive economic environment that businesses face today.
 

SAS Information Management Kit

SAS is the leader in business intelligence and analytical software and services. Only SAS offers leading data integration, storage, analytics and business intelligence applications within a comprehensive enterprise intelligence platform. SAS gives 97 of the top 100 companies in the 2007 Fortune 500 THE POWER TO KNOW®.

Webcast: The Information Management Roadmap
Imagine high-quality data, cleansed, analyzed and delivered throughout your organization. Join Computerworld, IT visionary Thornton May and a panel of experts to learn how SAS® can help you make it happen.

View this webcast 
Research Report: Information Management Initiatives at Midsize and Large Organizations
See the top-line results of this Computerworld sponsored survey to see how IT and business leaders are handling information management implementation.

Download this report 
White Paper: Information Management: Better Information for Winning Decisions.
This white paper explains how the SAS Information Evolution Model aids companies in assessing how they use this information to make strategic decisions and drive business.

Download this white paper