Database Horizons

A new wave of technology promises to simplify administration and bring together heterogeneous information.

August 5, 2002 12:00 PM ET

Computerworld - The modern database era began in 1970, when E.F. Codd published his paper "A Relational Model of Data for Large Shared Data Banks." His ideas enabled the logical manipulation of data to be independent of its physical location, greatly simplifying the work of application developers.


Now we are poised for another leap forward. Databases will scale to gargantuan proportions, span multiple locations and maintain information in heterogeneous formats. And they will be autonomous and self-tuning. The major database vendors are pursuing these goals in different ways.


Thirty years ago, IBM researcher Pat Selinger invented "cost-based" query optimization, by which searches against relational databases such as IBM's DB2 minimized computer resources by finding the most efficient access methods and paths. Now Selinger, vice president of data management architecture and technology, is leading an effort at IBM called Leo—for Learning Optimizer—that she says will push DB2 optimization into a new realm.
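The idea behind cost-based optimization can be illustrated with a minimal sketch. This is not DB2's optimizer: the `AccessPath` type, the cost weights and the candidate numbers are all hypothetical, chosen only to show an optimizer estimating the resources each access path would use and picking the cheapest.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessPath:
    """One hypothetical way to execute a query (illustrative only)."""
    name: str
    estimated_io: float   # assumed unit: pages read
    estimated_cpu: float  # assumed unit: rows processed


def estimated_cost(path, io_weight=1.0, cpu_weight=0.01):
    # Weighted sum of estimated resources; the weights are assumptions.
    return io_weight * path.estimated_io + cpu_weight * path.estimated_cpu


def choose_cheapest(paths):
    # Pick the access path with the lowest estimated cost.
    if not paths:
        raise ValueError("no candidate access paths")
    return min(paths, key=estimated_cost)


# Made-up estimates for two ways of answering the same query.
candidates = [
    AccessPath("full table scan", estimated_io=10_000, estimated_cpu=1_000_000),
    AccessPath("index lookup", estimated_io=120, estimated_cpu=3_000),
]

best = choose_cheapest(candidates)
print(best.name)
```

A real optimizer derives its estimates from catalog statistics and weighs many more plans; the sketch shows only the final comparison.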












IBM researcher Pat Selinger

Rather than optimizing a query once, when it's compiled, Leo will watch production queries as they run and fine-tune them as it learns about data relationships and user needs. "It empirically derives interesting things about the data," Selinger says. For example, Leo would come to realize that a ZIP code can be associated with only one state, or that a Camry is made only by Toyota, even if those rules aren't specified in advance.
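The ZIP code example describes what database designers call a functional dependency. As a rough sketch of how such a rule could be derived empirically from observed rows (the article does not describe Leo's actual method, and the function name and sample data here are hypothetical), one could check whether every value of one column always appears with the same value of another:

```python
from collections import defaultdict


def holds_functional_dependency(rows, determinant, dependent):
    """Return True if each determinant value maps to one dependent value.

    This checks only the rows observed so far, so a True result is
    evidence of a rule rather than proof of it.
    """
    seen = defaultdict(set)
    for row in rows:
        seen[row[determinant]].add(row[dependent])
    return all(len(values) == 1 for values in seen.values())


# Hypothetical sample of rows seen while queries run.
rows = [
    {"zip": "10001", "state": "NY", "city": "New York"},
    {"zip": "10001", "state": "NY", "city": "New York"},
    {"zip": "94305", "state": "CA", "city": "Stanford"},
    {"zip": "90001", "state": "CA", "city": "Los Angeles"},
]

zip_determines_state = holds_functional_dependency(rows, "zip", "state")
state_determines_zip = holds_functional_dependency(rows, "state", "zip")
print(zip_determines_state, state_determines_zip)
```

An optimizer that knows a ZIP code implies a state can avoid treating the two columns as independent when it estimates how many rows a query will return.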


Selinger says Leo will be most helpful in large and complex databases, and in databases where interdata relationships exist but aren't explicitly declared by database designers. Leo is likely to be included in commercial releases of DB2 in about three years, she says.


Microsoft Corp. says users will never be persuaded to dump everything—e-mail, documents, audio/video, pictures, spreadsheets and so on—into one gigantic database. Therefore, the software vendor is developing technology that will allow a user to seamlessly reach across multiple, heterogeneous data stores with a single query.












Jennifer Widom, a computer science professor at Stanford

Microsoft's Unified Data project involves three steps, says Stan Sorensen, director of SQL Server. First, the company will devise XML-based "schemas" that define data types. Then it will develop methods for relating different data types to each other, and finally it will develop a common query mechanism for distributed databases. For example, Sorensen says, "Suppose I search for a document that references Microsoft, and the document 'tells' the query that there's also a media file in another place that references Microsoft."
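The notion of one query spanning several data stores can be sketched in a few lines. This is purely illustrative and assumes nothing about Microsoft's design: the `Item` record stands in for a shared schema, the two in-memory stores stand in for separate repositories, and `search_all` is a hypothetical common query function.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A common record shape shared by every store (an assumption)."""
    store: str
    kind: str
    title: str
    text: str


def search_all(stores, term):
    # Run the same case-insensitive text match against every store.
    needle = term.lower()
    return [
        item
        for items in stores.values()
        for item in items
        if needle in item.text.lower()
    ]


# Hypothetical stores holding different kinds of data.
stores = {
    "documents": [
        Item("documents", "document", "Q3 memo", "Notes on the Microsoft licensing renewal"),
        Item("documents", "document", "Lunch menu", "Soup and salad"),
    ],
    "media": [
        Item("media", "video", "Keynote clip", "Microsoft keynote recording"),
    ],
}

hits = search_all(stores, "Microsoft")
print([(hit.store, hit.title) for hit in hits])
```

In practice the hard part is the first two steps Sorensen describes, agreeing on schemas and on how data types relate; once those exist, a single query function can treat every store alike.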


The technology will appear in 18 months in SQL Server. It will be added to other Microsoft products in ensuing years.


