XML Storage: Oracle Should Be Hearing Footsteps
Computerworld - Twenty-four years ago, I raised a furor in the database management systems industry. As a rookie analyst -- a stock analyst, no less -- I argued that the then-dominant hierarchical/network data architectures should and would be replaced by "index-based" systems. Over the next few years, I was proved right, as inverted-list and relational products took over the DBMS market.
Recently, I've argued a contrasting position: XML-based data architectures should and will get an important IT role in applications where tabular databases don't do a great job. Thus, I think that IBM's and Microsoft's more-or-less native XML storage systems will be more than niche curiosities, and Oracle will soon have to offer a worthy competitor.
There are three basic parts to the argument:
1. There are applications for which XML offers a superior logical architecture to SQL. These fall into two groups. First, there are apps in traditional categories -- CRM, SCM and so on -- that don't have naturally concise relational schemas. We can say that the natural schema is highly variable, or we can say that the overarching schema that takes this variability into account is horrifically complex. Either way, stuffing these apps into a relational straitjacket causes a lot of unnecessary grief.
Second, there are apps that deal with new kinds of complex, dynamic documents. Before XML, either these documents didn't exist at all or their processing couldn't be fully automated.
2. For many of these applications, native XML storage is more efficient than traditional relational storage. Before Microsoft's and IBM's recent announcements, there were two ways to store XML in a relational database. First, since an XML document is a string of characters, you could stick it in a Clob, or Character Large Object. But updating or retrieving specific data values inside the Clob is very inefficient; you basically have to process the whole document.
Alternatively, the XML can be "shredded" into a series of relational tables. But that can make for some very complex updates and joins. So for documents that have complex structures, neither approach is appealing. Native storage is a superior alternative.
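To make the two pre-native options concrete, here is a minimal sketch in Python. It uses SQLite as a stand-in for any relational database, and the sample order document, table names and function names are all hypothetical, chosen only for illustration. The first pair of functions treats the document as a Clob, so changing one value means reading, parsing and rewriting the whole document. The second pair shreds the same document into two tables, so reading it back already needs a join.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this illustration.
ORDER_XML = (
    '<order id="1001">'
    '<customer>Acme</customer>'
    '<item sku="A1" qty="2"/>'
    '<item sku="B7" qty="5"/>'
    '</order>'
)


def store_as_clob(conn, doc_id, xml_text):
    """Option 1: keep the whole document as one character column."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs (id INTEGER PRIMARY KEY, body TEXT)"
    )
    conn.execute("INSERT INTO docs VALUES (?, ?)", (doc_id, xml_text))


def clob_update_qty(conn, doc_id, sku, qty):
    """Changing one value requires fetching and reparsing the full document."""
    (body,) = conn.execute(
        "SELECT body FROM docs WHERE id = ?", (doc_id,)
    ).fetchone()
    root = ET.fromstring(body)
    for item in root.iter("item"):
        if item.get("sku") == sku:
            item.set("qty", str(qty))
    # The entire document is written back, not just the changed value.
    conn.execute(
        "UPDATE docs SET body = ? WHERE id = ?",
        (ET.tostring(root, encoding="unicode"), doc_id),
    )


def shred(conn, xml_text):
    """Option 2: split the document across relational tables."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (id INTEGER PRIMARY KEY, customer TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items (order_id INTEGER, sku TEXT, qty INTEGER)"
    )
    root = ET.fromstring(xml_text)
    order_id = int(root.get("id"))
    conn.execute(
        "INSERT INTO orders VALUES (?, ?)", (order_id, root.findtext("customer"))
    )
    conn.executemany(
        "INSERT INTO items VALUES (?, ?, ?)",
        [(order_id, i.get("sku"), int(i.get("qty"))) for i in root.iter("item")],
    )
    return order_id


def shredded_order(conn, order_id):
    """Reassembling even this tiny document already takes a join."""
    return conn.execute(
        "SELECT o.customer, i.sku, i.qty FROM orders o "
        "JOIN items i ON i.order_id = o.id WHERE o.id = ? ORDER BY i.sku",
        (order_id,),
    ).fetchall()
```

With only two levels of nesting the shredded form is still manageable. Each additional level or optional element in a real document would typically mean another table and another join, which is the complexity described above.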
3. XML storage won't have the same drawbacks that hierarchical/network products did. Hierarchical systems failed because reusing data in multiple apps was too difficult. Today, however, RDBMS vendors integrate XML and relational storage. You can access XML documents through SQL and your tables through XQuery. "Native" storage really is just a performance issue.
Admittedly, some technical problems are still unresolved. The industry hasn't even agreed upon, let alone implemented, a reasonably


