Data Finds a Place on the Grid
Vendor support and standards are just evolving, but companies are looking to share data across grids.
Computerworld - The data grid has been playing second fiddle to the compute grid when it comes to media attention. But companies and public institutions searching for better ways to share and manage large amounts of data are beginning to take notice.
A compute grid allows users to take the computing resources in a distributed and heterogeneous environment, manage those disparate resources as one and focus them on problem solving.
A data grid acts in a similar way. It has a middleware layer and metadata framework to give users a centralized view of distributed data without physically centralizing the data.
That means the data can be located on Windows, Unix or Linux systems running multiple formats. It can be structured or unstructured and can consist of different media types. A data grid and a compute grid can operate togetherthe principles are the same.
But there are limits to what a grid can do. A grid, for instance, doesn't offer a means for discovering and categorizing unstructured data. What the data grid provides is a standards-based framework for interconnecting that information once those tasks are addressed.
Data-grid technology is in the early-adopter phase, drawing the interest of research institutions with large and scattered data repositories, such as Pfizer Global Research & Development in Groton, Conn., and the University of Arkansas Center for Advanced Spatial Studies in Fayetteville, as well as research consortiums such as the European Union's DataGrid project, led by particle physics research center CERN.
Data grids will find broader applications as standards mature and technology problems, such as managing security in a grid's distributed environment, are solved, say analysts and users.
"I think the whole promise of grid is pretty exciting," says Paul Lewis, director of research information architecture at Pfizer. But more work is needed, he adds.
Seeking Support
Products that support data in a grid environment are emerging. For example, Pfizer uses Avaki Inc.'s data-grid software. The Center for Advanced Spatial Studies takes advantage of the grid capabilities in Oracle 10g, Oracle Corp.'s flagship database.
But the very concept of grids involves interconnectedness among disparate applications and data sources. Until vendors include standards-based grid capabilities, interfaces and processes in their products, data-grid adoptions are going to be limited.
"Vendors have got to step up and say, 'We're going to make our products grid-enabled,' " says Lewis. "If more vendors grid-enable products, it makes our job easier, because then we can plug in more computers when we need more capacity."
Emerging data-grid products, such as Avaki's, are being used within companies. But some of the leading thinkers behind the data-grid effort imagine developing systems that connect large numbers of enterprises, entire supply chains and customer bases.
"The equivalent of the Internet Protocol for remote access to data is still a work in progress," says Ian Foster, senior scientist and head of the Distributed Systems Lab at Argonne National Laboratory in Illinois and co-director of the grid standards effort at the Globus Alliance.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- X-Ray of the PCI Process-4 Proactive Steps
- This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
- Forrester: Economic Impact of Switching to Google Apps
- Content provided by Google
Read this Forrester report on the "total economic impact" of Google Apps, and learn how switching to Google Apps creates... - Intelligent Systems: Unlocking Hidden Business Value with Data
- An intelligent system enables data to flow across an enterprise infrastructure, spanning the devices where valuable data is gathered from employees and customers,...
- Concepts of NonStop SQL/MX
- For DBAs and developers who are familiar with Oracle solutions and want to learn about NonStop SQL/MX, this whitepaper provides an overview of...
- HP Advanced Information Services for SAP In-Memory Appliance (SAP HANA)
- Organizations are eager to connect the vast amounts of data available within and outside their businesses to compete more effectively and make better... All BI and Analytics White Papers
- Quantifying the Business Value of VMware View - Webcast
- Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price...
- Good to Great - How to Take Business Analytics to the Next Level
- By attending this webcast you will learn how you can implement an effective BA strategy that will deliver maximum strategic value to your...
- Supporting Mobile Productivity With A Limited IT Budget
- Join us and hear from Kaseya mobile IT management experts as we discuss core strategies for supporting the mobile revolution on a shoestring...
- User Experience Monitoring
- In this webinar, you will learn hints & tips for improving end-user response times from Forrester Research analyst, Jean-Pierre Garbani.
- Hints & Tips Cisco
- Overwhelmed by tracking your Vblock, Flexpod or Cisco UCS performance? Spend one hour with Nimsoft to learn how you can eliminate the overhead... All BI and Analytics Webcasts