SNW: IT managers put data dedupe at the top of their future tech list
Being able to address data management through consolidation is key
Computerworld - PHOENIX -- IT managers interviewed at Storage Networking World here this week said the key technology in their near future is data deduplication, though how they would implement that technology differed from person to person.
Most managers said their data silos have grown to the point where they're becoming difficult to manage, and growth over the next several years is expected to be exponential. Data deduplication would offer them significant relief in that it could drastically reduce capacity requirements and costs by allowing them to use their storage assets more effectively, they said.
J. Travis Martin, CIS infrastructure services manager for Lawrence Livermore National Laboratory in Livermore, Calif., said his operation manages 750TB of data that will soon be growing to more than a petabyte.
Martin said Lawrence Livermore uses Data Domain appliances to deduplicate its backup data, but he wants to move to a technology that performs deduplication globally, across geographically dispersed nodes.
Martin is considering deduplication vendor Exagrid Systems Inc. in Westborough, Mass., which uses byte-level data de-duplication on a grid architecture.
"That's what tips Exagrid over the top for us, global dedupe across nodes," said Eric Ghere, a systems architect with Lawrence Livermore.
Martin said he would like to get any data deduplication technology as close to the source of data as possible versus deploying it as he does today, as part of the data backup stream.
Brett Michalak, CIO at online ticket retailer Tickets.com, said data deduplication and WAN optimization technology would help him deal with about 100TB of virtualized storage capacity on arrays from 3Par Inc. Through virtualization, Michalak said he is able to provision storage on the fly and keep up with changing customer service level agreements, but that doesn't address growing bandwidth requirements.
"I'm looking at those two primarily because as we start rolling out more assets globally, and the fact that our data will be distributed ... the impact on our networks is going to grow," he said. "I think deduplication will be necessary for us for obvious reasons - for backup and recovery."
Michalak sees in his company's future the complete elimination of tape-based backup and a move toward nearline disk-based storage.
"It's just a waste of time in my opinion. The time it takes to retrieve that data and bring it back takes too long," he said.
Mark Saussure, director of digital library infrastructure for Penn State University, said his 160TB of disk-based data is expected to grow exponentially over the next few years. To address that growth, he has been rolling out the eXtensible Access Method (XAM), a specification developed by the Storage Networking Industry Association that will help him not only to automate backup across tiers of storage, but also allow anyone to search silos and retrieve data through the use of standardized meta data.
"Information silos, if not controlled, will outstrip our ability to manage the objects in them," he said. "The demand is just phenomenal. We can't continue to manage the silos the way we've it done for years."
Saussure hopes to go live with a gateway appliance in front of his backend disk storage that will automatically populate data with standardized meta data that will in turn assist him with data routing, meta data extraction for reporting, data retention and give him low-level search capabilities.
Saussure also hopes to deploy a grid-based storage architecture that will assist him in seamlessly moving data objects around his various storage silos.
Read more about Storage in Computerworld's Storage Topic Center.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Datacenter Consolidation Best Practices Whitepaper
- The benefits of storage consolidation are being realized by companies and seen as a way to streamline many storage-driven applications. Learn why the...
- Eliminating VMware / Storage Related Performance Challenges
- How to proactively monitor the performance in a Fibre Channel SAN / vSphere environment is always a concern. Understand the importance of a...
- Cloud Environments Have Familiar Storage Challenges
- Cloud environments have many storage challenges that are familiar to data center managers, but due to their density and abstraction, the issues become...
- Eight Considerations for Evaluating Disk-Based Backup Solutions
- In the past, the movement from tape- to disk-based backup has been less compelling due to the expense of storing backup data on...
- ExaGrid Helps U.S. Federal Government Agencies Reduce Backup Windows and Improve Data Protection
- The U.S. Government has been the largest user of tape-based backup systems since the 1970s. Most agencies have begun to deploy disk storage... All Storage White Papers
- Understand Your Data: The Future of Backup and Archiving
- Archiving and Backup are the foundation of the next generation of information governance. However, commodity data protection tools and basic archives are only...
- Optimizing Networks for the Cloud
- Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
- Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
- Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
- Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
- Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
- Customer Spotlight: How IPC The Hospitalist Company Implemented Oracle on VMware
- Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn... All Storage Webcasts