Addicted to data
Computerworld - In his recent State of the Union speech, the president asserted that we're "addicted to oil." At the risk of stepping on a land mine, I'm guessing that this is a statement that most people would agree with regardless of their political persuasion. Well, the same can be said for data. We love all its shapes and forms and want to keep it all forever -- we're addicted!
Unlike oil, the problem isn't a shrinking supply -- quite the opposite. We are smothering in its abundance. Data storage continues double-digit growth rates, and while per-unit costs continue to fall, our appetite outpaces the decline. Even more significant are the ongoing multiyear costs to manage, support and protect the data that consumes this storage.
It isn't the creation of new data that is at the heart of the data management problem, but the mountains of data we retain and never eliminate that is overwhelming our capacity and putting enormous strain on tasks like backup and disaster recovery. Data archiving and expiration policies and processes are missing elements of data management in most organizations. As a consequence, huge quantities of the data sitting on expensive storage systems and consuming thousands of tapes is old, infrequently accessed information and may very well have outlived its value to the organization.
There are at least two reasons for this. First, organizations often lack an authoritative voice to say when a particular piece of data can be expired. Second, the actual process of deleting or even archiving data is usually very difficult. There are often application-specific concerns that make it difficult or impossible to identify and separate the data wheat from the chaff.
Most current data-archiving activities focus on e-mail, a well-understood application with existing archiving tools that can be implemented relatively easily. Other applications are not as straightforward. A basic prerequisite is some degree of application data classification -- a daunting notion when one considers the thousands of applications found in large organizations. But some organizations are finding that by starting with a few select applications and setting practical goals, significant savings can be realized. Don't develop an elaborate classification scheme. Establish a few basics rules, identify where they apply, and quantify the potential cost savings. The approach is to focus on finding the big nuggets rather than the flakes of gold.
Jim Damoulakis is chief technology officer at GlassHouse Technologies Inc., a leading provider of independent storage services. He can be reached at jimd@glasshouse.com.
Read more about Storage in Computerworld's Storage Topic Center.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Datacenter Consolidation Best Practices Whitepaper
- The benefits of storage consolidation are being realized by companies and seen as a way to streamline many storage-driven applications. Learn why the...
- Eliminating VMware / Storage Related Performance Challenges
- How to proactively monitor the performance in a Fibre Channel SAN / vSphere environment is always a concern. Understand the importance of a...
- Cloud Environments Have Familiar Storage Challenges
- Cloud environments have many storage challenges that are familiar to data center managers, but due to their density and abstraction, the issues become...
- Eight Considerations for Evaluating Disk-Based Backup Solutions
- In the past, the movement from tape- to disk-based backup has been less compelling due to the expense of storing backup data on...
- ExaGrid Helps U.S. Federal Government Agencies Reduce Backup Windows and Improve Data Protection
- The U.S. Government has been the largest user of tape-based backup systems since the 1970s. Most agencies have begun to deploy disk storage... All Storage White Papers
- Understand Your Data: The Future of Backup and Archiving
- Archiving and Backup are the foundation of the next generation of information governance. However, commodity data protection tools and basic archives are only...
- Optimizing Networks for the Cloud
- Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
- Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
- Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
- Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
- Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
- Customer Spotlight: How IPC The Hospitalist Company Implemented Oracle on VMware
- Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn... All Storage Webcasts