Best practices for implementing disk-to-disk backup: Part 2
Computerworld - In this article, we continue our series on the best disk-to-disk backup strategies and wrap up our discussion on the challenges associated with software-based disk-to-disk backup.
As discussed in the previous article, software-based disk-to-disk backup creates some potential implementation and operation issues. We also noted that over time, backup software providers will resolve most of these issues.
The one area that backup software providers won't be able to address, however, is the file system itself: specifically, file system size, fragmentation and sharing. Unlike tape, a disk must have a file system installed on it before backup software can use it as a backup destination. Ideally, this file system is large enough to hold the entire disk backup. If you have 10TB of backups, you'd like to make a 10TB file system.
The trouble with file systems
Many file systems cannot support anything close to this size, either in practice or in theory. In fact, a 2TB file system is considered large. Consequently, if we have 10TB of backup data and we can create only a 2TB file system, we will have to create five file systems. Each of these file systems must be independently managed and monitored, and more file systems must be created as the backup data set grows.
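The arithmetic behind that count is a simple ceiling division. The short Python sketch below is only an illustration: the function name `file_systems_needed` and its parameters are hypothetical, and it assumes every file system can be filled to its maximum size, which a real deployment would rarely allow.

```python
import math


def file_systems_needed(backup_tb: float, max_fs_tb: float) -> int:
    """Return how many file systems are required to hold backup_tb of data.

    Assumes (for illustration only) that each file system can be filled
    completely, with no space reserved for overhead or growth.
    """
    if backup_tb < 0 or max_fs_tb <= 0:
        raise ValueError("backup size must be >= 0 and file system size > 0")
    # Round up: a partly used file system still has to be created and managed.
    return math.ceil(backup_tb / max_fs_tb)


# The example from the text: 10TB of backups on 2TB file systems.
print(file_systems_needed(10, 2))
```

Because the result is rounded up, growing the backup set from 10TB to 11TB would already require a sixth file system to manage and monitor.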
With disk-to-disk backup, fragmentation results from saving backup jobs to the backup disk area. These jobs vary in size: they grow, shrink and are eventually deleted when you migrate them to tape. This variation over time causes fragmentation. Because the backup-to-disk process consists of nothing but file changes and deletions, fragmentation develops faster and more severely than in other applications. All operating systems suffer from this problem, which can be solved only by using a disk defragmenter. However, running a disk defragmenter is very processor-intensive and unpopular with system administrators. With a multiple-terabyte file system, a defragmentation job can run for days.
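To see how writing and deleting variable-size jobs breaks up free space, consider the toy model below. It is a deliberately simplified sketch, not how any particular file system allocates space: the disk is a list of 20 blocks, jobs are placed in the first free blocks found, and the names `write_job`, `delete_job` and `free_extents` are hypothetical.

```python
def free_extents(disk):
    """Return the length of each contiguous run of free (None) blocks."""
    runs, length = [], 0
    for block in disk:
        if block is None:
            length += 1
        elif length:
            runs.append(length)
            length = 0
    if length:
        runs.append(length)
    return runs


def write_job(disk, job_id, size):
    """Place a job in the first free blocks; return how many fragments it uses."""
    if disk.count(None) < size:
        raise ValueError("not enough free space for job " + job_id)
    fragments, remaining, prev_written = 0, size, False
    for i, block in enumerate(disk):
        if remaining == 0:
            break
        if block is None:
            disk[i] = job_id
            remaining -= 1
            if not prev_written:
                fragments += 1  # a new, non-adjacent piece of the job starts here
            prev_written = True
        else:
            prev_written = False
    return fragments


def delete_job(disk, job_id):
    """Free every block of a job, e.g. after it has been migrated to tape."""
    for i, block in enumerate(disk):
        if block == job_id:
            disk[i] = None


disk = [None] * 20
for job_id, size in (("A", 6), ("B", 4), ("C", 6)):
    write_job(disk, job_id, size)   # each job lands in one contiguous piece

delete_job(disk, "B")               # job B is migrated to tape and removed
gaps_after_delete = free_extents(disk)

fragments_d = write_job(disk, "D", 6)   # job D no longer fits in a single gap
print(gaps_after_delete, fragments_d)
```

In this model, deleting job B leaves two separate 4-block gaps, so the 6-block job D has to be split across both of them. Repeating that cycle nightly is what scatters backup jobs across the disk.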
Another issue with a standard file system is that it cannot be shared. In a tape storage-area network (SAN) environment, multiple servers (even with different operating systems) can access the same tape library at the same time. This works because each backup server using the same backup software writes its data stream to its own dedicated tape drive. The same is not true of a disk backup target on a standard file system: multiple servers cannot share the same disk destination at the same time. With a standard disk file system, each server performing backups needs its own file system.

