Skip the navigation

Object-based storage for Linux clusters

By Garth Gibson, co-founder and CTO, Panasas Inc.
October 30, 2003 12:00 PM ET

Computerworld - Linux cluster computing has transformed the architecture of high-performance computing applications. High-cost supercomputers are being replaced by low-cost Linux clusters to solve the most challenging computing problems. To complement the performance potential of these Linux compute clusters, a new storage paradigm is needed. Object-based storage clustering is the foundation for a new class of storage systems that scale in capacity and performance to meet the demands of the most powerful Linux-based clusters.


For years, high-performance cluster computing has delivered solutions to the world's most challenging technical computing problems. More recently, these successes have been replicated in high-performance commercial applications using Linux clusters. Geophysicists are developing more capable seismic-analysis techniques to create images of the Earth's substructure and guide oil-field drilling and extraction operations. Pharmaceutical companies mine massive genomic data sets to provide better insight into human disease and develop more effective therapies. And Internet portals such as Yahoo Inc. and Google Inc. index and serve the content of the Internet.


An increasing appetite for shared storage performance


In addition to hefty computational requirements, these applications are characterized by high-performance I/O needs. Rapid access to shared data sets, often multiple terabytes in size, is critical for ensuring optimal use of compute cluster assets. Without it, already scant resources sit idle. These data sets need to be made globally available to all processes executing on the compute cluster in order to simplify development and systems management activities. Traditional networked storage systems are incapable of providing the necessary performance to serve the aggressive shared-access requirements of these expanding clusters.


For example, animation-rendering applications distribute scene generation tasks to hundreds of cluster compute nodes—each generating an individual frame of the final segment. Shared-scene and character information and per-frame rendering instructions must be accessed by each of the participating compute nodes, and each node generates as much as 50MB of output per frame. The individual frames are then sequenced and assembled into their final form for review. This is a common data-access scenario across many cluster computing applications.


Shortcomings of traditional shared storage


The natural inclination of cluster computing developers is to deploy shared storage that can be accessed by all nodes in the cluster. However, standard shared-storage technologies provided by file servers built from direct-attached storage are only sufficient for small clusters. Larger clusters require more scalable storage. Storage-area networks (SAN) and optimized network-attached storage (NAS) architectures have been employed for modest-sized clusters, however, these architectures have severe limitations as clusters become larger. Neither SAN nor NAS architectures support the aggressive concurrency and high per-client throughput requirements of these cluster computing applications.



Additional Resources
Forrester Consulting - Optimizing Users and Applications in a Mobile World
WHITE PAPER
Solving application issues over the WAN requires careful consideration. Based on their independent research, Forrester Consulting offers recommendations on how to tackle application performance issues, insufficient bandwidth and the inability to quickly restore users in a disaster.

Read now.

Security KnowledgeVault
WHITE PAPER
Security is not an option. This KnowledgeVault Series offers professional advice how to be proactive in the fight against cybercrimes and multi-layered security threats; how to adopt a holistic approach to protecting and managing data; and how to hire a qualified security assessor. Make security your Number 1 priority.

Read now.

Cut Communications Costs Once and for All
WHITE PAPER
New IP-based communications systems are being deployed by small and midsized businesses at a rapid rate. Learn how these organizations are enabling faster responsiveness, creating better customer experiences, speeding office or mobile interactions, and dramatically reducing existing communications costs.

Read now.

Storage White Papers
Datacenter Consolidation Best Practices Whitepaper
The benefits of storage consolidation are being realized by companies and seen as a way to streamline many storage-driven applications. Learn why the...
Eliminating VMware / Storage Related Performance Challenges
How to proactively monitor the performance in a Fibre Channel SAN / vSphere environment is always a concern. Understand the importance of a...
Cloud Environments Have Familiar Storage Challenges
Cloud environments have many storage challenges that are familiar to data center managers, but due to their density and abstraction, the issues become...
Eight Considerations for Evaluating Disk-Based Backup Solutions
In the past, the movement from tape- to disk-based backup has been less compelling due to the expense of storing backup data on...
ExaGrid Helps U.S. Federal Government Agencies Reduce Backup Windows and Improve Data Protection
The U.S. Government has been the largest user of tape-based backup systems since the 1970s. Most agencies have begun to deploy disk storage...
All Storage White Papers
Storage Webcasts
Understand Your Data: The Future of Backup and Archiving
Archiving and Backup are the foundation of the next generation of information governance. However, commodity data protection tools and basic archives are only...
Optimizing Networks for the Cloud
Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
Customer Spotlight: How IPC The Hospitalist Company Implemented Oracle on VMware
Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn...
All Storage Webcasts
Newsletter Sign-Up

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all newsletters | Privacy Policy
IT Jobs