Ads by TechWords

See your link here
Subscribe to our e-mail newsletters
For more info on a specific newsletter, click the title. Details will be displayed in a new window.
Storage
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
More E-Mail Newsletters 
 

Object-based storage for Linux clusters

October 30, 2003 12:00 PM ET

Computerworld - Linux cluster computing has transformed the architecture of high-performance computing applications. High-cost supercomputers are being replaced by low-cost Linux clusters to solve the most challenging computing problems. To complement the performance potential of these Linux compute clusters, a new storage paradigm is needed. Object-based storage clustering is the foundation for a new class of storage systems that scale in capacity and performance to meet the demands of the most powerful Linux-based clusters.


For years, high-performance cluster computing has delivered solutions to the world's most challenging technical computing problems. More recently, these successes have been replicated in high-performance commercial applications using Linux clusters. Geophysicists are developing more capable seismic-analysis techniques to create images of the Earth's substructure and guide oil-field drilling and extraction operations. Pharmaceutical companies mine massive genomic data sets to provide better insight into human disease and develop more effective therapies. And Internet portals such as Yahoo Inc. and Google Inc. index and serve the content of the Internet.


An increasing appetite for shared storage performance


In addition to hefty computational requirements, these applications are characterized by high-performance I/O needs. Rapid access to shared data sets, often multiple terabytes in size, is critical for ensuring optimal use of compute cluster assets. Without it, already scant resources sit idle. These data sets need to be made globally available to all processes executing on the compute cluster in order to simplify development and systems management activities. Traditional networked storage systems are incapable of providing the necessary performance to serve the aggressive shared-access requirements of these expanding clusters.


For example, animation-rendering applications distribute scene generation tasks to hundreds of cluster compute nodes—each generating an individual frame of the final segment. Shared-scene and character information and per-frame rendering instructions must be accessed by each of the participating compute nodes, and each node generates as much as 50MB of output per frame. The individual frames are then sequenced and assembled into their final form for review. This is a common data-access scenario across many cluster computing applications.


Shortcomings of traditional shared storage


The natural inclination of cluster computing developers is to deploy shared storage that can be accessed by all nodes in the cluster. However, standard shared-storage technologies provided by file servers built from direct-attached storage are only sufficient for small clusters. Larger clusters require more scalable storage. Storage-area networks (SAN) and optimized network-attached storage (NAS) architectures have been employed for modest-sized clusters, however, these architectures have severe limitations as clusters become larger. Neither SAN nor NAS architectures support the aggressive concurrency and high per-client throughput requirements of these cluster computing applications.



Additional Resources

POLL RESULTS
Accelerate your knowledge of the IT world you inhabit by viewing the results of a series of polls taken by your IT peers. These polls of 100+ IT professionals each are available for full viewing. They cover key topics such as virtualization, processor performance, green IT, cloud computing and many others. Be a part of the buzz.
WHITE PAPER
Technology is complex. Keeping it running productively shouldn't be. To that end, you want to minimize the number of solutions needed in-house to simplify operations, maintenance, and support. Kodak offers a best-practices model. One company provides support for both scanner and software, for fast problem resolution without vendor finger-pointing. Download now!
WHITE PAPER
Utilizing demand intelligence improves the precision of pricing, product assortments, channel/store placement, and promotion, which are all essential for sustainable revenue management performance. Learn more, download this free whitepaper today.

White Papers & Webcasts

Speeding business innovation with HP Data Center Transformation solutions
Data center transformation enables your IT organization to focus more on business priorities and innovation by decreasing spending on maintenance and management by...  

Four Principles for Reducing Storage TCO
(Source: Hitachi Data Systems) Difficult economic times require new strategies for reducing costs. Where storage technology and economics meet, there are...

HP Data Center Transformation Solutions
CIOs today are challenged to respond to economic and business pressures, to change from being cost centers to becoming strategic business enablers. There...  

Boost your CAE productivity, and break-away from the pack
(Source: Sun) Join Clemson University as they present their groundbreaking engineering simulations research at their Computational Center for Mobility Systems. Dr. James Leylek,...

Using Symark PowerBroker™ to Enrich Your Organization's RBAC Model
The essential notion of Role-Based Access Control (RBAC) for IT security administration is establishing permissions based on the functional roles within the enterprise,...  

Deduplication and Other Strategies for Protecting Your Assets with the Veritas NetBackup Platform
(Source: Symantec) Many companies find their backup and storage resources strained by data growth and increased regulatory requirements for data retention. In today's...

Using VMware Site Recovery Manager to Simplify DR
(Source: NetApp) Nothing is scarier than the prospect of having to recover an entire site after a disaster. VMware® Site Recovery Manager (SRM)...  

Controlling Email and File Server Growth and Costs with Intelligent Archiving
(Source: Symantec) According to IDC 54% of the storage capacity added by organizations in 2008 will be dedicated to the storage of file-based...

NetApp and VMware Virtual Infrastructure 3 Storage Best Practices
(Source: NetApp) NetApp has been providing advanced storage features to VMware ESX solutions since the product began shipping in 2001. During that time,...  

Maximize Storage Assets with Thin Provisioning, Tiered Storage, and Cluster File Systems
(Source: Symantec) Thin Provisioning is an opportunity to immediately optimize your storage systems and make more capacity available to your applications. In order...