Ads by TechWords

See your link here
Receive the latest technology news and information.
Storage
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Object-based storage for Linux clusters

October 30, 2003 12:00 PM ET

Computerworld - Linux cluster computing has transformed the architecture of high-performance computing applications. High-cost supercomputers are being replaced by low-cost Linux clusters to solve the most challenging computing problems. To complement the performance potential of these Linux compute clusters, a new storage paradigm is needed. Object-based storage clustering is the foundation for a new class of storage systems that scale in capacity and performance to meet the demands of the most powerful Linux-based clusters.


For years, high-performance cluster computing has delivered solutions to the world's most challenging technical computing problems. More recently, these successes have been replicated in high-performance commercial applications using Linux clusters. Geophysicists are developing more capable seismic-analysis techniques to create images of the Earth's substructure and guide oil-field drilling and extraction operations. Pharmaceutical companies mine massive genomic data sets to provide better insight into human disease and develop more effective therapies. And Internet portals such as Yahoo Inc. and Google Inc. index and serve the content of the Internet.


An increasing appetite for shared storage performance


In addition to hefty computational requirements, these applications are characterized by high-performance I/O needs. Rapid access to shared data sets, often multiple terabytes in size, is critical for ensuring optimal use of compute cluster assets. Without it, already scant resources sit idle. These data sets need to be made globally available to all processes executing on the compute cluster in order to simplify development and systems management activities. Traditional networked storage systems are incapable of providing the necessary performance to serve the aggressive shared-access requirements of these expanding clusters.


For example, animation-rendering applications distribute scene generation tasks to hundreds of cluster compute nodes—each generating an individual frame of the final segment. Shared-scene and character information and per-frame rendering instructions must be accessed by each of the participating compute nodes, and each node generates as much as 50MB of output per frame. The individual frames are then sequenced and assembled into their final form for review. This is a common data-access scenario across many cluster computing applications.


Shortcomings of traditional shared storage


The natural inclination of cluster computing developers is to deploy shared storage that can be accessed by all nodes in the cluster. However, standard shared-storage technologies provided by file servers built from direct-attached storage are only sufficient for small clusters. Larger clusters require more scalable storage. Storage-area networks (SAN) and optimized network-attached storage (NAS) architectures have been employed for modest-sized clusters, however, these architectures have severe limitations as clusters become larger. Neither SAN nor NAS architectures support the aggressive concurrency and high per-client throughput requirements of these cluster computing applications.



Jump to comments

Storage

Additional Resources

WHITE PAPER
Approximately 60 percent of data migration projects overrun time or budget, while some fail completely. Download this white paper, "Enhancing Your Chance for Successful Data Migration," to learn the critical steps you need to take to execute a data migration project with minimum cost and risk to your business.
WHITE PAPER
Read the Gartner research note to learn why the TCO of a server-based computing deployment used to deliver all applications to users is around 50% lower than that of an unmanaged desktop deployment.
WHITE PAPER
Economic downturns have a tendency to accelerate emerging technologies, boost the adoption of effective solutions, and punish solutions that are not cost competitive or that are out of synch with industry trends. This IDC White Paper presents the results of an IDC survey of 330 companies in Western Europe, Asia/Pacific and the Americas that measures the receptiveness to Linux and takes into consideration changing views driven by the disruptive economic environment that businesses face today.

White Papers & Webcasts

Data Manager Report Excerpt: File System Inventory
Cut storage costs and boost operational efficiencies.  

Key Strategies for Managing Data Growth
What are you storage challenges?

Reducing Storage Costs with F5 ARX
Save money- deploy ARX Solutions.  

Data Protection is not an insurance policy -you cannot buy-back lost data
Find out why you need to maintain access to critical information to run your business and remain competitive.

Strategic ECM Webinar
Learn what new strategic business benefits can be realized through ECM!

5 Architecture Issues that Impact BES performance
Register to attend this LIVE Webinar to learn 5 Architecture Issues that Impact BES performance!