Army makes major Linux HPC cluster move
Purchase will more than double MSRC's computing capability
Computerworld - A U.S. Army supercomputing center with a legacy that dates to the first large computer, the Electronic Numerical Integrator and Computer (ENIAC) launched in 1946, is moving to Linux-based clusters in a major hardware purchase that will more than double its computing capability.
The Army Research Laboratory Major Shared Resource Center (MSRC)is buying four Linux Networx Inc. Advanced Technology Clusters, including a system with 4,488 processing cores, or 1,122 nodes, with each node made up of two dual-core Intel Xeon chips. A second system has 842 nodes.
In total, this purchase will increase its computing capability from 36 trillion floating-point operations per second (TFLOPS) to more than 80 TFLOPS, Army officials said.
The MSRC, which is based at the Aberdeen Proving Ground in Harford County, Md., has been involved in every aspect of computing technology since its beginning, and this decision to move into commodity clusters was not made quickly, said Charles J. Nietubicz, director of the MSRC.
The lab held a symposium in 2003 to explore the issue and began running a small, 256-processor cluster system. "We saw that cluster computing was this new kid on the block and was interesting," said Nietubicz, but the center wasn't about to start scrapping its other systems made by Silicon Graphics Inc., Sun Microsystems Inc. and IBM, he said.
The MSRC isn't disclosing the purchase price, but Earl Joseph, an analyst at IDC in Framingham, Mass., said the average cost for a cluster works out to about $2,000 per processor compared with $12,000 per processor for a RISC-based system.
Nietubicz said other vendors "are going to have to begin to recognize that either they provide some other kind of performance to try to gain the increased price, or they are going to have to reduce the price to provide equivalent performance."
Bluffdale, Utah-based Linux Networx builds systems using Advanced Micro Devices Inc. and Intel Corp. chips. In addition to the four systems sold to the MSRC, it also sold one to the Dugway Proving Ground. In total, the sale of the five systems is the company's largest supercomputing order ever. The sale was announced today.
Nietubicz said he was convinced that clusters can work based on the MSRC's ability to get certain computational codes used in fluid dynamics, structural mechanics and other processes to scale to multiple processors mostly by using Message Passing Interface (MPI) protocol-based code. MPI is used to create parallel applications.
The major competitor to supercomputing clusters and their distributed memory systems is symmetric multiprocessing, or SMP, a shared-memory system primarily used in RISC-based systems. Of the total$9.1 billion high-performance computing market last year, clusters accounted for about half of the sales, according to IDC.
A major limitation for moving to clusters is whether the high-performance software can scale to multiple processors. Systems that have been written in MPI can do so, but Joseph said it's difficult to accomplish for companies since many off-the-shelf software packages don't use MPI. Government labs and universities, which own their own code, can usually invest the time to convert their codes into MPI, he said.
Nietubicz doesn't see any major limitations to clusters, and while not all codes can scale on clusters, he said the same problems occurred as the center moved from vector to shared memory. "In each major transition, there were always people saying, 'I can't use that, I need my old stuff.'"
Read more about Hardware in Computerworld's Hardware Topic Center.
- Google I/O 2013's Coolest Products and Services
- 10 Star Trek Technologies That are Almost Here
- 19 Generations of Computer Programmers
- 25 Must-Have Technologies for SMBs
- A walking tour: 33 questions to ask about your company's security
- 15 social media scams
- The 7 elements of a successful security awareness program
- IT Certification Study Tips
- Register for this Computerworld Insider Study Tip guide and gain access to hundreds of premium content articles, cheat sheets, product reviews and more.
- Deploying Flash in the Enterprise Flash is quickly emerging as the preferred way to overcome the nagging performance limitations of hard disk drives.
- FTP vs MFT: Why It's Time to Make the Change Get the facts you need to make the case for managed file transfer. Read the report to get head-to-head comparisons of cost, reliability,...
- ESG Lab Validation Report Preview - QLogic FabricCache QLE10000 Adapter This ESG Lab preview summarizes the results of independent, third-party testing of QLogic's 10000 Series 8Gb Fibre Channel Adapter.
- QLE10000 Series Adapter Provides Application Benefits Through I/O Applications that are Web 2.0, mission-critical, I/O intensive, virtualized, and clustered continue to put an additional burden on processors and slower storage, which...
- Live Webcast
Storage Validation at Go Daddy: Best Practices from the World's #1 Web Hosting Provider - Storage Validation at Go Daddy: Best Practices from the World's #1 Web Hosting Provider
- Live Webcast
On-Demand Webcast: 7 Reasons to Choose VoIP - Thinking about a new phone system for your business?
Be sure to watch this informative webcast. Steve Strauss, small business columnist for USA... - Live Webcast
Unified Communications 101 - Learn more!
- Lenovo & Windows 8 Innovative Devices Podcast Learn about the innovated devices that Lenovo designed to take full advantage of the new touch interface of Microsoft's Windows 8 Pro.
- Technology Support Solutions case study - Calvary Chapel Learn how Calvary Chapel leverages technology to support the church's mission and educational programs, with the help of PC Connection and Lenovo.