Languages for Supercomputing Get 'Suped' Up
Three new programming languages seek to speed up the development of supercomputer applications.
For years, the name of the game in supercomputing has been raw speed, with hardware and software designers striving to boost the number of floating-point operations per second (FLOPS) that could be crunched. Gigaflops computers gave way to teraflops machines, which are now yielding to petaflops models, those able to execute 1 quadrillion floating-point operations per second.
But those performance ratings are misleading, because they ignore a huge portion of the time required to solve a problem with these multiprocessor computers — the hours, weeks or even years it can take for software designers to formulate a solution and for programmers to code and test it.
That’s why the Defense Advanced Research Projects Agency in 2002 changed the name of its High Performance Computing Systems program to High Productivity Computing Systems (HPCS). DARPA hoped that its contractors — Cray Inc., IBM and Sun Microsystems Inc. — could come up with programming languages and tools to improve software development productivity tenfold.
Sun recently lost its bid to go to the next phase of the DARPA job, but that hasn’t stopped it from forging ahead with its HPCS programming language, called Fortress. In January, Sun released an early version of a Fortress interpreter. Similarly, Cray and IBM have released their own first-draft implementations of new languages.
The three languages, all available as open-source software, differ substantially when it comes to details, but they have this much in common:
- They are intended to boost programmer productivity by a factor of 10 or more while running at least as efficiently as existing languages such as Fortran and C.
- They are aimed at a wide range of multiprocessor computers and clusters, from the “petascale” behemoths at national laboratories to the multicore processors now appearing on desktops. Similarly, they are intended for use in at least some mainstream, business-oriented applications, not just in science and engineering.
- They try to make it easier for programmers to exploit the various levels of parallelism in application software: threads, multicores, multiprocessors and distributed clusters.
- They employ techniques to relieve programmers of work and help them avoid coding errors. For example, all use a technique called "type inference," so programmers don't have to specify the type of every variable, which is tedious and error-prone. And they use techniques for synchronizing operations without locking, so that common problems such as deadlock are avoided.
A Mighty Fortress
Mellor-Crummey says it won't be easy to evolve the nascent languages, which now run on single, shared-memory systems, to run efficiently on big, distributed-memory parallel systems. "Until then, these languages won't see much attention," he says.
Eric Allen, a co-leader of the Fortress project at Sun Labs, says the language is ideally suited for relatively static environments. But applications that do a lot of dynamic code-loading or Web accessing would probably still be coded in Java, he adds. He says a full-function Fortress compiler will be developed and will include optimization features that have never existed in a language before (see diagram).
Like Fortress, Cray’s Chapel is a brand-new language. A few alpha users are working with an early Chapel compiler for serial code, but a production-grade compiler for parallel codes is several years away, according to Chief Technology Officer Steve Scott. He says Cray is also developing debugging and performance-analysis tools that, unlike existing tools, will be able to scale up to systems with 1 million processors.
Scott says Chapel will be well suited for machines with low communications overhead, globally addressable memory and many possible parallel threads of execution. He says the most important advance in Chapel is its separation of algorithm specification from machine-dependent structural considerations. That makes it possible for programmers first to code and debug algorithms in relatively simple programs, then later specify how the data is to be laid out in the machine for the most efficient access.
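The separation Scott describes can be illustrated with a minimal sketch. It is written in Python rather than Chapel, and it does not reflect Chapel's actual syntax or runtime. The names `block_layout`, `cyclic_layout` and `dot` are hypothetical, chosen only for this illustration. The algorithm (a dot product) is written once, and the way indices are divided among workers is supplied separately, so the layout can be changed later without touching the algorithm.

```python
from concurrent.futures import ThreadPoolExecutor


def block_layout(n, workers):
    """Layout choice 1: give each worker one contiguous slice of 0..n-1."""
    size = -(-n // workers)  # ceiling division
    return [list(range(w * size, min((w + 1) * size, n))) for w in range(workers)]


def cyclic_layout(n, workers):
    """Layout choice 2: deal the indices 0..n-1 out to the workers round-robin."""
    return [list(range(w, n, workers)) for w in range(workers)]


def dot(xs, ys, layout=block_layout, workers=4):
    """The algorithm itself: it never says how the indices are partitioned."""
    def partial(indices):
        # Each worker sums the products for the indices it was assigned.
        return sum(xs[i] * ys[i] for i in indices)

    chunks = layout(len(xs), workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Combine the per-worker partial sums into the final result.
        return sum(pool.map(partial, chunks))


xs = list(range(10))
ys = [2] * 10
result_block = dot(xs, ys, layout=block_layout)    # debug with a simple layout first
result_cyclic = dot(xs, ys, layout=cyclic_layout)  # swap the layout later, same algorithm
```

Both calls compute the same value; only the assignment of indices to workers differs. Python threads are used here to show the structure of the idea, not to demonstrate a speedup.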
IBM’s entry, code-named X10, is a parallel, distributed, object-oriented language developed as an extension of Java. It is designed for systems built out of multicore symmetric multiprocessing chips — such as IBM’s Power processors — interconnected in scalable cluster configurations.
X10 takes the advantages of object orientation in Java for serial code and adds language constructs for parallel and distributed processing, says Vijay Saraswat, a researcher at IBM. The early version of X10 simply translates X10 code into Java, but a full-function optimizing compiler will be available to meet DARPA’s 2010 deadline, he says.
DARPA says it has “no plan” to pick a winner among the new languages, but it clearly hopes that at least one of them will be a commercial success. And, as multicore processor chips become ubiquitous, that would be a welcome outcome, says Mellor-Crummey.
“What we are seeing,” he says, “is not a gradual shift but a cataclysmic shift from the sequential world to one in which every processor is parallel. In a small number of years, if your language does not support parallelism, that language will just wither and die.”
Read more about Applications in Computerworld's Applications Topic Center.