tug
Volume 4, Number 17 -- May 10, 2007

ClearSpeed Tweaks Math Coprocessors, Shows Off Benchmarks

Published: May 10, 2007

by Timothy Prickett Morgan

ClearSpeed Technology, which makes math co-processor boards for workstations and servers, has added a new accelerator board to keep pace with the peripheral slot changes in motherboards. But perhaps more significantly for its efforts to sell its products, ClearSpeed has run some benchmarks that show the kinds of performance customers running real number-crunching workloads can expect to see and how they can play off performance gains against unplugging server footprints in clustered supercomputer environments by using the company's Advance accelerators.

The existing Advance board is called the X620, and it has two 96-core CSX600 math processors on each PCI-X peripheral card; this is a three-quarter length, half height PCI-X card. The new card, called the E620, is a PCI-Express x8 peripheral card that is a half-length card that is still full height. The Advance cards burn about 25 watts of power per board, and deliver around 55 gigaflops of double-precision, 64-bit floating point math power. The CSX600 chip has four banks of 24 cores running at 210 MHz, with about one quarter of the chip being floating point units and the rest being memory and supporting interfaces and ports. The whole shebang has 128 million transistors and each CSX600 chip burns about 10 watts.

According to Peter ffoulkes, director of outbound marketing for ClearSpeed, the extra bandwidth that comes with the new PCI-Express card doesn't really help all that much on the kinds of HPC workloads that customers buy the Advance cards to goose. "You would think that the bandwidth would help, but when doing matrix math, the amount of computation completely overwhelms the I/O," says ffoulkes.

Both the existing PCI-X and new PCI-Express cards cost around $8,000 in single unit quantities, with unspecified but significant volume discounts. This is, says ffoulkes, about what a field programmable gate array co-processor costs for an X86 server, and it is less than half of what IBM is charging for its QS20 blade server, which has two if its "Cell" Power vector processors on it. "Everything is in the same order of magnitude of price except for the graphics processing units, which are an order of magnitude cheaper than any of these options," says ffoulkes. Of course, as popular as the GPU idea is as a co-processor for HPC workloads, GPUs only offer single-precision floating point math and they are not yet supporting 64-bit math. So they have their limits, too.

In addition to launching the new PCI-Express card, ClearSpeed has also launched a random number generator for the cards, which allows the Monte Carlo simulations that the financial services industry uses to balance their portfolios and make money off your money. The company has also updated its vector math library, called CSXL, to the 2.5 release, which that provides native support for Microsoft's Windows platform; Linux was supported in prior releases. The random number generator was an add-on before CSXL 2.5. ClearSpeed has also created a tool called Visual Profiler, which allows programmers to look deep into the way the code is running on their X64 systems with Advance cards and see where the bottlenecks are in the system that might be inhibiting performance.

To help it better sell its Advance boards, ClearSpeed ran a Monte Carlo simulation on an X64 server with two 3 GHz processors equipped with 3 GB of main memory and Red Hat Enterprise Linux 4. The simulation, which included 400 million data samples using a European pricing model for stocks, could run at 6.4 million samples per second with one X64 core activated, and doubled to 12.9 million samples per second with two cores turned on. Then, ClearSpeed slipped in one of its Advance cards, and the rate rose to 130.9 million samples per second (which means the job that took 60 seconds with a single CPU core finished in 2.9 seconds with one Advance card). A second Advance board added to this server nearly doubled performance to 260 million samples per second, a third card drove it to 386.1 million samples per second, and the fourth card pushed it up to 505.1 million samples per second. While that last card didn't deliver a linear performance improvement, this is an incredible increase in the speed at which the Monte Carlo simulation can run--from 60 seconds down to 0.8 seconds.

ClearSpeed has also ran the industry-standard Linpack benchmark on a cluster of X64-based ProLiant DL380 GHz servers with two dual-core "Woodcrest" Xeon 5150 processors (which run at 3 GHz) and 14 GB of memory. (Linpack is, of course, the Fortran matrix math benchmark that it used to rank the Top 500 supercomputers.) A four-node cluster of these DL380 machines was able to do 114.8 gigaflops on the Linpack test, but adding an X620 Advance card to each node more than doubled performance to 251.3 gigaflops. To get roughly the same performance (258.3 gigaflops, specifically), it took nine of those ProLiant DL380 server nodes, which took up twice as much space and burned twice as much power. (The four-node cluster burned an average of 1,722 watts, the accelerated four-node cluster burned an average of 1,750 watts, and the nine-node cluster without accelerators burned 3,875 watts.) Clearly, it makes more sense to use the co-processor than to double up on the servers, and if you look carefully, you can see that offloading some work to the Advance co-processor boards actually makes the server CPUs a little bit less hot, compensating some for the increase in power added by the Advance boards themselves.


RELATED STORIES

ClearSpeed Ships New Math Accelerator, Inks Deal with IBM

ClearSpeed Ships Advance Co-Processors in Giant Sun Supercomputer

Sun, NEC, and AMD Partner for 50 Teraflops Opteron Cluster



                     Post this story to del.icio.us
               Post this story to Digg
    Post this story to Slashdot


Sponsored By
VIBRANT TECHNOLOGIES

HP, IBM and Sun Server Deals via RSS

                                                  · Subscribe to our Specials via RSS
                                                  · Up to 80% off manufacturer's list price
                                                  · Multi-million dollar inventory

We Buy & Sell new and remarketed servers,
upgrades, peripherals and parts.

HP Proliant, IBM xSeries, IBM pSeries, RS6000,
HP Integrity, Sun Microsystems, Cisco, more…
888-443-8606

View or Subscribe to:
Special Offers on Servers and Upgrades


Editor: Timothy Prickett Morgan
Contributing Editors: Dan Burger, Joe Hertvik,
Shannon O'Donnell, Timothy Prickett Morgan
Publisher and Advertising Director: Jenny Thomas
Advertising Sales Representative: Kim Reed
Contact the Editors: To contact anyone on the IT Jungle Team
Go to our contacts page and send us a message.

Sponsored Links

Vibrant Technologies:  Quality Used Servers, Storage & Networking Hardware at up to 80% off new
World Data Products:  FREE 84-page Unix/Midrange Server Spec Book
COMMON:  Join us at the Annual 2008 conference, March 30 - April 3, in Nashville, Tennessee


The Four Hundred
IBM Focusing on i5 Account Sales, Not i5 Sales

Dr. Frank Soltis at COMMON: A Show Worth Watching

i5/OS Curriculum Contingent on Job Prospects, Business Community

As I See It: Education--the Other Dysfunction

The Linux Beacon
Brazilian Game Site Chooses Hybrid Mainframe-Cell Platform

Q&A with HP's Paul Miller: The X64 Server Biz

How To Build a Green Data Center

As I See It: Induced Labor

Four Hundred Stuff
Arcad Positions for Growth in Change Management

Profound Releases Genie, Lauded for Disney Work

iMessaging Adopts SIP for Call Center Software

ABL Unveils Strategi SOA

Big Iron
Micro Focus Buys COBOL App Modernization Rival Acucorp

Top Mainframe Stories From Around the Web

Chats, Webinars, Seminars, Shows, and Other Happenings

Four Hundred Guru
WHERE Versus HAVING

Error-Checking Email Addresses, for Intelligent People

Admin Alert: The i5 Battery Checking Process

System i PTF Guide
May 5, 2007: Volume 9, Number 18

April 28, 2007: Volume 9, Number 17

April 21, 2007: Volume 9, Number 16

April 14, 2007: Volume 9, Number 15

April 7, 2007: Volume 9, Number 14

March 31, 2007: Volume 9, Number 13

The Windows Observer
Patch Tuesday Yields Seven Critical Patches for 19 Flaws

Microsoft Moves Forefront as Security Market Changes

Q&A with HP's Paul Miller: The X64 Server Biz

Microsoft Taps Packeteer for Branch Office Server

Four Hundred Monitor
Four Hundred Monitor's
Full iSeries Events Calendar

THIS ISSUE SPONSORED BY:

Lakeview Technology
Arkeia
MKS
Roaring Penguin
Vibrant Technologies



TABLE OF CONTENTS
IBM Lengthens and Broadens AIX Support on Power Iron

Sun Backs QuickTransit for Sparc to X64 Migration

IBM Sees Green in Going Green in Data Centers

As I See It: Education--the Other Dysfunction

But Wait, There's More:


HP Raises Its Guidance for Fiscal Second Quarter . . . Intel Establishes Cross-Divisional HPC Unit . . . ClearSpeed Tweaks Math Coprocessors, Shows Benchmarks . . . Workstation 6 Previews VMware's Future Server Virtualization . . . IBM Grows Chips Like Snowflakes Using Natural Processes . . . SOA Will Be Used in Half of the Enterprise Applications Created in 2007 . . .

The Unix Guardian

BACK ISSUES





 
Subscription Information:
You can unsubscribe, change your email address, or sign up for any of IT Jungle's free e-newsletters through our Web site at http://www.itjungle.com/sub/subscribe.html.

Copyright © 1996-2008 Guild Companies, Inc. All Rights Reserved.
Guild Companies, Inc., 50 Park Terrace East, Suite 8F, New York, NY 10034

Privacy Statement