Table of Contents


Back

All campus utilities are within the physical plant chart of accounts. Departments are not charged.

Overview

Cluster Blue Sky Dell HP GPU1) CPU2) Comment
12/2006 11/2010 04/2013 04/2013
Age (yrs) 11 5.5 1.5 0 0
Nodes (Nr) 45 36 32 4 13
Cores (Nr) 90 256 256 3)16 gpu, 64 cpu 208
Ram (GB) 1,080 384 340 80 gpu, 512 cpu 1,664
Teraflops 4)0.5-0.7 5)0.66 6)1.5 7)18.71 + 8)1.1 9)3.6
L6-30 (Nr) 4 8 4 2 2
Capacity10) (Watts) 24,960 49,920 24,960 12,480 12,480
Power (Watts11)) N/A12) 13)11,500 14)10,602 15)5,900 16)5,250
Cooling (Btus17)) 18)39,240 19)36,175 20)20,171 21)17,913
Annual Costs22) ($) 46,091 42,930 25,864 23,506 23)using 2009 data
$1,000/Tflop 69.8 28.6 1.3 6.5 24)using 2009 data

Details

All dollar figures based on spreadsheet using March 2009 values

Notes

HP Nov-2010

Dell Jan-2007

CPU HPC 2012

GPU HPC 2012

Replace Dell

This is what the picture would look like using the 2009 spreadsheet data

What we should look at is teraflops replacement …

And if we double the teraflops…1.6 Tflops (actually + 1 Tflops), 1.5 years ROI for 96 cores.

L6-30 Connections

Dell April-2013

Actual data obtained April - May, 2013

Kill-A-Watt Meter Stats; pulled one power supply … mean/1000 * 24 * 365 * 30!

Node jobs kwh time hh:mm mean comment annual kwh power annual +cooling annual (x2)
c04 8 2.29 06:30 352 idle, no jobs 92,509 $21,393 $42,786 wow
c32 8 6.94 19:35 356 jobs finished 93,556 $21,610 $43,220
c27 8 7.42 19:30 381 jobs finished 100,127 $22,971 $45,942
c06 8 27.46 65:11 395 4 running 103,806 $23,734 $47,468
c00 8 8.26 21:00 393 still running 103,280 $23,625 $47,250 *
c03 8 10.36 24:03 432 still running 113,530 $25,748 $51,496 *
c04 8 12.02 28:08 429 still running 112,741 $25,585 $51,170 *
c09 8 10.36 24:00 432 still running 113,530 $25,748 $51,496 *
c10 8 10.21 23:55 425 still running 111,690 $25,367 $50,734 *
c17 8 27.07 71:00 381 still running *
c18 8 11.23 24:41 455 still running *
c20 8 50.23 143: 351 still running *
c23 8 8.71 24:00 363 still running *
c25 8 9.80 25:02 392 still running *
c29 8 30.22 66:22 455 still running 119,574 $27,001 $54,002 *
c31 8 10.30 25:10 409 still running 107,485 $24,496 $48,992 *
c32 8 11.42 24:00 476 still running 123,379 $27,789 $55,578 *
c33 8 46.23 113: 409 still running *
c35 8 13.69 28:53 474 still running *

We have measured 19 nodes power consumption (pulling one unit out) with a Kill-A-Watt meter over 775+ hours to arrive at a mean consumption rate of 418.4 watts. That totals to 109,956 KwH/year in power consumption ((watts/1000 Kw per hour) * 24 hours * 365 days * 30 servers), which is a low water mark as the racks in question also contain switches, two UPS units, and a disk array. (Note: Peter did a side calculation using the 4 watt measurement on panel directly and came up with 126,000 KwH/year which can be considered the high water mark estimate).

Based on 12.5 cents per Kwh (this is an all inclusive cost including natural gas cost, heat recoup costs, distribution, maintenance etc) the hardware burns away $13,744.50 per year. Best guess is cooling costs are at least that (another possible low water mark) so the total cost for all power consumption is $27,489 per year. If we run the hardware for another 3 years that total cost is $82,467.

If we could replace, or approximate, the 30 compute nodes' computational power (0.6 teraflops) and job slots (240 cores) with new hardware that consumes 50% less in power, our ROI is 6 years based on the low water mark numbers. When using the high water mark the ROI is 5 years.

Quotes Oct-2013

For $82K one could get CPU-HPC wise …

vendorTnodeschipghzcachecoresTcoresdiskramram/coreTrampdueswitchwarrwattcostgflops
dell30X53552.664mb824080gb4-16gb0.5-2gb340YYN12,552na670
micro142650v22.6020mb162241tb128gb8gb1,792gbYY25,400$77K4,659
advcl102680v22.8025mb2020060gb256gb12.8gb2,560gbYY34,597$84K4,480
exxct122660v22.2025mb202401tb128gb6.4gb1,536gbNN?4,860$60K4,224
hp


Back

1)
graphical processing unit, learn more about GPU History
2)
central processing unit
3)
provides 40,000 total cuda-cores
4)
cpu estimated
5) , 6)
cpu measured with Linpack
7)
gpu published by Nvidia
8) , 9)
cpu theoretical
10)
at 208 Volts
11)
per hour, KwH=Watts/1,000
12)
only turned on when needed, perhaps 8 weeks per year
13)
2009 Wesleyan calculation
14) , 15) , 16) , 19) , 20) , 21)
vendor published
17)
per hour, 1,000 Btus=293 Watts per hour
18)
2009 Wesleyan estimate
22)
power and cooling
23) , 24)
for Wesleyan/vendors data