This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
cluster:123 [2013/10/10 21:03] hmeij [Replace Dell Racks] |
cluster:123 [2013/10/23 18:52] (current) hmeij [Summary] |
||
---|---|---|---|
Line 1: | Line 1: | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: | ||
- | |||
- | To be send to Dave Baird. | ||
==== Replace Dell Racks ==== | ==== Replace Dell Racks ==== | ||
Line 9: | Line 7: | ||
Subtitle: A win-win solution proposed by Physical Plant and ITS | Subtitle: A win-win solution proposed by Physical Plant and ITS | ||
- | Once upon a time, back in 2013, two Dell racks full of compute nodes, sat noisily chewing away energy on the 5th floor of Science Tower. | + | Once upon a time, back in 2013, two Dell racks full of compute nodes, sat noisily chewing away energy on the 5th floor of Science Tower. |
The Dell racks contain 30 compute nodes, two UPS units, two disks arrays and two switches. We have measured 19 nodes power consumption (pulling one of the dual power units out) with a Kill-A-Watt meter for over 775+ total hours. The mean power consumption rate is 418.4 watts. That totals to 109,956 KwH/year in power consumption ((watts/ | The Dell racks contain 30 compute nodes, two UPS units, two disks arrays and two switches. We have measured 19 nodes power consumption (pulling one of the dual power units out) with a Kill-A-Watt meter for over 775+ total hours. The mean power consumption rate is 418.4 watts. That totals to 109,956 KwH/year in power consumption ((watts/ | ||
Line 19: | Line 17: | ||
Next step was to collect vendor quotes for a target budget of $82K, 3 years of Dell energy consumption, | Next step was to collect vendor quotes for a target budget of $82K, 3 years of Dell energy consumption, | ||
- | Old hardware: | + | Old hardware: |
- | 30 nodes, 2.66 ghz, 4 mb L-cache (for cpu), 240 cores (job slots), | + | 30 nodes, 2.66 ghz, 4 mb L-cache (for cpu), 240 cores (job slots),\\ |
80 gb local drive, 340 gb total ram, 12,555 watts (power no cooling), 670 gigaflops (actual measure) | 80 gb local drive, 340 gb total ram, 12,555 watts (power no cooling), 670 gigaflops (actual measure) | ||
- | New hardware: | + | New hardware |
- | 14 nodes, 2.60 ghz, 20 mb L-cache (for cpu), 224 cores (job slots), | + | 14 nodes, 2.60 ghz, 20 mb L-cache (for cpu), 224 cores (job slots),\\ |
1TB local drive, 1,792 gb total ram, 5,400 watts (power no cooling), 4,659 gigaflops (theoretical) | 1TB local drive, 1,792 gb total ram, 5,400 watts (power no cooling), 4,659 gigaflops (theoretical) | ||
- | |||
- | In the representative example for new hardware the total energy consumption would be 10,800 watts. | ||
- | The total cost of running the new hardware would be $5,913 per year. That would imply savings of $21,576 per year. The job slot count would be 112 but with hyperthreading technology that can be doubled. We'd still want the 1,792 memory footprint (8 gb/core) and the gigaflops (2,329) still far exceeds Dell's performance. | + | New hardware v2 (half of v1): 23,652 KwH/year for cooling or 22% of Old hardware\\ |
+ | 7 nodes, 2.60 ghz, 20 mb L-cache (for cpu), 112 cores (job slots),\\ | ||
+ | 1TB local drive, 1,792 gb total ram, 2,700 watts (power no cooling), 2,329 gigaflops (theoretical) | ||
+ | |||
+ | If we reduced the node count to 7 (the minimum configuration to meet the job slot count of the Dell hardware), the total energy consumption (power plus cooling) would be 5,400 watts. | ||
In two years, the new hardware would have saved $43,152 on energy costs based on the low water mark (Dell' | In two years, the new hardware would have saved $43,152 on energy costs based on the low water mark (Dell' | ||
Line 35: | Line 35: | ||
* There are enough Infiniband ports available to put all new hardware nodes on such a switch (add cards and cables cost for each node) | * There are enough Infiniband ports available to put all new hardware nodes on such a switch (add cards and cables cost for each node) | ||
* The internal disks on each node need to be of a high speed (10K or better) and of a certain size (300 GB or larger) mimicking the Dell disk arrays (adds costs) | * The internal disks on each node need to be of a high speed (10K or better) and of a certain size (300 GB or larger) mimicking the Dell disk arrays (adds costs) | ||
- | * we maybe able to add two more nodes by switching to a more exapnsive | + | * we maybe able to add two more nodes by switching to a more exapansive |
+ | * accomplished by switching from 8 core 2650v2 (130 watt) 2.6 ghz CPU to 10 core 2660v2 (95 watt) 2.2 ghz CPU | ||
But it is all very doable within a budget of $45-$50K. And it can be the solution for: | But it is all very doable within a budget of $45-$50K. And it can be the solution for: | ||
Line 46: | Line 47: | ||
The Libert family rejoices. | The Libert family rejoices. | ||
+ | ==== Update ==== | ||
+ | |||
+ | The table below contains data for a cluster whose nodes are all on the Infiniband switch (and also ethernet switch for provision and data). | ||
+ | |||
+ | |||
+ | ^ Tnodes^ | ||
+ | | 10| 160| 320| 2, | ||
+ | | 9| 144| 288| 2, | ||
+ | | 8| 128| 256| 2, | ||
+ | | 7| 112| 224| 1, | ||
+ | |||
+ | |||
+ | ==== Summary ==== | ||
+ | |||
+ | The Dell racks were bought in 2006. They contain 30 compute nodes, two UPS units, two disks arrays and two switches. Measurements of 2/3rds of the compute nodes with a Kill-A-Watt meter yields an average consumption of 418.4 watts (if the nodes are computing or not). That totals to 109,956 KwH/year for power, a low water mark. Measurements at the utility panel for one of the racks yields a consumption of 126,000 KwH/year. Cooling requirements (not measured) are assumed to be equal to that. | ||
+ | |||
+ | Using the low water mark, and a cost per KwH (inclusive of cogen generation costs, maintenance, | ||
+ | |||
+ | New hardware could replace the Dell's functionality and reduce power/ | ||
+ | |||
+ | Such a cluster would consume 77% less energy generating $21,094 in saving per year (after accounting for energy needs of that 8-node cluster). | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: |