This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:164 [2017/10/26 18:05] hmeij07 [Bench] |
cluster:164 [2017/10/26 18:26] hmeij07 |
||
---|---|---|---|
Line 99: | Line 99: | ||
* Running across more gpus (2 or 4) decreases performance | * Running across more gpus (2 or 4) decreases performance | ||
* One Amber process per MPI thread per GPU is optimal | * One Amber process per MPI thread per GPU is optimal | ||
+ | |||
+ | **Wow, I just realized the most important metric: Our k20 has a job throughput of 20 per unit of time. The amber128 queue will have a throughput of 4*4.5 or 18 per same unit of time. One new server matches five old ones, well purchased in 2013. From an amber only perspective.** | ||
< | < |