Differences

This shows you the differences between two versions of the page.

--- cluster:175 [2018/09/21 14:29]
hmeij07 [Amber]
+++ cluster:175 [2018/09/21 15:16]
hmeij07 [Lammps]
@@ Line 33: / Line 33: @@
 ==== Lammps ====
-We can also not complain about gpu utilization in this example.  We tend to achieve better performance with cpu:gpu ratios in the 4:1 range but not this time.  Best performance was obtained when cpu equaled gpu.
+We cannot complain about gpu utilization in this example either.  On the GTX server we tend to achieve better performance with cpu:gpu ratios in the 4:1 range, but not this time.  Best performance was obtained when the number of cpu threads equaled the number of gpus.
+On our GTX server the best performance came at a cpu:gpu ratio of 16:4, for 932,493 tau/day (11x faster than our K20). However, scaling the job down to a cpu:gpu ratio of 4:2 yields 819,207 tau/day, which means a quad server running two such jobs can deliver about 1.6 million tau/day.
+A single P100 beat this easily, coming in at 2.6 million tau/day. Spreading the problem over more gpus did raise overall performance to 3.3 million tau/day. However, four concurrent cpu:gpu 1:1 jobs would achieve slightly over 10 million tau/day in aggregate. That is almost 10x faster than the GTX server.
-On our GTX server best performance was a ratio of 16:4 cpu:gpu for 932,493 tau/day (11x faster than our K20). However scaling the job to a ration cpu:gpu of 4:2 yields 819,207 tau/day which means a quad server can deliver about 1.6 million tau/day.
 <code>
@@ Line 57: / Line 59: @@
 , Tesla P100-PCIE-16GB, 37, 596 MiB, 15684 MiB, 81 %, 2 %
 , Tesla P100-PCIE-16GB, 37, 596 MiB, 15684 MiB, 80 %, 2 %
 </code>
+==== Gromacs ====
+Gromacs has shown vastly improved performance between versions. v5 delivered about 20 ns/day per K20 server and 350 ns/day on the GTX server. v2018 delivered 75 ns/day per K20 server and 900 ns/day on the GTX server, a roughly 3x improvement.
+<code>
+</code>
 \\
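The throughput figures quoted in the Lammps and Gromacs paragraphs can be cross-checked with a few lines of arithmetic. The sketch below is only illustrative: the dictionary keys and the helper names `aggregate` and `speedup` are hypothetical, the numbers are copied from the text, and it assumes concurrent jobs on one server scale linearly (which the page implies but does not measure).

```python
# Figures quoted in the page text; nothing here is newly measured.
LAMMPS_TAU_PER_DAY = {
    "gtx_16cpu_4gpu": 932_493,    # one job across all four GTX gpus
    "gtx_4cpu_2gpu": 819_207,     # one job on two GTX gpus
    "p100_1cpu_1gpu": 2_600_000,  # one job on a single P100 ("2.6 million")
}

GROMACS_NS_PER_DAY = {
    "k20": {"v5": 20, "v2018": 75},
    "gtx": {"v5": 350, "v2018": 900},
}


def aggregate(per_job, jobs):
    """Total throughput of `jobs` concurrent jobs, ASSUMING linear scaling."""
    return per_job * jobs


def speedup(new, old):
    """Ratio of new throughput to old throughput."""
    return new / old


# Two 4:2 jobs fill a quad-gpu GTX server; four 1:1 jobs fill a quad P100 server.
gtx_two_jobs = aggregate(LAMMPS_TAU_PER_DAY["gtx_4cpu_2gpu"], 2)
p100_four_jobs = aggregate(LAMMPS_TAU_PER_DAY["p100_1cpu_1gpu"], 4)

print(f"GTX, two 4:2 jobs:   {gtx_two_jobs:,} tau/day")
print(f"P100, four 1:1 jobs: {p100_four_jobs:,} tau/day")
print(f"P100 vs GTX (two jobs):     {speedup(p100_four_jobs, gtx_two_jobs):.1f}x")
print(f"P100 vs GTX (single 16:4):  "
      f"{speedup(p100_four_jobs, LAMMPS_TAU_PER_DAY['gtx_16cpu_4gpu']):.1f}x")

# Gromacs v5 -> v2018 improvement per server type.
for server, runs in GROMACS_NS_PER_DAY.items():
    print(f"Gromacs {server}: {speedup(runs['v2018'], runs['v5']):.2f}x")
```

The P100-to-GTX ratio depends on which GTX figure is taken as the baseline (the single 16:4 job or two concurrent 4:2 jobs), so both are printed.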
