This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
cluster:175 [2018/09/21 14:29] hmeij07 [Amber] |
cluster:175 [2018/09/21 15:16] hmeij07 [Lammps] |
||
---|---|---|---|
Line 33: | Line 33: | ||
==== Lammps ==== | ==== Lammps ==== | ||
- | We can also not complain about gpu utilization in this example. | + | We can also not complain about gpu utilization in this example. |
+ | |||
+ | On our GTX server best performance was a ratio of 16:4 cpu:gpu for 932,493 tau/day (11x faster than our K20). However scaling the job to a ratio of cpu:gpu of 4:2 yields 819,207 tau/day which means a quad server can deliver about 1.6 million tau/day. | ||
+ | |||
+ | A single P100 beat this easily coming in at 2.6 million tau/day. Spreading the problem over more gpus did raise overall performance to 3.3 million tau/day. However, four cpu:gpu 1:1 jobs would achieve slightly over 10 million tau/day. That is almost 10x faster than the GTX server. | ||
- | On our GTX server best performance was a ratio of 16:4 cpu:gpu for 932,493 tau/day (11x faster than our K20). However scaling the job to a ration cpu:gpu of 4:2 yields 819,207 tau/day which means a quad server can deliver about 1.6 million tau/day. | ||
- | | ||
< | < | ||
Line 57: | Line 59: | ||
2, Tesla P100-PCIE-16GB, | 2, Tesla P100-PCIE-16GB, | ||
3, Tesla P100-PCIE-16GB, | 3, Tesla P100-PCIE-16GB, | ||
- | |||
</ | </ | ||
+ | ==== Gromacs ==== | ||
+ | |||
+ | Gromacs has shown vastly improved performance between version. v5 delivered about 20 ns/day per K20 server and 350 ns/day on GTX server. v2018 delivered 75 ns/day per K20 server and 900 ns/day on GTX server. A roughly 3x improvement. | ||
+ | |||
+ | < | ||
+ | |||
+ | |||
+ | </ | ||
\\ | \\ |