This shows you the differences between two versions of the page.
cluster:182 [2019/08/13 08:41] hmeij07 [Gromacs] |
cluster:182 [2019/08/26 08:03] hmeij07 [Amber] |
||
---|---|---|---|
Line 54: | Line 54: | ||
| DPFP | 5.21| 18.35| | | DPFP | 5.21| 18.35| | ||
| SXFP | 11.82| | | SXFP | 11.82| | ||
- | | | + | | |
As in the previous round of testing, in SPFP precision mode it is best to run four individual jobs, one per GPU (mpi=1, gpu=1). The best performance is the P100 at 47.64 ns/day per node versus the RTX at 39.69 ns/day per node. The T4 runs about 1/3 as fast and really falters in DPFP precision mode, but in SXFP (experimental) precision mode the T4 makes up ground in performance. | As in the previous round of testing, in SPFP precision mode it is best to run four individual jobs, one per GPU (mpi=1, gpu=1). The best performance is the P100 at 47.64 ns/day per node versus the RTX at 39.69 ns/day per node. The T4 runs about 1/3 as fast and really falters in DPFP precision mode, but in SXFP (experimental) precision mode the T4 makes up ground in performance. |
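The "four individual jobs, one per GPU" layout can be sketched roughly as below. This is only an illustration, not the scripts used for these benchmarks. It assumes a node with four GPUs, that each job is pinned to its device through the `CUDA_VISIBLE_DEVICES` environment variable, and that the engine is Amber's `pmemd.cuda`. The input and output file names (`mdin`, `prmtop`, `inpcrd`, `mdout.gpuN`) and the helper `plan_jobs` are hypothetical placeholders.

```python
import os


def plan_jobs(n_gpus=4, engine="pmemd.cuda"):
    """Build one (env, argv) pair per GPU. Nothing is launched here."""
    jobs = []
    for gpu in range(n_gpus):
        # Each job sees exactly one device, so the four runs do not share a GPU.
        env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu))
        # Hypothetical file names; substitute the real inputs for the benchmark.
        argv = [engine, "-O",
                "-i", "mdin",
                "-p", "prmtop",
                "-c", "inpcrd",
                "-o", f"mdout.gpu{gpu}"]
        jobs.append((env, argv))
    return jobs


if __name__ == "__main__":
    for env, argv in plan_jobs():
        print(f"CUDA_VISIBLE_DEVICES={env['CUDA_VISIBLE_DEVICES']}", " ".join(argv))
```

To actually start the runs, each pair could be handed to `subprocess.Popen(argv, env=env)`. On a scheduled cluster the GPU assignment would more likely come from the scheduler than from a hard-coded index.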