cluster:182
Differences
This shows you the differences between two versions of the page.
| cluster:182 [2019/08/13 12:37] – [Gromacs] hmeij07 | cluster:182 [2019/12/13 13:33] (current) – hmeij07 | ||
|---|---|---|---|
| Line 2: | Line 2: | ||
| **[[cluster: | **[[cluster: | ||
| + | |||
| ==== P100 vs RTX 6000 & T4 ==== | ==== P100 vs RTX 6000 & T4 ==== | ||
| Line 54: | Line 54: | ||
| | DPFP | 5.21| 18.35| | | DPFP | 5.21| 18.35| | ||
| | SXFP | 11.82| | | SXFP | 11.82| | ||
| - | | | + | | |
| As in the previous round of testing, in SFFP precision mode it is best to run four individual jobs, one per GPU (mpi=1, gpu=1). The best performance is the P100 at 47.64 ns/day per node vs the RTX at 39.69 ns/day per node. The T4 runs about one third as fast and falters badly in DPFP precision mode, but in SXFP (experimental) precision mode the T4 makes up ground. | As in the previous round of testing, in SFFP precision mode it is best to run four individual jobs, one per GPU (mpi=1, gpu=1). The best performance is the P100 at 47.64 ns/day per node vs the RTX at 39.69 ns/day per node. The T4 runs about one third as fast and falters badly in DPFP precision mode, but in SXFP (experimental) precision mode the T4 makes up ground. | ||
| Line 90: | Line 90: | ||
| ==== Gromacs ==== | ==== Gromacs ==== | ||
| - | Gromacs was built locally on each of the nodes, letting the build select the optimal CPU (AVX, SSE) and GPU accelerators. The '' | + | Gromacs was built locally on each of the nodes, letting the build select the optimal CPU (AVX, SSE) and GPU accelerators. |
| Line 99: | Line 99: | ||
| | Mixed | | | | | 733| gpu=4, 01-16 | | | Mixed | | | | | 733| gpu=4, 01-16 | | ||
| - | The T4 is the P100's equal in mixed precision performance. Add the wattage factor and you have a favorite. | + | The T4 is the P100's equal in mixed precision performance. Add the wattage factor and you have a favorite. And GPU utilization was outstanding. |
| + | [heme@login1 gromacs-2018]$ ssh node9 ./ | ||
| + | id, | ||
| + | 0, Tesla T4, 66, 866 MiB, 14213 MiB, 98 %, 9 %\\ | ||
| + | 1, Tesla T4, 67, 866 MiB, 14213 MiB, 98 %, 9 %\\ | ||
| + | 2, Tesla T4, 66, 866 MiB, 14213 MiB, 99 %, 9 %\\ | ||
| + | 3, Tesla T4, 64, 866 MiB, 14213 MiB, 97 %, 9 %\\ | ||
| ==== Scripts ==== | ==== Scripts ==== | ||
| Line 191: | Line 197: | ||
| </ | </ | ||
| - | And GPU utilization was outstanding. | ||
| - | [heme@login1 gromacs-2018]$ ssh node9 ./ | ||
| - | id, | ||
| - | 0, Tesla T4, 66, 866 MiB, 14213 MiB, 98 %, 9 %\\ | ||
| - | 1, Tesla T4, 67, 866 MiB, 14213 MiB, 98 %, 9 %\\ | ||
| - | 2, Tesla T4, 66, 866 MiB, 14213 MiB, 99 %, 9 %\\ | ||
| - | 3, Tesla T4, 64, 866 MiB, 14213 MiB, 97 %, 9 %\\ | ||
| \\ | \\ | ||
| **[[cluster: | **[[cluster: | ||
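
The GPU utilization listing for node9 can be summarized with a short script. The following is only a sketch: the header line of that listing is truncated in this diff, so the column names in `FIELDS` are assumptions (index, name, temperature, memory used, memory free, GPU utilization, memory utilization), and `parse_row` is a hypothetical helper, not part of the original scripts.

```python
from statistics import mean

# Rows copied from the node9 listing on this page (trailing wiki line breaks removed).
SAMPLE = """\
0, Tesla T4, 66, 866 MiB, 14213 MiB, 98 %, 9 %
1, Tesla T4, 67, 866 MiB, 14213 MiB, 98 %, 9 %
2, Tesla T4, 66, 866 MiB, 14213 MiB, 99 %, 9 %
3, Tesla T4, 64, 866 MiB, 14213 MiB, 97 %, 9 %
"""

# ASSUMPTION: the real header is cut off in the source, so these names are guesses.
FIELDS = (
    "gpu_id", "name", "temp_c",
    "mem_used_mib", "mem_free_mib",
    "gpu_util_pct", "mem_util_pct",
)


def parse_row(line):
    """Split one comma-separated row and convert numeric columns to int."""
    # Drop surrounding whitespace and any trailing DokuWiki line break (\\).
    parts = [p.strip() for p in line.strip().rstrip("\\").split(",")]
    if len(parts) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(parts)}")
    row = dict(zip(FIELDS, parts))
    for key in FIELDS:
        if key != "name":
            # "866 MiB" -> 866, "98 %" -> 98, "66" -> 66
            row[key] = int(row[key].split()[0])
    return row


rows = [parse_row(line) for line in SAMPLE.splitlines() if line.strip()]
avg_gpu_util = mean(r["gpu_util_pct"] for r in rows)
min_gpu_util = min(r["gpu_util_pct"] for r in rows)

print(f"GPUs: {len(rows)}, mean GPU util: {avg_gpu_util} %, lowest: {min_gpu_util} %")
```

Reporting the lowest per-GPU value next to the mean makes it easier to spot a single idle card in a four-GPU job, which a node-wide average alone would hide.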
cluster/182.1565699820.txt.gz · Last modified: by hmeij07
