This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:164 [2017/10/26 18:26] hmeij07 |
cluster:164 [2017/10/30 14:02] hmeij07 [PPMA Bench] |
||
---|---|---|---|
Line 560: | Line 560: | ||
exit $? | exit $? | ||
+ | |||
+ | </ | ||
+ | |||
+ | ==== PPMA Bench ==== | ||
+ | |||
+ | * Runs fastest when constrined to one gpu with 4 mpi threads | ||
+ | * Room for improvement as gpu and gpu memory are not fully utilized | ||
+ | * Adding mpi threads or more gpus reduces ns/day performance | ||
+ | * No idea if adding omp threads shows a different picture | ||
+ | * No idea how it compares to K20 gpus | ||
+ | |||
+ | < | ||
+ | |||
+ | nvidia-smi -pm 0; nvidia-smi -c 0 | ||
+ | # gpu_id is done via CUDA_VISIBLE_DEVICES | ||
+ | export CUDA_VISIBLE_DEVCES=[0, | ||
+ | |||
+ | # on n78 | ||
+ | cd / | ||
+ | rm -f / | ||
+ | time / | ||
+ | / | ||
+ | -in nvt.in -var t 310 > /dev/null 2>& | ||
+ | |||
+ | |||
+ | PMMA Benchmark Performance Metric ns/day (x nr of gpus for node output) | ||
+ | |||
+ | |||
+ | Lammps 11Aug17 on GTX1080Ti (n78) | ||
+ | |||
+ | -n 1, -gpu_id 3 | ||
+ | Performance: | ||
+ | 3, GeForce GTX 1080 Ti, 38, 219 MiB, 10953 MiB, 30 %, 1 % | ||
+ | -n 2, -gpu_id 3 | ||
+ | Performance: | ||
+ | 3, GeForce GTX 1080 Ti, 57, 358 MiB, 10814 MiB, 47 %, 3 % | ||
+ | -n 4, -gpu_id 3 | ||
+ | Performance: | ||
+ | 3, GeForce GTX 1080 Ti, 59, 690 MiB, 10482 MiB, 76 %, 4 % | ||
+ | -n 8, -gpu_id 3 | ||
+ | Performance: | ||
+ | 3, GeForce GTX 1080 Ti, 47, 1332 MiB, 9840 MiB, 90 %, 4 % | ||
+ | -n 4, -gpu_id 01 | ||
+ | Performance: | ||
+ | 0, GeForce GTX 1080 Ti, 48, 350 MiB, 10822 MiB, 50 %, 3 % | ||
+ | 1, GeForce GTX 1080 Ti, 37, 344 MiB, 10828 MiB, 49 %, 3 % | ||
+ | -n 8, -gpu_id 01 | ||
+ | Performance: | ||
+ | 0, GeForce GTX 1080 Ti, 66, 670 MiB, 10502 MiB, 77 %, 4 % | ||
+ | 1, GeForce GTX 1080 Ti, 51, 670 MiB, 10502 MiB, 81 %, 4 % | ||
+ | -n 12, -gpu_id 01 | ||
+ | Performance: | ||
+ | 0, GeForce GTX 1080 Ti, 65, 988 MiB, 10184 MiB, 82 %, 4 % | ||
+ | 1, GeForce GTX 1080 Ti, 50, 990 MiB, 10182 MiB, 85 %, 4 % | ||
+ | -n 8, -gpu_id 0123 | ||
+ | Performance: | ||
+ | 0, GeForce GTX 1080 Ti, 56, 340 MiB, 10832 MiB, 57 %, 3 % | ||
+ | 1, GeForce GTX 1080 Ti, 41, 340 MiB, 10832 MiB, 52 %, 2 % | ||
+ | 2, GeForce GTX 1080 Ti, 43, 340 MiB, 10832 MiB, 57 %, 3 % | ||
+ | 3, GeForce GTX 1080 Ti, 42, 340 MiB, 10832 MiB, 55 %, 2 % | ||
+ | -n 12, -gpuid 0123 | ||
+ | Performance: | ||
+ | -n 16 | ||
+ | Performance: | ||
+ | |||
+ | |||
+ | |||
+ | # on n34 | ||
+ | unable to get it to run... | ||
+ | |||
+ | K20 on n34 | ||
+ | |||
+ | -n 1, -gpu_id 0 | ||
+ | -n 4, -gpu_id 0 | ||
+ | -n 4, -gpuid 0123 | ||
+ | |||
+ | # comparison of binaries running PMMA | ||
+ | |||
+ | # lmp_mpi-double-double-with-gpu.log | ||
+ | Performance: | ||
+ | # lmp_mpi-single-double-with-gpu.log | ||
+ | Performance: | ||
+ | # lmp_mpi-single-single-with-gpu.log | ||
+ | Performance: | ||
+ | |||
</ | </ |