Differences

This shows you the differences between two versions of the page.

--- cluster:164 [2017/10/27 19:30]
hmeij07
+++ cluster:164 [2017/10/30 13:54]
hmeij07 [PPMA Bench]
@@ Line 564: / Line 564: @@
 ==== PPMA Bench ====
+  * Runs fastest when constrined to one gpu with 4 mpi threads
+  * Room for improvement as gpu and gpu memory are not fully utilized
+  * Adding mpi threads or more gpus reduces ns/day performance
+  * No idea if adding omp threads shows a different picture
+  * No idea how it compares to K20 gpus
 <code>
-PMMA Benchmark Performance Metric (x  nr of gpus)
+nvidia-smi -pm 0; nvidia-smi -c 0
+# gpu_id is done via CUDA_VISIBLE_DEVICES
+export CUDA_VISIBLE_DEVCES=[0,1,2,3]
+# on n78
+cd /home/hmeij/lammps/benchmark
+rm -f /tmp/lmp-run.log;rm -f *.jpg;\
+time /usr/local/mpich-3.1.4/bin/mpirun -launcher ssh -f ./hostfile  -n $STRING_1 \
+/usr/local/lammps-11Aug17/lmp_mpi-double-double-with-gpu -suffix gpu -pk gpu $STRING_2 \
+-in nvt.in -var t 310 > /dev/null 2>&1; grep ^Performance /tmp/lmp-run.log
+PMMA Benchmark Performance Metric ns/day (x  nr of gpus for node output)
-GTX on n78
+Lammps 11Aug17 on GTX1080Ti (n78)
 -n 1, -gpu_id 3
@@ Line 618: / Line 636: @@
 -n 4, -gpuid 0123
+# comparison of binaries running PMMA
 </code>

DokuWiki