cluster:164

Differences

This shows you the differences between two versions of the page.

cluster:164 [2017/10/27 19:30]
hmeij07
cluster:164 [2017/10/27 19:38]
hmeij07
Line 564:
  
 ==== PMMA Bench ====
 +
 +  * Runs fastest when constrained to one GPU with 4 MPI threads
 +  * Room for improvement, as the GPU and GPU memory are not fully utilized
 +  * Adding MPI threads or more GPUs reduces ns/day performance
 +  * Not yet known whether adding OpenMP threads changes the picture
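
The comparison behind these bullets amounts to sweeping the MPI process count (`-n`) and the GPU count (`-pk gpu`) for the benchmark command on this page. The following is a minimal sketch under stated assumptions, not the author's actual procedure: it assumes `$STRING_1` stands for the MPI process count and `$STRING_2` for the number of GPUs (the page does not define them), and that the log's Performance line has the form `Performance: <value> ns/day, ...`. The helper names `build_cmd` and `ns_per_day` and the swept values are hypothetical. The loop is a dry run that only prints each command.

```shell
#!/bin/bash
# Paths copied from the benchmark command on this page.
MPIRUN=/usr/local/mpich-3.1.4/bin/mpirun
LMP=/usr/local/lammps-11Aug17/lmp_mpi-double-double-with-gpu

# Hypothetical helper: build the command line for a given
# MPI process count ($1) and GPU count ($2).
# Assumption: STRING_1 = MPI process count, STRING_2 = number of GPUs.
build_cmd() {
  echo "$MPIRUN -launcher ssh -f ./hostfile -n $1 $LMP -suffix gpu -pk gpu $2 -in nvt.in -var t 310"
}

# Hypothetical helper: read log lines on stdin and print the ns/day value.
# Assumption: the line looks like "Performance: <value> ns/day, ...".
ns_per_day() {
  awk '/^Performance/ { print $2 }'
}

# Dry run: print the command for each combination instead of executing it.
# The swept values are illustrative only.
for gpus in 1 2 4; do
  for ranks in 1 2 4 8; do
    echo "gpus=$gpus ranks=$ranks: $(build_cmd "$ranks" "$gpus")"
  done
done
```

To turn the dry run into real measurements, each printed command would be executed on the node and followed by something like `grep ^Performance /tmp/lmp-run.log | ns_per_day`, so that every combination yields a single ns/day figure to compare.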
  
 <code> <code>
 +nvidia-smi -pm 0; nvidia-smi -c 0
 +# gpu_id is selected via CUDA_VISIBLE_DEVICES (a comma-separated list of ids, written without brackets)
 +export CUDA_VISIBLE_DEVICES=[0,1,2,3]
 +
 +# on n78
 +cd /home/hmeij/lammps/benchmark
 +rm -f /tmp/lmp-run.log;rm -f *.jpg;\
 +time /usr/local/mpich-3.1.4/bin/mpirun -launcher ssh -f ./hostfile  -n $STRING_1 \
 +/usr/local/lammps-11Aug17/lmp_mpi-double-double-with-gpu -suffix gpu -pk gpu $STRING_2 \
 +-in nvt.in -var t 310 > /dev/null 2>&1; grep ^Performance /tmp/lmp-run.log
 +
  
-PMMA Benchmark Performance Metric (x nr of gpus)
+PMMA Benchmark Performance Metric ns/day (x nr of gpus for node output)
  
  
-GTX on n78
+Lammps 11Aug17 on GTX1080Ti (n78)
  
 -n 1, -gpu_id 3
cluster/164.txt · Last modified: 2018/09/21 11:59 by hmeij07