cluster:192

Differences

This shows you the differences between two versions of the page.

cluster:192 [2020/02/26 18:25]
hmeij07 [EXX96]
cluster:192 [2020/02/26 18:40]
hmeij07 [Usage]
Line 12: Line 12:
 The new queue ''exx96'' will consist of nodes ''n79-n90''. Each node holds 4x RTX2080S gpus, 2x Xeon Silver 4214 2.2 GHz cpus, 96 GB memory and a 1TB SSD. ''/localscratch'' is around 800 GB.
  
 +A new static resource is introduced for all nodes holding gpus: ''n78'' in queue ''amber128'', ''n33-n37'' in queue ''mwgpu'', and the nodes mentioned above. The name of this resource is ''gpu4''. Moving forward, please use it instead of ''gpu'' or ''gputest''.
  
 +The wrappers provided assume your cpu:gpu ratio is 1:1, hence your submit script will have ''#BSUB -n 1'' and, in the resource allocation line, ''gpu4=1''. If your ratio is something else, set CPU_GPU_REQUEST. For example, CPU_GPU_REQUEST=4:2 expects the lines ''#BSUB -n 4'' and ''gpu4=2'' in your submit script. A sample script is at ''/home/hmeij/k20redo/run.rtx''.
 +
 +The wrappers (78.mpich3.wrapper for ''n78'', and n37.openmpi.wrapper for all others) are located in ''/usr/local/bin''. They set up your environment and start any of these applications from ''/usr/local'': amber, lammps, gromacs, matlab and namd.
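The relationship between CPU_GPU_REQUEST and the submit-script lines described above can be sketched with a small shell helper. This is only an illustration: ''bsub_lines_for'' is a hypothetical function, not part of the wrappers, and the ''span[hosts=1]'' clause is an assumption that the job should stay on a single node. It splits a ''cpus:gpus'' value at the colon and prints the ''#BSUB'' lines that value is expected to match.

```shell
#!/bin/bash
# Hypothetical helper, NOT part of the cluster's wrappers: print the
# #BSUB lines that a given CPU_GPU_REQUEST value is expected to match.
bsub_lines_for() {
    local request="$1"
    # Fall back to the default 1:1 ratio when no value is given.
    if [ -z "$request" ]; then
        request="1:1"
    fi
    local cpus="${request%%:*}"   # text before the colon: cpu count
    local gpus="${request##*:}"   # text after the colon: gpu count
    printf '#BSUB -n %s\n' "$cpus"
    # span[hosts=1] is an assumption: keep all slots on one node.
    printf '#BSUB -R "rusage[gpu4=%s],span[hosts=1]"\n' "$gpus"
}

# Example: the 4:2 ratio mentioned in the text.
bsub_lines_for 4:2
```

In a real submit script the two printed lines would sit next to ''export CPU_GPU_REQUEST=4:2''. Any memory reservation would need to be added to the ''rusage'' string separately.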
 + 
  
 <code>
 +
 +# command that shows gpu reservations
 bhosts -l n79
              gputest gpu4
Line 20: Line 27:
  Reserved        0.0  0.1

 +# old way of doing that
 lsload -l n79

 HOST_NAME               status  r15s   r1m  r15m   ut    pg    io  ls    it   tmp   swp   mem    gpu
 n79                         ok   0.0   0.0   0.0   0%   0.0       0 2e+08  826G   10G   90G    3.0
-
-mdout.325288: Master Total CPU time:          982.60 seconds     0.27 hours  1:1
-mdout.325289: Master Total CPU time:          611.08 seconds     0.17 hours  4:2
-mdout.326208: Master Total CPU time:          537.97 seconds     0.15 hours 36:4
-
-#BSUB -n 4
-#BSUB -R "rusage[gpu4=2:mem=6288],span[hosts=1]"
-export CPU_GPU_REQUEST=4:2

 </code>
cluster/192.txt · Last modified: 2022/03/08 18:29 by hmeij07