User Tools

Site Tools


cluster:218

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
cluster:218 [2022/07/18 14:12]
hmeij07 [Basic Commands]
cluster:218 [2022/09/08 08:51] (current)
hmeij07 [Resources]
Line 28: Line 28:
   * August 2022 is designated **migration** period   * August 2022 is designated **migration** period
   * Queues ''hp12'' and ''mwgpu'' (centos6) will be serviced by Openlava, not Slurm   * Queues ''hp12'' and ''mwgpu'' (centos6) will be serviced by Openlava, not Slurm
 +
 +
 +==== Quick Start Slurm  Guide ====
 +
 +Jump to the **Rocky8/CentOs7 script templates** listed in the menu of this page, top right.
 +
 +There is also detailed information on Amber20/Amber22 on this page with script examples.
 +
 +  * [[cluster:214|Tada]] new head node
  
 ==== Basic Commands ==== ==== Basic Commands ====
Line 76: Line 85:
  
 You must request **resources**, that is for example number of cpu cores or which gpu model to use. ** If you do not request resources, Slurm will assume you need all the node's resources** and thus prevent other jobs from running on that node. You must request **resources**, that is for example number of cpu cores or which gpu model to use. ** If you do not request resources, Slurm will assume you need all the node's resources** and thus prevent other jobs from running on that node.
 +
 +Details
 +
 +  * https://slurm.schedmd.com/cons_res_share.html
  
 Some common examples are:  Some common examples are: 
Line 87: Line 100:
 #SBATCH -n 8     # tasks=S*C*T #SBATCH -n 8     # tasks=S*C*T
 #SBATCH -B 2:4:1 # S:C:T=sockets/node:cores/socket:threads/core #SBATCH -B 2:4:1 # S:C:T=sockets/node:cores/socket:threads/core
 +#SBATCH --mem=250           # needed to override oversubscribe
 +#SBATCH --ntasks-per-node=1 # perhaps needed to override oversubscribe
 +#SBATCH --cpus-per-task=1   # needed to override oversubscribe
 +
  
 GPU control GPU control
-#SBATCH --cpus-per-gpu=1 +#SBATCH --cpus-per-gpu=1                  # needed to override oversubscribe 
-#SBATCH --mem-per-gpu=7168+#SBATCH --mem-per-gpu=7168                # needed to override oversubscribe
 #SBATCH --gres=gpu:geforce_gtx_1080_ti: # n[78], amber128 #SBATCH --gres=gpu:geforce_gtx_1080_ti: # n[78], amber128
 #SBATCH --gres=gpu:geforce_rtx_2080_s:  # n[79-90], exx96 #SBATCH --gres=gpu:geforce_rtx_2080_s:  # n[79-90], exx96
Line 492: Line 509:
 #SBATCH --nodelist=n88 #SBATCH --nodelist=n88
  
 +# may or may not be needed, centos7 login env
 +source $HOME/.bashrc  
 +which ifort           # should be the parallel studio 2016 version
  
 # unique job scratch dirs # unique job scratch dirs
cluster/218.1658167960.txt.gz · Last modified: 2022/07/18 14:12 by hmeij07