  * August 2022 is designated **migration** period
  * Queues ''hp12'' and ''mwgpu'' (centos6) will be serviced by Openlava, not Slurm

==== Quick Start Slurm Guide ====

Jump to the **Rocky8/CentOS7 script templates** listed in the menu of this page, top right.

There is also detailed information on Amber20/Amber22 on this page, with script examples.

  * [[cluster:214|Tada]] new head node
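
For first-time users, a minimal submit script looks like the sketch below (the partition name and resource values are placeholders, not site settings; the script templates linked above have the real ones):

<code bash>
#!/bin/bash
#SBATCH --job-name=myfirstjob
#SBATCH --partition=test    # placeholder, use a queue from this page
#SBATCH -n 1                # one task
#SBATCH --cpus-per-task=1   # one core
#SBATCH --mem=1024          # MB of memory

date
hostname
</code>

Submit with ''sbatch myscript.sh''; output lands in ''slurm-<jobid>.out'' in the submission directory by default.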
  
==== Basic Commands ====
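
A few everyday commands (a quick reference, not an exhaustive list):

<code bash>
sbatch myscript.sh    # submit a job script
squeue                # list pending and running jobs
squeue -u $USER       # list only your jobs
scancel <jobid>       # cancel a job
sinfo                 # show partitions (queues) and node states
</code>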
  
You must request **resources**, for example the number of CPU cores or which GPU model to use. **If you do not request resources, Slurm will assume you need all of the node's resources** and thus prevent other jobs from running on that node.

Details

  * https://slurm.schedmd.com/cons_res_share.html
  
Some common examples are:
<code bash>
#SBATCH -n 8     # tasks=S*C*T
#SBATCH -B 2:4:1 # S:C:T=sockets/node:cores/socket:threads/core
#SBATCH --mem=250           # needed to override oversubscribe
#SBATCH --ntasks-per-node=1 # perhaps needed to override oversubscribe
#SBATCH --cpus-per-task=1   # needed to override oversubscribe
</code>

GPU control

<code bash>
#SBATCH --cpus-per-gpu=1                # needed to override oversubscribe
#SBATCH --mem-per-gpu=7168              # needed to override oversubscribe
#SBATCH --gres=gpu:geforce_gtx_1080_ti: # n[78], amber128
#SBATCH --gres=gpu:geforce_rtx_2080_s:  # n[79-90], exx96
</code>
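
Putting the GPU directives together, a hedged sketch of a single-GPU job (the partition name ''exx96'' is inferred from the node comment above, and the application line is a placeholder):

<code bash>
#!/bin/bash
#SBATCH --job-name=gputest
#SBATCH --partition=exx96                 # assumption: queue for n[79-90]
#SBATCH -n 1
#SBATCH --cpus-per-gpu=1                  # needed to override oversubscribe
#SBATCH --mem-per-gpu=7168                # needed to override oversubscribe
#SBATCH --gres=gpu:geforce_rtx_2080_s:1   # request one RTX 2080S

nvidia-smi                                # show the GPU Slurm allocated
# ./my_gpu_app                            # placeholder for the real workload
</code>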
----
<code bash>
#SBATCH --nodelist=n88

# may or may not be needed, centos7 login env
source $HOME/.bashrc
which ifort           # should be the parallel studio 2016 version

# unique job scratch dirs
</code>
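
The scratch directory setup typically looks like the sketch below (the ''/localscratch'' base path is an assumption; use whatever local filesystem your nodes provide):

<code bash>
# assumption: /localscratch exists on the compute node
export MYLOCALSCRATCH=/localscratch/$SLURM_JOB_ID
mkdir -p $MYLOCALSCRATCH
cd $MYLOCALSCRATCH

# ... copy inputs here, run the job, copy results back ...

cd
rm -rf $MYLOCALSCRATCH   # clean up when the job ends
</code>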