This is an old revision of the document!
The Simple Linux Utility for Resource Management (SLURM) is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters. The architecture is described here https://computing.llnl.gov/linux/slurm/quickstart.html.
Then I created a simple file to test Slurm
#!/bin/bash #SBATCH --time=1:30:10 #SBATCH --job-name="NUMBER" #SBATCH --output="tmp/outNUMBER" #SBATCH --begin=10:35:00 echo "$SLURMD_NODENAME JOB_PID=$SLURM_JOB_ID" echo DONE
Slurm is installed on a PE2950 with dual quad cores and 16 GB. It is part of my high priority queue and allocated to Openlava (v2.2).
My nodes are created in a virtual KVM environment also on a PE2950 (2.6 Ghz, 16 GB ram) dual quad core with hyperthreading and virtualization turned on in the BIOS. Comments on how to build that KVM environment are here LXC Linux Containers