techstaff:slurm
Differences
This shows you the differences between two versions of the page.
techstaff:slurm [2018/05/04 12:47] – [Partitions / Queues] kauffman | techstaff:slurm [2018/12/07 12:35] – [Using the GPU] kauffman | ||
---|---|---|---|
Line 112: | Line 112: | ||
| ^ SLURM ^ Example ^ | | ^ SLURM ^ Example ^ | ||
^ Submit a batch serial job | sbatch | sbatch runscript.sh | | ^ Submit a batch serial job | sbatch | sbatch runscript.sh | | ||
- | ^ Run a script | + | ^ Run a script |
^ Kill a job | scancel | scancel 4585 | | ^ Kill a job | scancel | scancel 4585 | | ||
^ View status of queues | squeue | squeue -u cnetid | | ^ View status of queues | squeue | squeue -u cnetid | | ||
Line 246: | Line 246: | ||
====== Using the GPU ====== | ====== Using the GPU ====== | ||
+ | [[ techstaff: | ||
+ | ===== CUDA_VISIBLE_DEVICES ===== | ||
+ | Do not set this variable. It will be set for you by SLURM. | ||
+ | |||
+ | The variable name is misleading: it does NOT mean the number of devices, but rather the physical device number assigned by the kernel (e.g. / | ||
+ | |||
+ | For example: if you requested multiple GPUs from SLURM (--gres=gpu: | ||
+ | |||
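A minimal sketch of reading the variable from inside a job without ever assigning it. The helper name `visible_gpu_ids` and the sample value `"2,3"` are hypothetical, and the sketch assumes the value is a comma-separated list of device identifiers. The point it illustrates is that the entries name which devices were assigned, so the count comes from the length of the list and not from any single number in it.

```python
import os
from typing import List, Mapping, Optional


def visible_gpu_ids(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the device identifiers listed in CUDA_VISIBLE_DEVICES.

    Read-only: the variable is never assigned here, since SLURM sets it.
    Assumption: the value is a comma-separated list of identifiers.
    """
    if env is None:
        env = os.environ
    raw = env.get("CUDA_VISIBLE_DEVICES", "")
    # Keep entries as strings and drop empty pieces left by stray commas.
    return [piece.strip() for piece in raw.split(",") if piece.strip()]


if __name__ == "__main__":
    ids = visible_gpu_ids()
    print(f"SLURM assigned {len(ids)} device(s): {ids}")
```

With a hypothetical value of `"2,3"`, the helper returns two entries: the job was given two devices, and they happen to be physical devices 2 and 3.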
===== GRES Multiple GPUs on one system ===== | ===== GRES Multiple GPUs on one system ===== | ||
Line 319: | Line 327: | ||
GRES: Don't depend on this being accurate; however, it will give you a clue as to how many generic resources are in a partition. | GRES: Don't depend on this being accurate; however, it will give you a clue as to how many generic resources are in a partition. | ||
+ | |||
+ | ==== Checking how many Generic RESources are being consumed ==== | ||
+ | |||
+ | Simply use the '' | ||
+ | < | ||
+ | $ squeue -O username, | ||
+ | USER NODELIST | ||
+ | someusername | ||
+ | otherusername | ||
+ | ... | ||
+ | </ | ||
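If you want a single total rather than a per-job listing, the GRES column can be summed with a few lines of Python. This is only a sketch under stated assumptions: the helper name `count_gpus` is hypothetical, and it assumes each GRES value looks like `gpu:2`, `gpu:tesla:2`, or a bare `gpu` (counted as one), with several resources separated by commas. Check the actual output on your cluster before relying on it.

```python
from typing import Iterable


def count_gpus(gres_values: Iterable[str]) -> int:
    """Sum the GPU counts found in a column of GRES strings.

    Assumed formats (not taken from the page above): "gpu:2",
    "gpu:tesla:2", or bare "gpu". Anything else is ignored.
    """
    total = 0
    for value in gres_values:
        for item in value.split(","):
            parts = item.strip().split(":")
            if parts[0] != "gpu":
                continue  # some other resource, or a placeholder such as "(null)"
            last = parts[-1]
            # A trailing number is the count; without one, assume a single GPU.
            total += int(last) if len(parts) > 1 and last.isdigit() else 1
    return total


if __name__ == "__main__":
    # Hypothetical column values, one per job line.
    print(count_gpus(["gpu:2", "gpu:tesla:1", "(null)"]))
```

The values passed in would come from whichever column of the listing holds the GRES request; how that column is extracted depends on the format string you give to squeue.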
/var/lib/dokuwiki/data/pages/techstaff/slurm.txt · Last modified: 2021/01/06 16:13 by kauffman