techstaff:slurm
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | Next revisionBoth sides next revision | ||
techstaff:slurm [2018/03/08 10:02] – [Storage] kauffman | techstaff:slurm [2018/05/03 09:53] – [Using the GPU] kauffman | ||
---|---|---|---|
Line 245: | Line 245: | ||
====== Using the GPU ====== | ====== Using the GPU ====== | ||
+ | |||
+ | ===== GRES Multiple GPU's on one system ===== | ||
+ | Jobs will not be allocated any generic resources unless specifically requested at job submit time using the --gres option supported by the salloc, sbatch and srun commands. The option requires an argument specifying which generic resources are required and how many resources. The resource specification is of the form name[: | ||
+ | sbatch --gres=gpu: | ||
+ | |||
+ | Jobs will be allocated specific generic resources as needed to satisfy the request. If the job is suspended, those resources do not become available for use by other jobs. | ||
+ | |||
+ | Job steps can be allocated generic resources from those allocated to the job using the --gres option with the srun command as described above. By default, a job step will be allocated all of the generic resources allocated to the job. If desired, the job step may explicitly specify a different generic resource count than the job. This design choice was based upon a scenario where each job executes many job steps. If job steps were granted access to all generic resources by default, some job steps would need to explicitly specify zero generic resource counts, which we considered more confusing. The job step can be allocated specific generic resources and those resources will not be available to other job steps. A simple example is shown below. | ||
+ | |||
+ | |||
===== Paths ===== | ===== Paths ===== | ||
You will need to add the following to your $PATH and $LD_LIBRARY_PATH. | You will need to add the following to your $PATH and $LD_LIBRARY_PATH. |
/var/lib/dokuwiki/data/pages/techstaff/slurm.txt · Last modified: 2021/01/06 16:13 by kauffman