techstaff:slurm
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
techstaff:slurm [2018/05/04 12:32] – kauffman | techstaff:slurm [2018/05/04 12:47] – [Partitions / Queues] kauffman | ||
---|---|---|---|
Line 98: | Line 98: | ||
| **debug** | The partition your job will be submitted to if none is specified. The purpose of this partition is to make sure your code is running as it should before submitting a long running job to the general queue. | | | **debug** | The partition your job will be submitted to if none is specified. The purpose of this partition is to make sure your code is running as it should before submitting a long running job to the general queue. | | ||
| **general** | All jobs that have been thoroughly tested can be submitted here. This partition will have access to more nodes and will process most of the jobs. If you need to use the '' | | **general** | All jobs that have been thoroughly tested can be submitted here. This partition will have access to more nodes and will process most of the jobs. If you need to use the '' | ||
- | | **gpu** | Contains servers with graphics cards. As of May 2016 there is only one node containing a Tesla M2090. You will be forced to use this server exclusively for now. Please keep your time in interactive mode to a minimum.| | + | | **pascal** | 2018-05-04: 1x Nvidia GTX1080. You will be forced to use this server exclusively for now. Please keep your time in interactive mode to a minimum.| |
+ | | **titan** | 2018-05-04: 4x Nvidia GTX1080Ti. This partition is shared and you MUST use the '' | ||
====== Job Submission ====== | ====== Job Submission ====== | ||
Line 268: | Line 269: | ||
Example when using tensorflow: | Example when using tensorflow: | ||
- | Give the file ' | + | Given the file '' |
- | Depends on: | + | < |
- | | + | # |
- | '' | + | from tensorflow.python.client import device_lib |
- | < | + | print(device_lib.list_local_devices()) |
- | # | + | </ |
- | from tensorflow.python.client import device_lib | + | |
- | print(device_lib.list_local_devices()) | + | |
- | </ | + | |
Here we can see that no GPU was allocated to us because we did not specify the '' | Here we can see that no GPU was allocated to us because we did not specify the '' | ||
< | < | ||
- | kauffman3@bulldozer: | + | user@bulldozer: |
- | | + | user@gpu3:~$ ./f 2>&1 | grep physical_device_desc |
- | | + | user@gpu3:~$ |
</ | </ | ||
If we request only 1 GPU. | If we request only 1 GPU. | ||
< | < | ||
- | kauffman3@bulldozer: | + | user@bulldozer: |
- | | + | user@gpu3:~$ ./f 2>&1 | grep physical_device_desc |
- | physical_device_desc: | + | physical_device_desc: |
</ | </ | ||
If we request 2 GPUs. | If we request 2 GPUs. | ||
< | < | ||
- | kauffman3@bulldozer: | + | user@bulldozer: |
- | kauffman3@gpu3:~$ ./f 2>&1 | grep physical_device_desc | + | user@gpu3:~$ ./f 2>&1 | grep physical_device_desc |
- | physical_device_desc: | + | physical_device_desc: |
- | physical_device_desc: | + | physical_device_desc: |
</ | </ | ||
If we request more GPUs then are available. | If we request more GPUs then are available. | ||
< | < | ||
- | | + | kauffman3@bulldozer: |
- | srun: error: Unable to allocate resources: Requested node configuration is not available | + | srun: error: Unable to allocate resources: Requested node configuration is not available |
</ | </ | ||
Line 324: | Line 322: | ||
===== Paths ===== | ===== Paths ===== | ||
- | You will need to add the following to your $PATH and $LD_LIBRARY_PATH. | + | You will need to add the following to your '' |
export PATH=$PATH:/ | export PATH=$PATH:/ |
/var/lib/dokuwiki/data/pages/techstaff/slurm.txt · Last modified: 2021/01/06 16:13 by kauffman