Differences

This shows you the differences between two versions of the page.

--- techstaff:slurm [2016/05/06 15:25] – [Partitions / Queues] kauffman
+++ techstaff:slurm [2016/05/09 15:06] – kauffman
@@ Line 142: / Line 142: @@
 Make sure to replace all instances of the word ''%%cnetid%%'' with your CNETID.
+=== Submitting job script ===
+Once you have tested the example above, save it to a file (here, 'hostname.job') and submit it with sbatch:
+<code>
+sbatch hostname.job
+</code>
+You can then check the job's status with squeue, or read its output in the output directory '$HOME/slurm/slurm_out'.
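As a rough sketch of that check, the snippet below lists your own queued jobs and then the output files written by one job. The helper name `job_output_files` is hypothetical. It assumes output files are named `<jobid>.<node>.stdout` and `<jobid>.<node>.stderr`, as the `%j.%N` pattern in the job scripts on this page would produce; adjust it if your `--output` and `--error` settings differ.

```shell
# Hypothetical helper (not part of SLURM): list the stdout/stderr files of a job.
# Assumes the <jobid>.<node>.stdout / <jobid>.<node>.stderr naming scheme.
job_output_files() {
  jobid="$1"
  outdir="${2:-$HOME/slurm/slurm_out}"   # default output directory from this page
  ls "$outdir"/"$jobid".*.stdout "$outdir"/"$jobid".*.stderr 2>/dev/null
}

# Show only your own jobs in the queue; skipped where squeue is not installed.
if command -v squeue >/dev/null 2>&1; then
  squeue -u "$USER"
fi
```

For example, `job_output_files 12345` lists the files for job 12345, where 12345 stands in for the job ID that sbatch printed on submission.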
 ==== srun ====
 Used to submit a job to the cluster that doesn't necessarily need a script.
@@ Line 219: / Line 226: @@
 | JOB <jobid> CANCELLED AT <time> DUE TO NODE FAILURE | There can be many reasons for this message, but most often it means that the node your job was set to run on can no longer be contacted by the SLURM controller.|
 | error: Unable to allocate resources: More processors requested than permitted | It usually has **nothing** to do with privileges you may or may not have. Rather, it usually means that you have requested more processors than a single compute node actually has. |
+====== Using the GPU ======
+===== Example =====
+This sbatch script compiles and runs a small CUDA program that prints device information for the installed Tesla GPU.
+<code>
+#!/bin/bash
+#
+#SBATCH --mail-user=cnetid@cs.uchicago.edu
+#SBATCH --mail-type=ALL
+#SBATCH --output=/home/cnetid/slurm/slurm_out/%j.%N.stdout
+#SBATCH --error=/home/cnetid/slurm/slurm_out/%j.%N.stderr
+#SBATCH --workdir=/home/cnetid/slurm
+#SBATCH --partition=gpu
+#SBATCH --job-name=get_tesla_info
+cat << 'EOF' > /tmp/getinfo.cu
+#include <stdio.h>
+int main() {
+  int nDevices;
+  cudaGetDeviceCount(&nDevices);
+  for (int i = 0; i < nDevices; i++) {
+    cudaDeviceProp prop;
+    cudaGetDeviceProperties(&prop, i);
+    printf("Device Number: %d\n", i);
+    printf("  Device name: %s\n", prop.name);
+    printf("  Memory Clock Rate (KHz): %d\n",
+           prop.memoryClockRate);
+    printf("  Memory Bus Width (bits): %d\n",
+           prop.memoryBusWidth);
+    printf("  Peak Memory Bandwidth (GB/s): %f\n\n",
+           2.0*prop.memoryClockRate*(prop.memoryBusWidth/8)/1.0e6);
+  }
+}
+EOF
+/usr/local/cuda/bin/nvcc /tmp/getinfo.cu -o /tmp/a.out
+/tmp/a.out
+rm /tmp/a.out
+rm /tmp/getinfo.cu
+</code>
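The peak memory bandwidth is conventionally estimated as twice the memory clock (for double data rate memory) times the bus width in bytes, divided by 1.0e6 to convert from kHz times bytes to GB/s. The sketch below repeats that arithmetic in shell so you can sanity-check the job's output. The function name `peak_bandwidth_gbs` and the sample values (a 3004000 kHz clock and a 384-bit bus) are illustrative assumptions, not measurements from the cluster's card.

```shell
# Hypothetical helper: estimate peak memory bandwidth in GB/s.
#   $1: memory clock rate in kHz (prop.memoryClockRate)
#   $2: memory bus width in bits (prop.memoryBusWidth)
peak_bandwidth_gbs() {
  awk -v clock_khz="$1" -v bus_bits="$2" \
    'BEGIN { printf "%.3f\n", 2.0 * clock_khz * int(bus_bits / 8) / 1.0e6 }'
}

# Illustrative values only; prints 288.384
peak_bandwidth_gbs 3004000 384
```

The `int()` call mirrors the integer division in the CUDA program, so both give the same result even if the bus width is not a multiple of 8.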
 ====== More ======
 If you feel this documentation is lacking in some way, please let techstaff know. Email [[techstaff@cs.uchicago.edu]], call (773-702-1031), or stop by our office (Ryerson 154).