techstaff:slurm

Differences

This shows you the differences between two versions of the page.

techstaff:slurm [2015/12/04 13:24] – created kauffman
techstaff:slurm [2015/12/07 16:57] kauffman
Line 37 (old) / Line 37 (new):
 
 ==== Storage ====
-Available network storage info.
-
-Mounted on /mnt/scratch via NFSv4 on a 1Gb network.
+Shared scratch storage is being planned, but not yet available. Techstaff hopes to have this done in time for the winter quarter.
 
 ==== Utilization Dashboard ====
Line 184 (old) / Line 182 (new):
 | JOB <jobid> CANCELLED AT <time> DUE TO NODE FAILURE | There can be many reasons for this message, but most often it means that the node your job was set to run on can no longer be contacted by the SLURM controller. |
 | error: Unable to allocate resources: More processors requested than permitted | It usually has **nothing** to do with privileges you may or may not have. Rather, it usually means that you have requested more processors than a single compute node actually has. |
+
+===== More =====
+If you feel this documentation is lacking in some way, please let techstaff know. Email (techstaff@cs.uchicago.edu), call (773-702-1031), or stop by our office (Ryerson 154).
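
The "More processors requested than permitted" row above says the usual cause is asking for more processors than a single compute node has. A minimal sketch of that check follows, assuming per-node CPU counts are available as text in the form `<hostname> <cpus>` (for example from `sinfo -N -h -o "%n %c"`). The node names, the counts, and the helper functions `parse_sinfo_cpus` and `fits_on_one_node` are hypothetical and are not part of this cluster's documentation.

```python
from typing import Dict


def parse_sinfo_cpus(sinfo_output: str) -> Dict[str, int]:
    """Parse lines of the form '<hostname> <cpus>' into {hostname: cpus}."""
    cpus = {}
    for line in sinfo_output.strip().splitlines():
        host, count = line.split()
        cpus[host] = int(count)
    return cpus


def fits_on_one_node(requested_cpus: int, cpus_per_node: Dict[str, int]) -> bool:
    """Return True if at least one node has enough CPUs for the request."""
    return any(requested_cpus <= n for n in cpus_per_node.values())


# Hypothetical sample data; real output would come from the cluster itself.
SAMPLE = """\
node01 16
node02 16
node03 32
"""

nodes = parse_sinfo_cpus(SAMPLE)
print(fits_on_one_node(24, nodes))  # some node has at least 24 CPUs
print(fits_on_one_node(64, nodes))  # no node has 64 CPUs
```

If the check fails for a single-node job, reducing the requested processor count to the largest per-node value is the likely fix.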
/var/lib/dokuwiki/data/pages/techstaff/slurm.txt · Last modified: 2021/01/06 16:13 by kauffman