slurm:ai

Differences

This shows you the differences between two versions of the page.

slurm:ai [2021/01/06 16:10] kauffman
slurm:ai [2021/02/10 14:33] – [Fairshare/QOS] kauffman
Line 5: Line 5:
 Feedback is requested: Feedback is requested:
  
- [[https://discord.gg/ZVjX8Gv|#ai-cluster Discord channel]] or email Phil Kauffman (kauffman@cs dot uchicago dot edu).
+ [[https://discord.gg/ZVjX8Gv|#slurm Discord channel]]
  
-Knowledge of how to use Slurm already is preferred at this stage of testing. 
  
  
-The information from the older cluster mostly applies and I suggest you read that documentation: https://howto.cs.uchicago.edu/techstaff:slurm
+The information from the older cluster mostly applies and I suggest you read that documentation: https://howto.cs.uchicago.edu/slurm
  
  
 ====== Infrastructure ====== ====== Infrastructure ======
-Summary of nodes installed on the cluster
+Summary of nodes installed on the cluster
 + 
 +[[ http://monitor.ai.cs.uchicago.edu|Ganglia Monitoring ]]
  
 ===== Computer/GPU Nodes ===== ===== Computer/GPU Nodes =====
Line 122: Line 123:
 0,1,2,3 0,1,2,3
 </code> </code>
- 
-==== Notes on CUDA_VISIBLE_DEVICES ==== 
-CUDA_VISIBLE_DEVICES: Displays relative gpu device number available to you.  
- 
-  * This variable should NOT be modified. Ever. 
-  * Relative means that if you requested one gpu it will show up as 0. Even if all other gpus on the server are being used by others. 
  
  
-===== Fairshare/QOS ===== 
-By default all usage is tracked and charged to a user's default account. A fairshare value is computed and used in prioritizing a job on submission.
  
-Details are being worked out for anyone that donates to the cluster. This will be some sort of tiered system where you get to use a higher priority when you need it. 
-You will need to charge an account on job submission ''%%--account=<name>%%'' and most likely select the priority level you wish to use and that you are allowed to use: ''%%--qos=<level>%%'' 
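As a rough illustration of the submission flags mentioned above, the following sketch assembles an ''sbatch'' command line with ''--account'' and an optional ''--qos''. The helper ''sbatch_command'', the script name, and the account and QOS values are placeholders chosen for the example; the actual account names and priority levels were still being worked out at the time of this revision.

```python
import shlex


def sbatch_command(script, account, qos=None):
    """Build an sbatch argument list that charges `account`.

    Hypothetical helper: `account` and `qos` are placeholders for whatever
    names the cluster administrators assign.
    """
    cmd = ["sbatch", f"--account={account}"]
    if qos is not None:
        # Only request a QOS level when one is given explicitly.
        cmd.append(f"--qos={qos}")
    cmd.append(script)
    return cmd


if __name__ == "__main__":
    # Placeholder values; substitute your own account and QOS level.
    print(shlex.join(sbatch_command("train.sh", "my-account", "high")))
```

Returning a list rather than a single string keeps the arguments safe to pass to ''subprocess.run'' without shell quoting issues.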
  
  
/var/lib/dokuwiki/data/pages/slurm/ai.txt · Last modified: 2022/04/04 10:58 by chaochunh
