User Tools

Site Tools


techstaff:aicluster-admin

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
techstaff:aicluster-admin [2021/02/10 13:56] – [How we modify job priority to favor contributors] kauffmantechstaff:aicluster-admin [2021/02/23 19:58] (current) kauffman
Line 1: Line 1:
 ====== AI Cluster Policy Description ====== ====== AI Cluster Policy Description ======
- 
-After various iterations we (Bob, Har, and I) believe to have an 
-implementation of the policy that meets the requirements discussed 
-previously. 
- 
 ===== TODO ===== ===== TODO =====
   - There are multiple methods used to calculate priority reflected on the spreadsheet.   - There are multiple methods used to calculate priority reflected on the spreadsheet.
Line 80: Line 75:
 <code> <code>
 PartitionName=general Nodes=a[001-008] PartitionName=general Nodes=a[001-008]
-PartitionName=cdac-own Nodes=a[005-008] AllowGroups=cdac Priority=100+#PartitionName=cdac-own Nodes=a[005-008] AllowGroups=cdac Priority=100
 PartitionName=cdac-contrib Nodes=a[001-008] AllowGroups=cdac Priority=5 PartitionName=cdac-contrib Nodes=a[001-008] AllowGroups=cdac Priority=5
 </code> </code>
Line 86: Line 81:
 ^Partition^Description^Priority^ ^Partition^Description^Priority^
 |general| For all users| 0 | |general| For all users| 0 |
-|${group}-own | Machines $group has donated | 100 |+|${group}-own | Machines $group has donated. Enabled when asked. | 100 |
 |${group}-contrib | A method to give slightly higher job priority to groups who have donated but do not own machines.| Variable based on spreadsheet calculation. | |${group}-contrib | A method to give slightly higher job priority to groups who have donated but do not own machines.| Variable based on spreadsheet calculation. |
  
Line 103: Line 98:
  
 The percent will end up as an integer. The percent will end up as an integer.
 +
 +There is a [[https://vcs.cs.uchicago.edu/kauffman/slurm-tools/blob/master/cluster_partition_usage.py|python script]] that does this and sends techstaff a report. The repo is currently not available for everyone to see but I think that it should be eventually. In the mean time you can take a look at it on the front end nodes (/usr/local/slurm-tools/cluster_partition_usage.py).
  
  
/var/lib/dokuwiki/data/attic/techstaff/aicluster-admin.1612986975.txt.gz · Last modified: 2021/02/10 13:56 by kauffman

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki