Why are my jobs waiting due to QOSGrpCpuLimit?

On the DCSR clusters, long jobs (more than one day) are only allowed to occupy 2/3 of the CPUs at any one time.

This is for the following reasons:

When you submit a job it is automatically assigned a Quality of Service (QoS) policy which is used to apply this restriction.

If you see your jobs pending with the reason QOSGrpCpuLimit, it means that long-running jobs are currently occupying all the CPUs available to them. Your job will not start until some of those long jobs complete.
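
To check which of your pending jobs are held back for this reason, you can filter the output of `squeue`. The following is a minimal sketch, not an official DCSR tool. It assumes `squeue` is called with the format string `%i|%j|%r` (job ID, job name, pending reason) and that job names do not contain the `|` character. The function names and the sample job data are hypothetical and used only for illustration.

```python
import subprocess

REASON = "QOSGrpCpuLimit"


def jobs_waiting_on_qos(squeue_output, reason=REASON):
    """Return (job_id, job_name) pairs whose pending reason equals `reason`.

    Expects one job per line in the form "jobid|name|reason".
    """
    waiting = []
    for line in squeue_output.splitlines():
        parts = line.strip().split("|")
        if len(parts) == 3 and parts[2] == reason:
            waiting.append((parts[0], parts[1]))
    return waiting


def query_squeue(user):
    """Run squeue for one user's pending jobs (only works on a Slurm login node)."""
    # -h: no header, -t PENDING: pending jobs only,
    # -o: job id (%i), job name (%j) and pending reason (%r)
    cmd = ["squeue", "-h", "-u", user, "-t", "PENDING", "-o", "%i|%j|%r"]
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


# Hypothetical sample output, made up for illustration only
sample = "1234567|long_sim|QOSGrpCpuLimit\n1234568|short_test|Priority\n"
print(jobs_waiting_on_qos(sample))  # prints [('1234567', 'long_sim')]
```

On a cluster login node, `jobs_waiting_on_qos(query_squeue("your_username"))` would list your own jobs that are waiting on this limit.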

Revision #3
Created 21 April 2020 06:46:08 by Ewan Roche
Updated 21 April 2020 10:42:09 by Ewan Roche