LSF Version 7.3 - Administering Platform LSF

About Resource Allocation Limits
390 Administering Platform LSF
Job limits Job limits, specified by JOBS in a Limit section in lsb.resources, correspond to
the maximum number of running and suspended jobs that can run at any point in
time. If both job limits and job slot limits are configured, the most restrictive limit
is applied.
Resource
reservation and
backfill
When processor or memory reservation occurs, the reserved resources count
against the limits for users, queues, hosts, projects, and processors. When
backfilling of parallel jobs occurs, the backfill jobs do not count against any limits.
MultiCluster Limits apply only to the cluster where lsb.resources is configured. If the cluster
leases hosts from another cluster, limits are enforced on those hosts as if they were
local hosts.
Switched jobs can
exceed resource
allocation limits
If a switched job (bswitch) has not been dispatched, then the job behaves as if it
were submitted to the new queue in the first place, and the JOBS limit is enforced
in the target queue.
If a switched job has been dispatched, then resource allocation limits like SWP.
TMP. and JOBS can be exceeded in the target queue. For example, given the
following JOBS limit configuration:
Begin Limit
USERS QUEUES SLOTS TMP JOBS
- normal - 20 2
- short - 20 2
End Limit
Submit 3 jobs to the normal queue, and 3 jobs to the short queue:
bsub -q normal -R"rusage[tmp=20]" sleep 1000
bsub -q short -R"rusage[tmp=20]" sleep 1000
bjobs shows 1 job in RUN state in each queue:
bjobs
JOBID USER STAT QUEUE FROM_HOST EXEC_HOST JOB_NAME SUBMIT_TIME
16 user1 RUN normal hosta hosta sleep 1000 Aug 30 16:26
17 user1 PEND normal hosta sleep 1000 Aug 30 16:26
18 user1 PEND normal hosta sleep 1000 Aug 30 16:26
19 user1 RUN short hosta hosta sleep 1000 Aug 30 16:26
20 user1 PEND short hosta sleep 1000 Aug 30 16:26
21 user1 PEND short hosta sleep 1000 Aug 30 16:26
blimits
shows the TMP limit reached:
blimits
INTERNAL RESOURCE LIMITS:
NAME USERS QUEUES SLOTS TMP JOBS
NONAME000 - normal - 20/20 1/2
NONAME001 - short - 20/20 1/2
Switch the running job in the normal queue to the short queue:
bswitch short 16