LSF Version 7.3 - Administering Platform LSF
Administering Platform LSF 15
About Platform LSF
Job A unit of work run in the LSF system. A job is a command submitted to LSF for
execution. LSF schedules, controls, and tracks the job according to configured
policies.
Jobs can be complex problems, simulation scenarios, extensive calculations,
anything that needs compute power.
Commands:
◆ bjobs—View jobs in the system
◆ bsub—Submit jobs
Job slot A job slot is a bucket into which a single unit of work is assigned in the LSF system.
If hosts are configured with a number of job slots, you can dispatch jobs from
queues until all the job slots are filled.
Commands:
◆ bhosts—View job slot limits for hosts and host groups
◆ bqueues—View job slot limits for queues
◆ busers—View job slot limits for users and user groups
Configuration:
◆ Define job slot limits in lsb.resources.
Job states LSF jobs have the following states:
◆ PEND—Waiting in a queue for scheduling and dispatch
◆ RUN—Dispatched to a host and running
◆ DONE—Finished normally with zero exit value
◆ EXIT—Finished with non-zero exit value
◆ PSUSP—Suspended while pending
◆ USUSP—Suspended by user
◆ SSUSP—Suspended by the LSF system
◆ POST_DONE—Post-processing completed without errors
◆ POST_ERR—Post-processing completed with errors
◆ UNKWN —mbatchd has lost contact with sbatchd on the host on which the
job runs
◆ WAIT—For jobs submitted to a chunk job queue, members of a chunk job that
are waiting to run
◆ ZOMBI—A job becomes ZOMBI if the execution host is unreachable for
various reasons
Queue A clusterwide container for jobs. All jobs wait in queues until they are scheduled
and dispatched to hosts.
Queues do not correspond to individual hosts; each queue can use all server hosts
in the cluster, or a configured subset of the server hosts.