LSF Version 7.3 - Administering Platform LSF

About Platform LSF
Job
A unit of work run in the LSF system. A job is a command submitted to LSF for
execution. LSF schedules, controls, and tracks the job according to configured
policies.
Jobs can be complex problems, simulation scenarios, extensive calculations, or
anything else that needs compute power.
Commands:
bjobs: View jobs in the system
bsub: Submit jobs
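As a rough illustration of submitting a command as a job, the sketch below assembles a bsub command line without running it. The helper name build_bsub_argv, the queue name "normal", and the job name "demo" are hypothetical; the sketch assumes only the bsub options -q (queue) and -J (job name).

```python
import shlex


def build_bsub_argv(command, queue=None, job_name=None):
    """Return the argument list for a bsub submission (nothing is executed).

    build_bsub_argv is a hypothetical helper, not part of LSF.
    """
    argv = ["bsub"]
    if queue is not None:
        argv += ["-q", queue]      # submit to a specific queue
    if job_name is not None:
        argv += ["-J", job_name]   # assign a job name
    argv += shlex.split(command)   # the command LSF will run as the job
    return argv


# "normal" and "demo" are placeholder names chosen for this example.
argv = build_bsub_argv("sleep 60", queue="normal", job_name="demo")
print(shlex.join(argv))
```

On a cluster, the resulting list could be passed to subprocess.run, and bjobs would then show the submitted job.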
Job slot
A job slot is a bucket into which a single unit of work is assigned in the LSF system.
If hosts are configured with a number of job slots, you can dispatch jobs from
queues until all the job slots are filled.
Commands:
bhosts: View job slot limits for hosts and host groups
bqueues: View job slot limits for queues
busers: View job slot limits for users and user groups
Configuration:
Define job slot limits in lsb.resources.
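The idea of dispatching jobs until all job slots are filled can be sketched as a toy model. This is not how the LSF scheduler is implemented: it assumes every job occupies exactly one slot, ignores all scheduling policies, and uses hypothetical host and job names.

```python
from collections import deque


def dispatch(pending, free_slots):
    """Place pending jobs on hosts until every job slot is filled.

    pending:    deque of job names in queue order (modified in place)
    free_slots: dict mapping host name to free job slots (modified in place)
    Returns a list of (job, host) placements; jobs that do not fit stay pending.
    """
    placements = []
    for host in free_slots:
        # Fill this host's slots one job at a time.
        while free_slots[host] > 0 and pending:
            job = pending.popleft()
            free_slots[host] -= 1
            placements.append((job, host))
    return placements


# Hypothetical cluster: hostA has 2 job slots, hostB has 1.
pending = deque(["job1", "job2", "job3", "job4"])
slots = {"hostA": 2, "hostB": 1}
placed = dispatch(pending, slots)
print(placed)         # three jobs are placed
print(list(pending))  # the fourth job keeps waiting
```

With three slots and four jobs, the last job remains in the queue until a slot is freed.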
Job states
LSF jobs have the following states:
PEND: Waiting in a queue for scheduling and dispatch
RUN: Dispatched to a host and running
DONE: Finished normally with zero exit value
EXIT: Finished with non-zero exit value
PSUSP: Suspended while pending
USUSP: Suspended by user
SSUSP: Suspended by the LSF system
POST_DONE: Post-processing completed without errors
POST_ERR: Post-processing completed with errors
UNKWN: mbatchd has lost contact with sbatchd on the host on which the
job runs
WAIT: For jobs submitted to a chunk job queue, members of a chunk job that
are waiting to run
ZOMBI: A job becomes ZOMBI if the execution host is unreachable for
various reasons
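For scripts that post-process job listings, the states above can be held in a small lookup structure. The following is a minimal sketch: the JobState class and the is_suspended helper are hypothetical names, and the descriptions are shortened from the list above.

```python
from enum import Enum


class JobState(Enum):
    """LSF job states, keyed by the name shown in job listings."""
    PEND = "Waiting in a queue for scheduling and dispatch"
    RUN = "Dispatched to a host and running"
    DONE = "Finished normally with zero exit value"
    EXIT = "Finished with non-zero exit value"
    PSUSP = "Suspended while pending"
    USUSP = "Suspended by user"
    SSUSP = "Suspended by the LSF system"
    POST_DONE = "Post-processing completed without errors"
    POST_ERR = "Post-processing completed with errors"
    UNKWN = "mbatchd has lost contact with sbatchd on the execution host"
    WAIT = "Chunk job member waiting to run"
    ZOMBI = "Execution host is unreachable"


# The three suspended states from the list above.
SUSPENDED = {JobState.PSUSP, JobState.USUSP, JobState.SSUSP}


def is_suspended(state):
    """Return True if the state is one of the suspended states."""
    return state in SUSPENDED


state = JobState["SSUSP"]           # look up a state by its name
print(state.name, "-", state.value)
print(is_suspended(state))
```

Looking up a name that is not in the list raises KeyError, which makes unexpected state strings easy to spot.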
Queue
A clusterwide container for jobs. All jobs wait in queues until they are scheduled
and dispatched to hosts.
Queues do not correspond to individual hosts; each queue can use all server hosts
in the cluster, or a configured subset of the server hosts.
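The relationship between a queue and the server hosts can be illustrated with a short sketch. The function eligible_hosts and the host names are hypothetical; the sketch only models the statement that a queue uses either all server hosts or a configured subset of them.

```python
def eligible_hosts(server_hosts, queue_hosts=None):
    """Return the server hosts a queue can use.

    server_hosts: all server hosts in the cluster, in order
    queue_hosts:  configured subset for the queue, or None for all hosts
    """
    if queue_hosts is None:
        return list(server_hosts)   # no subset configured: use every host
    allowed = set(queue_hosts)
    # Keep cluster order and ignore names that are not server hosts.
    return [host for host in server_hosts if host in allowed]


cluster = ["hostA", "hostB", "hostC"]
print(eligible_hosts(cluster))                      # queue using all hosts
print(eligible_hosts(cluster, ["hostC", "hostA"]))  # queue using a subset
```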