LSF Version 7.3 - Administering Platform LSF

Cluster Concepts
Master host
The host where the master LIM and mbatchd run. An LSF server host that acts as the
overall coordinator for the cluster. Each cluster has one master host, which does all
job scheduling and dispatch. If the master host goes down, another LSF server in the
cluster becomes the master host.
All LSF daemons run on the master host. The LIM on the master host is the master LIM.
Commands:
lsid - View the master host name
Configuration:
The master host is defined along with other candidate master hosts by
LSF_MASTER_LIST in lsf.conf.
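The failover behavior described above can be illustrated with a minimal Python sketch. It is not LSF code: the host names, the sample lsf.conf fragment, and the helper names parse_master_list and pick_master are hypothetical, and the sketch assumes that candidates are tried in the order they appear in LSF_MASTER_LIST.

```python
import re


def parse_master_list(lsf_conf_text):
    """Return the candidate master hosts from LSF_MASTER_LIST, in listed order."""
    for line in lsf_conf_text.splitlines():
        line = line.strip()
        if line.startswith("#"):
            continue  # skip comment lines
        match = re.match(r'LSF_MASTER_LIST\s*=\s*"?([^"#]*)"?', line)
        if match:
            return match.group(1).split()
    return []


def pick_master(candidates, is_up):
    """Return the first candidate for which is_up(host) is true, else None.

    Assumption: candidates are tried in list order.
    """
    for host in candidates:
        if is_up(host):
            return host
    return None


# Hypothetical lsf.conf fragment; the host names are made up for illustration.
SAMPLE_CONF = '''
# candidate master hosts
LSF_MASTER_LIST="hosta hostb hostc"
'''

candidates = parse_master_list(SAMPLE_CONF)
down_hosts = {"hosta"}  # pretend the first candidate has gone down
master = pick_master(candidates, lambda host: host not in down_hosts)
print(candidates, master)
```

With hosta marked as down, the sketch falls through to the next listed candidate, mirroring the statement that another LSF server takes over when the master host goes down.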
LSF daemons
mbatchd
Master Batch Daemon running on the master host. Started by sbatchd.
Responsible for the overall state of jobs in the system.
Receives job submission and information query requests. Manages jobs held in
queues. Dispatches jobs to hosts as determined by mbschd.
Configuration:
Port number defined in lsf.conf.
mbschd
Master Batch Scheduler Daemon running on the master host. Works with mbatchd.
Started by mbatchd.
Makes scheduling decisions based on job requirements and policies.
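The division of labor between mbatchd and mbschd can be pictured with a toy Python sketch. Everything in it is an assumption made for illustration: the Job class, the slots_needed field standing in for "job requirements", and the first-fit policy are hypothetical and do not represent the actual LSF scheduling algorithm.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Job:
    job_id: int
    slots_needed: int  # hypothetical stand-in for "job requirements"


def schedule(job, free_slots):
    """Scheduler role (like mbschd): decide which host should run the job.

    Toy policy: the first host, in name order, with enough free slots.
    """
    for host in sorted(free_slots):
        if free_slots[host] >= job.slots_needed:
            return host
    return None


def dispatch_all(queue, free_slots):
    """Batch daemon role (like mbatchd): hold jobs in a queue and dispatch
    each one to the host chosen by the scheduler. Jobs that fit nowhere
    stay pending."""
    dispatched, pending = {}, deque()
    while queue:
        job = queue.popleft()
        host = schedule(job, free_slots)
        if host is None:
            pending.append(job)
        else:
            free_slots[host] -= job.slots_needed
            dispatched[job.job_id] = host
    return dispatched, pending


queue = deque([Job(1, 2), Job(2, 4), Job(3, 1)])
free_slots = {"hosta": 2, "hostb": 1}
dispatched, pending = dispatch_all(queue, free_slots)
print(dispatched, [job.job_id for job in pending])
```

The point of the split is that the queue owner only acts on decisions; the placement policy lives in one function and could be swapped without touching the dispatch loop.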
sbatchd
Slave Batch Daemon running on each server host. Receives the request to run the
job from mbatchd and manages local execution of the job. Responsible for
enforcing local policies and maintaining the state of jobs on the host.
sbatchd forks a child sbatchd for every job. The child sbatchd runs an instance of
res to create the execution environment in which the job runs. The child sbatchd
exits when the job is complete.
Commands:
badmin hstartup—Starts sbatchd
badmin hshutdown—Shuts down sbatchd
badmin hrestart—Restarts sbatchd
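The one-child-per-job model described for sbatchd can be sketched in Python as follows. This is only an analogy under stated assumptions: it uses subprocess rather than fork, it does not model res or the execution environment, and the job IDs, commands, and the run_jobs helper are hypothetical.

```python
import subprocess
import sys


def run_jobs(commands):
    """Start one child process per job, then wait for each child to exit.

    Returns a dict mapping each job ID to the exit code of its child.
    """
    children = {}
    for job_id, argv in commands.items():
        # One child per job; the child lives only as long as the job runs.
        children[job_id] = subprocess.Popen(argv)
    return {job_id: child.wait() for job_id, child in children.items()}


# Hypothetical jobs: one succeeds, one exits with status 3.
jobs = {
    101: [sys.executable, "-c", "print('job 101 running')"],
    102: [sys.executable, "-c", "import sys; sys.exit(3)"],
}
exit_codes = run_jobs(jobs)
print(exit_codes)
```

Because each job has its own child, the parent can track per-job state (here, the exit code) without one job's failure affecting the others.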
LSF daemon      Role
mbatchd         Job requests and dispatch
mbschd          Job scheduling
sbatchd, res    Job execution
lim             Host information
pim             Job process information
elim            Collect and track custom dynamic load indices