Platform LSF Administration Guide Version 6.2

Chapter 1
About Platform LSF
Administering Platform LSF
Master host
The LSF server host where the master LIM and mbatchd run, and which acts as the overall
coordinator for the cluster. Each cluster has one master host, which does all job
scheduling and dispatch. If the master host goes down, another LSF server in the cluster
becomes the master host.
All LSF daemons run on the master host. The LIM on the master host is the master
LIM.
Commands
lsid—View the master host name
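As a minimal sketch, a script can pull the master host name out of lsid output. The sketch assumes lsid prints a line of the form "My master name is <host>". The function name parse_master_name, the sample text, and the host and cluster names are hypothetical.

```python
import re


def parse_master_name(lsid_output):
    """Return the master host name found in lsid output, or None if absent."""
    # Assumed line format: "My master name is <host>"
    match = re.search(r"^My master name is (\S+)\s*$", lsid_output, re.MULTILINE)
    return match.group(1) if match else None


# Illustrative sample only; not captured from a real cluster.
SAMPLE = "My cluster name is cluster1\nMy master name is hosta\n"

print(parse_master_name(SAMPLE))
```

On a live cluster, the text passed to the function could come from subprocess.run(["lsid"], capture_output=True, text=True).stdout.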
Configuration
The master host is the first host listed in the lsf.cluster.cluster_name file or
is defined along with other candidate master hosts by LSF_MASTER_LIST in
lsf.conf.
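The following sketch shows one way to read the candidate master hosts from the text of lsf.conf. It assumes LSF_MASTER_LIST is written as a quoted, space-separated list of host names on a single line. The function name master_candidates and the host names are hypothetical.

```python
def master_candidates(lsf_conf_text):
    """Return the hosts named by LSF_MASTER_LIST, in the order listed."""
    for raw in lsf_conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, sep, value = line.partition("=")
        if sep and name.strip() == "LSF_MASTER_LIST":
            # Drop the surrounding quotes, then split on whitespace.
            return value.strip().strip('"').split()
    return []  # parameter not set in this file


# Hypothetical excerpt from lsf.conf.
SAMPLE_CONF = '''
# candidate master hosts
LSF_MASTER_LIST="hosta hostb hostc"
'''

print(master_candidates(SAMPLE_CONF))
```

An empty result means the parameter is not set in the file, in which case the master host is the first host listed in the lsf.cluster.cluster_name file.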
LSF daemons
mbatchd
Master Batch Daemon running on the master host. Started by sbatchd. Responsible
for the overall state of jobs in the system.
Receives job submission and information query requests. Manages jobs held in queues.
Dispatches jobs to hosts as determined by mbschd.
Configuration
Port number defined in lsf.conf.
mbschd
Master Batch Scheduler Daemon running on the master host. Works with mbatchd.
Started by mbatchd.
Makes scheduling decisions based on job requirements and policies.
sbatchd
Slave Batch Daemon running on each server host. Receives the request to run the job
from mbatchd and manages local execution of the job. Responsible for enforcing local
policies and maintaining the state of jobs on the host.
sbatchd forks a child sbatchd for every job. The child sbatchd runs an instance of
res to create the execution environment in which the job runs. The child sbatchd
exits when the job is complete.
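The fork-per-job pattern described above can be illustrated with a small POSIX sketch. This is not LSF code: subprocess stands in for res, and the names start_job and wait_for_job are hypothetical. It only shows a parent that forks one child per job, with the child exiting when its job completes.

```python
import os
import subprocess
import sys


def start_job(command):
    """Fork a child that runs one job, loosely mirroring the child sbatchd."""
    pid = os.fork()
    if pid == 0:
        # Child process: run the job, then exit with the job's status.
        # In LSF the child sbatchd runs res here; subprocess is a stand-in.
        try:
            status = subprocess.call(command)
        except OSError:
            status = 127  # command could not be started
        os._exit(status if 0 <= status <= 255 else 1)
    return pid  # Parent continues and can start further jobs.


def wait_for_job(pid):
    """Block until the child exits and return its exit code (-1 if signaled)."""
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status) if os.WIFEXITED(status) else -1


if __name__ == "__main__":
    child = start_job([sys.executable, "-c", "print('job running')"])
    print("job exit code:", wait_for_job(child))
```

Because each job gets its own child, a failure in one job's process does not affect the parent or the children handling other jobs.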
Commands
badmin hstartup—Starts sbatchd
badmin hshutdown—Shuts down sbatchd
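For scripted administration, the two commands above can be wrapped in a small helper. This sketch assumes that host names may be passed as trailing arguments to badmin. The helper names badmin_command and run_badmin are hypothetical, and run_badmin only works on a host with LSF installed and sufficient privileges.

```python
import subprocess


def badmin_command(action, hosts=()):
    """Build the argument list for badmin hstartup or badmin hshutdown."""
    if action not in ("hstartup", "hshutdown"):
        raise ValueError("unsupported action: %r" % (action,))
    # Assumption: host names follow the subcommand as plain arguments.
    return ["badmin", action, *hosts]


def run_badmin(action, hosts=()):
    """Run the command; raises CalledProcessError if badmin fails."""
    return subprocess.run(badmin_command(action, hosts), check=True)


print(badmin_command("hstartup", ["hosta"]))
```

Keeping the command construction separate from execution makes it possible to log or review the exact command line before it is run.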
LSF daemons and their roles:
mbatchd         job requests and dispatch
mbschd          job scheduling
sbatchd, res    job execution
lim             host information
pim             job process information