LSF Version 7.3 - Administering Platform LSF
Cluster Concepts
20 Administering Platform LSF
◆ lsadmin limrestart—Restarts LIM
◆ lsload—View dynamic load values
◆ lshosts—View static host load values
Configuration:
◆ Port number defined in lsf.conf.
Master LIM The LIM running on the master host. Receives load information from the LIMs
running on hosts in the cluster.
Forwards load information to
mbatchd, which forwards this information to mbschd
to support scheduling decisions. If the master LIM becomes unavailable, a LIM on
another host automatically takes over.
Commands:
◆ lsadmin limstartup—Starts LIM
◆ lsadmin limshutdown—Shuts down LIM
◆ lsadmin limrestart—Restarts LIM
◆ lsload—View dynamic load values
◆ lshosts—View static host load values
Configuration:
◆ Port number defined in lsf.conf.
ELIM External LIM (ELIM) is a site-definable executable that collects and tracks custom
dynamic load indices. An ELIM can be a shell script or a compiled binary program,
which returns the values of the dynamic resources you define. The ELIM executable
must be named
elim and located in LSF_SERVERDIR.
pim Process Information Manager (PIM) running on each server host. Started by LIM,
which periodically checks on PIM and restarts it if it dies.
Collects information about job processes running on the host such as CPU and
memory used by the job, and reports the information to
sbatchd.
Commands:
◆ bjobs—View job information
Batch jobs and tasks
You can either run jobs through the batch system where jobs are held in queues, or
you can interactively run tasks without going through the batch system, such as
tests for example.
Job A unit of work run in the LSF system. A job is a command submitted to LSF for
execution, using the
bsub command. LSF schedules, controls, and tracks the job
according to configured policies.
Jobs can be complex problems, simulation scenarios, extensive calculations,
anything that needs compute power.
Commands:
◆ bjobs—View jobs in the system