LSF Version 7.3 - Using Platform LSF HPC

How the Generic PJL Framework Works
Terminology
The host name at the top of the execution host list as determined by LSF. Starts PAM.
The most suitable hosts to execute the batch job as determined by LSF
A process that runs on a host; the individual process of a parallel application
A parallel job consists of multiple tasks that could be executed on different hosts.
(Parallel Job Launcher) Any executable script or binary capable of starting parallel tasks
on all hosts assigned for a parallel job (for example,
mpirun.)
Slave Batch Daemons (SBDs) are batch job execution agents residing on the execution
hosts.
sbatchd receives jobs from mbatchd in the form of a job specification and
starts RES to run the job according the specification.
sbatchd reports the batch job
status to
mbatchd whenever job state changes.
Reads the environment variable LSF_PJL_TYPE, and generates the appropriate pam
command line to invoke the PJL. The esub programs provided in LSF_SERVERDIR
set this variable to the proper type.
(TaskStarter) An executable responsible for starting a parallel task on a host and
reporting the process ID and host name to PAM. TS is located in LSF_BINDIR.
(Parallel Application Manager) The supervisor of any parallel LSF job. PAM allows LSF
to collect resources used by the job and perform job control.
PAM starts the PJL and maintains connection with RES on all execution hosts. It
collects resource usage, updates the resource usage of tasks and its own PID and PGID
to
sbatchd. It propagates signals to all process groups and individual tasks, and cleans
up tasks as needed.
A script that starts the PJL. The wrapper is typically used to set up the environment for
the parallel job and invokes the PJL.
(Remote Execution Server) An LSF daemon running on each server host. Accepts
remote execution requests to provide transparent and secure remote execution of jobs
and tasks.
RES manages all remote tasks and forwards signals, standard I/O, resources
consumption data, and parallel job information between PAM and the tasks.