LSF Version 7.3 - Using Platform LSF HPC

LSF can run an appropriate script that is responsible for setup and cleanup of the job
launching environment. You can specify the name of the appropriate script in an
application profile in
lsb.applications.
Use DJOB_ENV_SCRIPT to define the path to a script that sets the environment for
the parallel or distributed job launcher. The script runs as the user, and is part of the job.
DJOB_ENV_SCRIPT only applies to the
blaunch distributed application framework.
If a full path is specified, LSF uses the path name for the execution. If a full path is not
specified, LSF looks for it in LSF_BINDIR.
The specified script must support a
setup argument and a cleanup argument. LSF
invokes the script with the
setup argument before launching the actual job to set up
the environment, and with
cleanup argument after the job is finished.
LSF assumes that if setup cannot be performed, the environment to run the job does
not exist. If the script returns a non-zero value at setup, an error is printed to
stderr
of the job, and the job exits.
Regardless of the return value of the script at cleanup, the real job exit value is used. If
the return value of the script is non-zero, an error message is printed to
stderr of the
job.
When defined in an application profile, the LSB_DJOB_ENV_SCRIPT variable is set
when running
bsub -app for the specified application.
For example, if
DJOB_ENV_SCRIPT=mpich.script, LSF runs
$LSF_BINDIR/mpich.script setup
to set up the environment to run an MPICH job. After the job completes, LSF runs
$LSF_BINDIR/mpich.script cleanup
On cleanup, the mpich.script file could, for example, remove any temporary files
and release resources used by the job. Changes to the LSB_DJOB_ENV_SCRIPT
environment variable made by the script are visible to the job.
Use DJOB_HB_INTERVAL in an application profile in lsb.applications to
configure an interval in seconds used to update the heartbeat between LSF and the tasks
of a parallel or distributed job. DJOB_HB_INTERVAL only applies to the
blaunch
distributed application framework.
When DJOB_HB_INTERVAL is specified, the interval is scaled according to the
number of tasks in the job:
max(DJOB_HB_INTERVAL, 10) +
host_factor
where
host_factor = 0.01 * number of hosts allocated for the job
When defined in an application profile, the LSB_DJOB_HB_INTERVAL variable is set
in the parallel or distributed job environment. You should not manually change the value
of LSB_DJOB_HB_INTERVAL.
By default, the interval is equal to SBD_SLEEP_TIME in
lsb.params, where the
default value of SBD_SLEEP_TIME is 30 seconds.