Platform LSF Reference Version 6.2

Parameters
Platform LSF Reference
562
LSF_NIOS_JOBSTATUS_INTERVAL
Syntax
LSF_NIOS_JOBSTATUS_INTERVAL=
time_minutes
Description Applies only to interactive batch jobs.
Time interval at which NIOS polls
mbatchd to check if a job is still running. Used to
retrieve a job’s exit status in the case of an abnormal exit of NIOS, due to a network
failure for example.
Use this parameter if you run interactive jobs and you have scripts that depend on an
exit code being returned.
When this parameter is undefined and a network connection is lost,
mbatchd cannot
communicate with NIOS and the return code of a job is not retrieved.
When this parameter is defined, before exiting, NIOS polls
mbatchd on the interval
defined by LSF_NIOS_JOBSTATUS_INTERVAL to check if a job is still running.
NIOS continues to poll
mbatchd until it receives an exit code or mbatchd responds
that the job does not exist (if the job has already been cleaned from memory for
example).
If an exit code cannot be retrieved, NIOS generates an error message and the code -11.
Valid Values
Any integer greater than zero
Default
Undefined
Notes
Set this parameter to large intervals such as 15 minutes or more so that performance is
not negatively affected if interactive jobs are pending for too long. NIOS always calls
mbatchd on the defined interval to confirm that a job is still pending and this may add
load to
mbatchd.
See also
Environment variable LSF_NIOS_PEND_TIMEOUT
LSF_NIOS_RES_HEARTBEAT
Syntax LSF_NIOS_RES_HEARTBEAT=time_minutes
Description
Applies only to interactive non-parallel batch jobs.
Defines how long NIOS waits before sending a message to RES to determine if the
connection is still open.
Use this parameter to ensure NIOS exits when a network failure occurs instead of
waiting indefinitely for notification that a job has been completed. When a network
connection is lost, RES cannot communicate with NIOS and as a result, NIOS does not
exit.
When this parameter is defined, if there has been no communication between RES and
NIOS for the defined period of time, NIOS sends a message to RES to see if the
connection is still open. If the connection is no longer available, NIOS exits.
Valid values Any integer greater than zero
Default
Undefined