Platform LSF Reference Version 6.2

lsb.params
Platform LSF Reference
387
MAX_PEND_JOBS
Syntax
MAX_PEND_JOBS=
integer
Description
The maximum number of pending jobs in the system.
This is the hard system-wide pending job threshold. No user or user group can exceed
this limit unless the job is forwarded from a remote cluster.
If the user or user group submitting the job has reached the pending job threshold as
specified by
MAX_PEND_JOBS, LSF will reject any further job submission requests sent
by that user or user group. The system will continue to send the job submission requests
with the interval specified by
SUB_TRY_INTERVAL in lsb.params until it has made
a number of attempts equal to the
LSB_NTRIES environment variable. If LSB_NTRIES
is undefined and LSF rejects the job submission request, the system will continue to
send the job submission requests indefinitely as the default behavior.
Default
INFINIT_INT
INFINIT_INT is defined in
lsf.h
See also
SUB_TRY_INTERVAL
MAX_PREEXEC_RETRY
Syntax
MAX_PREEXEC_RETRY=
integer
Description
MultiCluster job forwarding model only. The maximum number of times to attempt the
pre-execution command of a job from a remote cluster.
If the job’s pre-execution command fails all attempts, the job is returned to the
submission cluster.
MAX_SBD_CONNS
Syntax MAX_SBD_CONNS=integer
Description
The maximum number of file descriptors mbatchd can have open and connected
concurrently to
sbatchd
Controls the maximum number of connections that LSF can maintain to sbatchds in
the system. Many sites require more than 32 connections.
Do not exceed the file descriptor limit of the root process (the usual limit is 1024).
Setting it equal or larger than this limit can cause
mbatchd to constantly die because
mbatchd allocates all file descriptors to sbatchd connection. This could cause
mbatchd to run out of descriptors, which results in an mbatchd fatal error, such as
failure to open
lsb.events.
Example
Reasonable settings are:
MAX_SBD_CONNS=512
MAX_SBD_CONNS=768
Default
32
MAX_SBD_FAIL
Syntax
MAX_SBD_FAIL=
integer