LSF Version 7.3 - Platform LSF Configuration Reference

all file descriptors to sbatchd connection. This could cause mbatchd to run out of descriptors,
which results in an mbatchd fatal error, such as failure to open lsb.events.
Use together with LSB_MAX_JOB_DISPATCH_PER_SESSION in lsf.conf.
Example
A reasonable setting is:
MAX_SBD_CONNS=768
For a large cluster, specify a value equal to the number of hosts in your cluster plus a buffer.
For example, if your cluster includes 4000 hosts:
MAX_SBD_CONNS=4100
Important:
Set LSB_MAX_JOB_DISPATCH_PER_SESSION in lsf.conf equal to one-half the value of MAX_SBD_CONNS.
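The sizing rule above can be sketched as a small Python helper. This is only an illustration: the function name size_sbd_settings and the default buffer of 100 (taken from the 4000-host example) are assumptions, not part of LSF, and rounding down for odd values is also an assumption.

```python
def size_sbd_settings(num_hosts: int, buffer: int = 100) -> dict:
    """Suggest MAX_SBD_CONNS and LSB_MAX_JOB_DISPATCH_PER_SESSION values.

    buffer=100 mirrors the 4000-host example; it is an assumed value,
    not an LSF default.
    """
    if num_hosts < 0 or buffer < 0:
        raise ValueError("num_hosts and buffer must be non-negative")

    # Large-cluster rule: number of hosts plus a buffer.
    max_sbd_conns = num_hosts + buffer

    # One-half of MAX_SBD_CONNS; integer division rounds odd values down
    # (the rounding direction is an assumption).
    dispatch_per_session = max_sbd_conns // 2

    return {
        "MAX_SBD_CONNS": max_sbd_conns,
        "LSB_MAX_JOB_DISPATCH_PER_SESSION": dispatch_per_session,
    }


if __name__ == "__main__":
    settings = size_sbd_settings(4000)
    # MAX_SBD_CONNS belongs in lsb.params.
    print(f"MAX_SBD_CONNS={settings['MAX_SBD_CONNS']}")
    # LSB_MAX_JOB_DISPATCH_PER_SESSION belongs in lsf.conf.
    print(
        "LSB_MAX_JOB_DISPATCH_PER_SESSION="
        f"{settings['LSB_MAX_JOB_DISPATCH_PER_SESSION']}"
    )
```

For a 4000-host cluster this yields MAX_SBD_CONNS=4100 and LSB_MAX_JOB_DISPATCH_PER_SESSION=2050. Choose the buffer to suit your site.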
Default
64
MAX_SBD_FAIL
Syntax
MAX_SBD_FAIL=integer
Description
The maximum number of retries for reaching a non-responding slave batch daemon (sbatchd).
The interval between retries is defined by MBD_SLEEP_TIME. If mbatchd fails to reach a host
and has retried MAX_SBD_FAIL times, the host is considered unreachable.
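As a rough illustration of how the two parameters interact, the following Python sketch estimates how long a host can stay silent before mbatchd considers it unreachable. The function name and the MBD_SLEEP_TIME value of 20 seconds are hypothetical, and the estimate assumes each retry costs exactly one MBD_SLEEP_TIME interval, ignoring connection timeouts and scheduling delays.

```python
def seconds_until_unreachable(max_sbd_fail: int, mbd_sleep_time: int) -> int:
    """Estimate the delay before a silent host is marked unreachable.

    Assumes one retry per MBD_SLEEP_TIME interval and ignores any
    connection timeout, so treat the result as an approximation.
    """
    if max_sbd_fail < 0 or mbd_sleep_time < 0:
        raise ValueError("both values must be non-negative")
    return max_sbd_fail * mbd_sleep_time


if __name__ == "__main__":
    # MAX_SBD_FAIL=3 is the documented default; MBD_SLEEP_TIME=20 is an
    # illustrative value only, not a documented default.
    print(seconds_until_unreachable(max_sbd_fail=3, mbd_sleep_time=20))
```

Raising MAX_SBD_FAIL makes mbatchd more tolerant of brief outages, but it takes longer to detect hosts that are really down.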
If you define LSB_SYNC_HOST_STAT_LIM=Y, mbatchd obtains the host status from the
master LIM before it polls sbatchd. When the master LIM has reported MAX_SBD_FAIL
times that a host is unavailable (LIM is down) or unreachable (sbatchd is down),
mbatchd reports the host status as unavailable or unreachable.
When a host becomes unreachable, mbatchd assumes that all jobs running on that host have
exited, and schedules all rerunnable jobs (jobs submitted with the bsub -r option) to be
rerun on another host.
Default
3
MAX_USER_PRIORITY
Syntax
MAX_USER_PRIORITY=integer