LSF Version 7.3 - Platform LSF Configuration Reference

External load indices behavior
How LSF manages multiple elim executables
The LSF administrator can write one elim executable to collect multiple external load indices,
or the LSF administrator can divide external load index collection among multiple elim
executables. On each host, the load information manager (LIM) starts a master elim
(MELIM), which manages all elim executables on the host and reports the external load index
values to the LIM. Specifically, the MELIM
Starts elim executables on the host. The LIM checks the ResourceMap section
LOCATION settings (default, all, or host list) and directs the MELIM to start elim
executables on the corresponding hosts.
Note:
If the ResourceMap section contains even one resource
mapped as default, and if there are multiple elim executables
in LSF_SERVERDIR, the MELIM starts all of the elim
executables in LSF_SERVERDIR on all hosts in the cluster.
Not all of the elim executables continue to run, however.
Those that use a checking header could exit with
ELIM_ABORT_VALUE if they are not programmed to report
values for the resources listed in LSF_RESOURCES.
Restarts an elim if the elim exits. To prevent system-wide problems in case of a fatal error
in the elim, the maximum restart frequency is once every 90 seconds. The MELIM does
not restart any elim that exits with ELIM_ABORT_VALUE.
Collects the load information reported by the elim executables.
Checks the syntax of load update strings before sending the information to the LIM.
Merges the load reports from each elim and sends the merged load information to the
LIM. If there is more than one value reported for a single resource, the MELIM reports the
latest value.
Logs its activities and data into the log file LSF_LOGDIR/melim.log.host_name
Increases system reliability by buffering output from multiple elim executables; failure of
one elim does not affect other elim executables running on the same host.
How LSF determines which hosts should run an elim executable
LSF provides configuration options to ensure that your elim executables run only when they
can report the resources values expected on a host. This maximizes system performance and
simplifies the implementation of external load indices. To control which hosts run elim
executables, you
Must map external resource names to locations in lsf.cluster.cluster_name
Optionally, use the environment variables LSF_RESOURCES, LSF_MASTER, and
ELIM_ABORT_VALUE in your elim executables
How resource mapping determines elim hosts
The following table shows how the resource mapping defined in
lsf.cluster.cluster_name determines the hosts on which your elim executables start.
Feature: External load indices
110 Platform LSF Configuration Reference