HP XC System Software Administration Guide Version 4.0

Table Of Contents
To move LSF and SLURM back to the original primary node, follow the same procedure with
the assumption that the original primary node is now the backup node, and the original backup
node is now the primary node.
16.16 Enhancing LSF with SLURM
You can set environment variables to influence the operation of LSF with SLURM in the HP XC
system. These environment variables affect the operation directly and set thresholds for LSF with
SLURM and SLURM interplay.
16.16.1 LSF with SLURM Enhancement Settings
Table 16-3 describes the environment variables in the lsf.conf file that you can use to enhance
LSF with SLURM.
Table 16-3 Environment Variables for LSF with SLURM Enhancement (lsf.conf File)
DescriptionEnvironment Variable
This entry specifies the TCP port used for communication between
the LSF with SLURM allocation adapter (RLA) and the SLURM
scheduler plug-in.
The default port number is 6883.
LSB_RLA_PORT=port_number
This entry defines the communications timeout between RLA and its
clients (for example, sbatchd and the SLURM scheduler plug-in.)
The default value is 10 seconds.
LSB_RLA_TIMEOUT=seconds
This entry specifies how often the LSF with SLURM scheduler
refreshes free node information from RLA.
The default value is 600 seconds.
LSB_RLA_UPDATE=seconds
This entry specifies the directory to store the RLA status file. It enables
RLA to recover its original state when it restarts. When RLA first
starts, it creates the directory defined by LSB_RLA_WORKDIR if it
does not exist, then creates subdirectories for each host. Avoid using
/tmp or any other directory that is automatically cleaned up by the
system. Unless your installation has restrictions on the
LSB_SHAREDIR directory, use the default for LSB_SHAREDIR.
The default value is LSB_SHAREDIR/cluster_name/rla_workdir.
LSB_RLA_WORKDIR=directory
This setting provides for either of two systemwide scheduling modes:
first-fit Under this mode, the scheduler chooses the first-met
free nodes to allocate.
By default, HPC SLURM integration works under
first-fit mode.
best-fit Under this mode, the scheduler applies a set of criteria
to choose nodes with minimal capacities that satisfy
the job request.
You can set LSB_SLURM_BESTFIT=Y to switch the
scheduler to work under best-fit mode.
In a heterogeneous HP XC system, a best-fit allocation
may be preferable for clusters running a mix of serial
and parallel jobs. In this context, best fit means: "the
nodes that minimally satisfy the requirements." Nodes
with the maximum number of processors are chosen
first. For parallel and serial jobs, the nodes with
minimal memory, minimal tmp space, and minimal
weight are chosen.
LSB_SLURM_BESTFIT=Y
16.16 Enhancing LSF with SLURM 207