HP XC System Software Administration Guide Version 3.1

Table 15-3 Environment Variables for LSF-HPC with SLURM Enhancement (lsf.conf File) (continued)
DescriptionEnvironment Variable
This setting enables Platform LSF extensions.
This setting is undefined by default.
The following extension names are supported:
SHORT_EVENTFILE
This compresses long host name lists when event records are written to
the lsb.events and lsb.acct files for large parallel jobs. The short host
string has the format:
number_of_hosts*real_host_name
When SHORT_EVENTFILE is enabled, older daemons and commands
(prior to LSF Version 6.2) cannot recognize the lsb.acct and lsb.events
file format. For example, the original host list record is as follows:
6 "hostA" "hostA" "hostA" "hostA" "hostB" "hostC"
Redundant host names are removed and the host count is changed so that
the short host list record is as follows:
3 "4*hostA" "hostB" "hostC"
When LSF_HPC_EXTENSION="SHORT_EVENTFILE" is set, and LSF reads
the host list from the lsb.events or lsb.acct files, the compressed
host list is expanded into a normal host list. This setting applies to the
following events:
JOB_START — when a normal job is dispatched.
JOB_FORCE — when a job is forced with the brun command.
JOB_CHUNK — when a job is inserted into a job chunk.
JOB_FORWARD — when a job is forwarded to a MultiCluster leased
host.
JOB_FINISH in lsb.acct.
SHORT_PIDLIST
This shortens the output from the bjobs command to eliminate many of
the process IDs (PIDs) for a job. The bjobs command displays only the
first ID and a count of the process group IDs (PGIDs) and process IDs for
the job. Without the SHORT_PIDLIST setting, the bjobs -l command
displays all the PGIDs and PIDs for the job. With SHORT_PIDLIST set,
the bjobs -l command displays a count of the PGIDs and PIDs.
RESERVE_BY_STARTTIME
LSF-HPC with SLURM selects the reservation that gives the job the earliest
predicted start time. By default, if multiple host groups are available for
reservation, LSF-HPC with SLURM chooses the largest possible reservation
based on the number of slots. When backfill is configured, this can lead to
larger jobs not running as their start times are pushed further into the
future.
BRUN_WITH_TOPOLOGY
If a topology request can be satisfied for a brun job, brun preserves the
topology request. LSF-HPC with SLURM allocates the resource according
to the request and tries to run the job with the requested topology. If
allocation fails because the topology request cannot be satisfied, the job is
queued again. By default, the job topology request is ignored by the
scheduler when it creates an allocation if BRUN_WITH_TOPOLOGY is not
specified.
LSF_HPC_EXTENSIONS="ext_name,..."
This entry in the lsf.conf file defines how any two LSF_HPC_NCPU_*
thresholds are combined.
The default value is or.
LSF_HPC_NCPU_COND=and|or
This entry in the lsf.conf file defines the upper limit for the number of
processors that are changed since the last checking cycle.
The default value is 0.
LSF_HPC_NCPU_INCREMENT=increment
This entry specifies the minimum number of consecutive cycles in which the
number of processors changed does not exceed LSF_HPC_NCPU_INCREMENT.
LSF-HPC with SLURM checks total usable processors every 2 minutes.
LSF_HPC_NCPU_INCR_CYCLES=icycles
This entry specifies the percentage of total usable processors in the LSF
partition.
The default is 80.
LSF_HPC_NCPU_THRESHOLD=threshold
15.14 Enhancing LSF-HPC with SLURM 193