HP XC System Software Administration Guide Version 3.0
DescriptionEnvironment Variable
This setting enables Platform LSF-HPC extensions.
This setting is undefined by default.
The following extension names are supported:
• SHORT_EVENTFILE
This compresses long host name lists when event records are written to the
lsb.events and lsb.acct files for large parallel jobs. The short host
string has the format:
number_of_hosts*real_host_name
When SHORT_EVENTFILE is enabled, older daemons and commands
(pre-LSF Version 6.1) cannot recognize the lsb.acct and lsb.events
file format. For example, the original host list record is as follows:
6 "hostA" "hostA" "hostA" "hostA" "hostB" "hostC"
Redundant host names are removed and the host count is changed so that
the short host list record is as follows:
3 "4*hostA" "hostB" "hostC"
When LSF_HPC_EXTENSION="SHORT_EVENTFILE" is set, and LSF reads
the host list from the lsb.events or lsb.acct files, the compressed host
list is expanded into a normal host list. This setting applies to the following
events:
• JOB_START — when a normal job is dispatched.
• JOB_FORCE — when a job is forced with the brun command.
• JOB_CHUNK — when a job is inserted into a job chunk.
• JOB_FORWARD — when a job is forwarded to a MultiCluster leased
host.
• JOB_FINISH in lsb.acct.
• SHORT_PIDLIST
This shortens the output from the bjobs command to eliminate many of the
process IDs (PIDs) for a job. The bjobs command displays only the first
ID and a count of the process group IDs (PGIDs) and process IDs for the
job. Without the SHORT_PIDLIST setting, the bjobs -l command displays
all the PGIDs and PIDs for the job. With SHORT_PIDLIST set, the bjobs
-l command displays a count of the PGIDs and PIDs.
• RESERVE_BY_STARTTIME
LSF selects the reservation that gives the job the earliest predicted start time.
By default, if multiple host groups are available for reservation, LSF chooses
the largest possible reservation based on the number of slots. When backfill
is configured, this can lead to larger jobs not running as their start times are
pushed further into the future.
• BRUN_WITH_TOPOLOGY
If a topology request can be satisfied for a brun job, brun preserves the
topology request. LSF allocates the resource according to the request and
tries to run the job with the requested topology. If allocation fails because
the topology request cannot be satisfied, the job is requeued. By default,
the job topology request is ignored by the scheduler when it creates an
allocation if BRUN_WITH_TOPOLOGY is not specified.
LSF_HPC_EXTENSIONS="ext_name,..."
This entry in the lsf.conf file defines how any two LSF_HPC_NCPU_*
thresholds are combined.
The default value is or.
LSF_HPC_NCPU_COND=and|or
This entry in the lsf.conf file defines the upper limit for the number of
processors that are changed since the last checking cycle.
The default value is 0.
LSF_HPC_NCPU_INCREMENT=increment
This entry specifies the minimum number of consecutive cycles in which the
number of processors changed does not exceed LSF_HPC_NCPU_INCREMENT.
LSF checks total usable processors every 2 minutes.
LSF_HPC_NCPU_INCR_CYCLES=icycles
This entry specifies the percentage of total usable processors in the LSF partition.
The default is 80.
LSF_HPC_NCPU_THRESHOLD=threshold
132 Managing LSF