Platform LSF Reference Version 6.2
lsf.conf
Platform LSF Reference
545
◆
CUMULATIVE_RUSAGE—when a parallel job script runs multiple pam
commands, resource usage is collected for jobs in the job script, rather than being
overwritten when each
pam command is executed.
◆
DISP_RES_USAGE_LIMITS—bjobs displays resource usage limits configured
in the queue as well as job-level limits.
◆
LSB_HCLOSE_BY_RES—If res is down, host is closed with a message
Host is closed because RES is not available.
The status of the closed host is closed_Adm. No new jobs are dispatched to this
host, but currently running jobs are not suspended.
◆
RESERVE_BY_STARTTIME—LSF selects the reservation that will give the job
the earliest predicted start time.
By default, if multiple host groups are available for reservation, LSF chooses the
largest possible reservation based on number of slots.
◆
SHORT_EVENTFILE—compresses long host name lists when event records are
written to
lsb.events and lsb.acct for large parallel jobs. The short host
string has the format:
number_of_hosts*real_host_name
When SHORT_EVENTFILE is enabled, older daemons and commands (pre-LSF
Version 6.2) cannot recognize the lsb.acct and lsb.events file format.
For example, if the original host list record is
6 "hostA" "hostA" "hostA" "hostA" "hostB" "hostC"
redundant host names are removed and the short host list record becomes
3 "4*hostA" "hostB" "hostC"
When LSF_HPC_EXTENSIONS="SHORT_EVENTFILE" is set, and LSF
reads the host list from
lsb.events or lsb.acct, the compressed host list is
expanded into a normal host list.
SHORT_EVENTFILE affects the following events and fields:
❖
JOB_START in lsb.events when a normal job is dispatched
✧
numExHosts (%d)
✧
execHosts (%s)
❖
JOB_CHUNK in lsb.events when a job is inserted into a job chunk
✧
numExHosts (%d)
✧
execHosts (%s)
❖
JOB_FORWARD in lsb.events when a job is forwarded to a MultiCluster
leased host
✧
numReserHosts (%d)
✧
reserHosts (%s)
❖
JOB_FINISH record in lsb.acct
✧
numExHosts (%d)
✧
execHosts (%s)