LSF Version 7.3 - Administering Platform LSF

Administering Platform LSF 237
Understanding Resources
How LSF Uses Resources
Jobs submitted through the LSF system will have the resources they use monitored
while they are running. This information is used to enforce resource usage limits
and load thresholds as well as for fairshare scheduling.
LSF collects information such as:
Total CPU time consumed by all processes in the job
Total resident memory usage in KB of all currently running processes in a job
Total virtual memory usage in KB of all currently running processes in a job
Currently active process group ID in a job
Currently active processes in a job
On UNIX, job-level resource usage is collected through a special process called
PIM (Process Information Manager). PIM is managed internally by LSF.
Viewing job resource usage
The -l option of the bjobs command displays the current resource usage of the job.
The usage information is sampled by PIM every 30 seconds and collected by
sbatchd at a maximum frequency of every SBD_SLEEP_TIME (configured in the
lsb.params file) and sent to mbatchd. The update is done only if the value for the
CPU time, resident memory usage, or virtual memory usage has changed by more
than 10 percent from the previous update, or if a new process or process group has
been created.
View load on a host
1 Run bhosts -l to check the load levels on the host, and adjust the suspending
conditions of the host or queue if necessary.
The
bhosts -l command gives the most recent load values used for the
scheduling of jobs. A dash (
-) in the output indicates that the particular
threshold is not defined.
bhosts -l hostB
HOST: hostB
STATUS CPUF JL/U MAX NJOBS RUN SSUSP USUSP RSV
ok 20.00 2 2 0 0 0 0 0
CURRENT LOAD USED FOR SCHEDULING:
r15s r1m r15m ut pg io ls t tmp swp
mem
Total 0.3 0.8 0.9 61% 3.8 72 26 0 6M 253
M 297M
Reserved 0.0 0.0 0.0 0% 0.0 0 0 0 0M 0M
0M
LOAD THRESHOLD USED FOR SCHEDULING:
r15s r1m r15m ut pg io ls it tmp swp mem
loadSched - - - - - - - - - - -
loadStop - - - - - - - - - - -
cpuspeed bandwidth