Platform LSF Administration Guide Version 6.2
Chapter 28
Running Parallel Jobs
Administering Platform LSF
Submitting a third job with the same requirements will reserve one job slot and all
free memory, provided the amount of free memory is between 20 MB and 200 MB (some
free memory may be used by the operating system or other software).
◆ Job 4:
% bsub -W 400 -q backfill -R "rusage[mem=50]" myjob4
The job will remain pending because the memory is reserved by job 3 and the job
would run longer than job 1 and job 2, so it cannot backfill.
◆ Job 5:
% bsub -W 100 -q backfill -R "rusage[mem=50]" myjob5
The job will start running. It uses one free slot and memory reserved by job 3. If the
job does not finish in 100 minutes, it will be killed by LSF automatically.
◆ Job 6:
% bsub -W 100 -q backfill -R "rusage[mem=300]" myjob6
The job will remain pending with no resource reservation, because the memory
reserved by job 3 is less than the 300 MB it requests.
◆ Job 7:
% bsub -W 100 -q backfill myjob7
The job will start running: LSF assumes it does not require any memory, and enough
job slots are free.
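The outcomes for jobs 4 through 7 follow from two checks: whether the memory reserved by job 3 covers the request, and whether the run limit ends before the reservation is needed. The following Python sketch models only that reasoning; it is not how LSF is implemented. The function `decide` and the state values (2 free slots, 200 MB of reserved memory, a 300-minute window before jobs 1 and 2 finish) are assumptions chosen to match the example, not values reported by LSF. The sketch also evaluates each job against the same state and ignores the resources that job 5 consumes once it starts.

```python
def decide(run_limit_min, mem_mb, free_slots, backfill_mem_mb, window_min):
    """Return "run" or "pend" for a one-slot job submitted to the backfill queue.

    Hypothetical model of the reasoning in the text, not LSF's scheduler.
    """
    if free_slots < 1:
        return "pend"  # no job slot available
    if mem_mb == 0:
        return "run"   # no memory requested, so a free slot is enough
    if mem_mb > backfill_mem_mb:
        return "pend"  # reserved memory cannot cover the request
    if run_limit_min > window_min:
        return "pend"  # would still hold memory when the reservation is due
    return "run"       # fits inside the reservation: backfill


# Assumed state after job 3 has reserved its resources (illustrative values).
state = dict(free_slots=2, backfill_mem_mb=200, window_min=300)

# (run limit from -W in minutes, memory from rusage[mem=...] in MB)
jobs = {
    "job4": (400, 50),
    "job5": (100, 50),
    "job6": (100, 300),
    "job7": (100, 0),
}

results = {name: decide(w, mem, **state) for name, (w, mem) in jobs.items()}
print(results)
```

Under these assumptions the model pends jobs 4 and 6 and starts jobs 5 and 7, matching the descriptions above.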
Parallel jobs
Each process of a parallel job requires 100 MB of memory, and each parallel job needs
4 CPUs. The first two of the following parallel jobs will run for 300 minutes.
◆ Job 1:
% bsub -W 300 -n 4 -R "rusage[mem=100]" -q reservation myJob1
The job will start running, use 4 slots, and get 400 MB of memory.
◆ Job 2:
Submitting a second job with the same requirements will get the same result.
◆ Job 3:
Submitting a third job with the same requirements will reserve 2 slots and all
200 MB of available memory, assuming no other applications are running outside of
LSF.
◆ Job 4:
% bsub -W 400 -q backfill -R "rusage[mem=50]" myJob4
The job will remain pending because all available memory is already reserved by
job 3. It would run longer than job 1 and job 2, so no backfill happens.
◆ Job 5:
% bsub -W 100 -q backfill -R "rusage[mem=50]" myJob5
This job will start running. It can backfill the slot and memory reserved by job 3. If
the job does not finish in 100 minutes, it will be killed by LSF automatically.
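The slot and memory figures for parallel jobs 1 through 3 can be checked with a little arithmetic: each job needs 4 slots and 4 x 100 MB = 400 MB. The Python sketch below replays the three submissions. The host size (10 slots, 1000 MB) is an assumption inferred from the totals in the example (4 + 4 + 2 slots, 400 + 400 + 200 MB) and is not stated in the text. The helper `place` is hypothetical and only illustrates the "start if it fits, otherwise reserve what is free" behavior described above.

```python
def place(n_slots, mem_per_proc_mb, free_slots, free_mem_mb):
    """Start the job if it fits; otherwise reserve whatever is currently free.

    Returns (action, slots_taken, mem_taken_mb). Hypothetical model only.
    """
    need_mem = n_slots * mem_per_proc_mb  # memory is requested per process
    if n_slots <= free_slots and need_mem <= free_mem_mb:
        return "run", n_slots, need_mem
    return "reserve", min(n_slots, free_slots), min(need_mem, free_mem_mb)


# Assumed host capacity, inferred from the example totals.
free_slots, free_mem_mb = 10, 1000

log = []
for name in ("job1", "job2", "job3"):
    # Each job: -n 4 with rusage[mem=100]
    action, slots, mem = place(4, 100, free_slots, free_mem_mb)
    free_slots -= slots
    free_mem_mb -= mem
    log.append((name, action, slots, mem))

for entry in log:
    print(entry)
```

With these assumed capacities, jobs 1 and 2 each start with 4 slots and 400 MB, and job 3 reserves the remaining 2 slots and 200 MB, leaving nothing free for job 4 unless it can backfill.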