LSF Version 7.3 - Administering Platform LSF

ManualsBrandsHP ManualsSoftwareHP XC System 4.x Software

411

412

413

414

415

416

417

418

419

420

Administering Platform LSF 415

Reserving Resources

◆ MXJ, JL/U in lsb.hosts

◆ PJOB_LIMIT, HJOB_LIMIT, QJOB_LIMIT, UJOB_LIMIT in lsb.queues

Memory request To request memory resources, configure RESOURCE_RESERVE in lsb.queues.

When RESOURCE_RESERVE is used, LSF will consider memory and slot requests

during time-based reservation calculation. LSF will not reserve slot or memory if

any other resources are not satisfied.

If SLOT_RESERVE is configured, time-based reservation will not make a slot

reservation if any other type of resource is not satisfied, including memory requests.

When SLOT_RESERVE is used, if job cannot run because of non-slot resources,

including memory, time-based reservation will not reserve slots. For example, if job

cannot run because it cannot get required license, job will be pending without any

reservation

Host partition and queue-level scheduling

If host partitions are configured, LSF first schedules jobs on the host partitions and

then goes through each queue to schedule jobs. The same job may be scheduled

several times, one for each host partition and last one at queue-level. Available

candidate hosts may be different for each time.

Because of this difference, the same job may get different estimated start times,

future allocation, and reservation in different host partitions and queue-level

scheduling. With time-based reservation configured, LSF always keeps the same

reservation and future allocation with the earliest estimated start time.

bjobs displays future allocation information

◆ By default, job future allocation contains LSF host list and number of CPUs per

host, for example:

alloc=2*hostA 3*hostB

◆ LSF integrations define their own future allocation string to override the

default LSF allocation. For example, in RMS, future allocation is displayed as:

rms_alloc=2*sierra0 3*sierra1

Predicted start time may be postponed for some jobs

If a pending job cannot be placed in a future resource allocation, the scheduler can

skip it in the start time reservation calculation and fall back to use greedy slot

reservation. There are two possible reasons:

◆ The job slot request cannot be satisfied in the future allocation

◆ Other non-slot resources cannot be satisfied.

Either way, the scheduler continues calculating predicted start time for the

remaining jobs without considering the skipped job.

Later, once the resource request of skipped job can be satisfied and placed in a

future allocation, the scheduler reevaluates the predicted start time for the rest of

jobs, which may potentially postpone their start times.

To minimize the overhead in recalculating the predicted start times to include

previously skipped jobs, you should configure a small value for

LSB_TIME_RESERVE_NUMJOBS in

lsf.conf.