LSF Version 7.3 - Platform LSF Configuration Reference

Configuration file Parameter and syntax Behavior
CHKPNT_PERIOD=chkpnt_period
CHKPNT_METHOD=
chkpnt_method
Configuration to enable automatic job migration
Automatic job migration assumes that if a job is system-suspended (SSUSP) for an extended
period of time, the execution host is probably heavily loaded. Configuring a queue-level or
host-level migration threshold lets the job to resume on another less loaded host, and reduces
the load on the original host. You can use bmig at any time to override a configured migration
threshold.
Configuration file
Parameter and syntax Behavior
lsb.queues
lsb.applications
MIG=minutes
LSF automatically migrates jobs that have been in the
SSUSP state for more than the specified number of
minutes
Specify a value of 0 to migrate jobs immediately upon
suspension
Applies to all jobs submitted to the queue
Job-level command line migration threshold (bsub -
mig) overrides threshold configuration in application
profile and queue. Application profile configuration
overrides queue level configuration.
lsb.hosts
HOST_NAME MIG
host_name minutes
LSF automatically migrates jobs that have been in the
SSUSP state for more than the specified number of
minutes
Specify a value of 0 to migrate jobs immediately upon
suspension
Applies to all jobs running on the host
Note:
When a host migration threshold is specified, and is lower than
the value for the job, the queue, or the application, the host value
is used.
Job migration behavior
LSF migrates a job by performing the following actions:
1.
Stops the job if it is running
2.
Checkpoints the job if the job is checkpointable
3.
Kills the job on the current host
4.
Restarts or reruns the job on the first available host, bypassing all pending jobs
Feature: Job migration
Platform LSF Configuration Reference 87