Platform LSF Administration Guide Version 6.2

Configuring Job Control Actions
Administering Platform LSF
508
Configuring Job Control Actions
Several situations may require overriding or augmenting the default actions for job
control. For example:
Notifying users when their jobs are suspended, resumed, or terminated
An application holds resources (for example, licenses) that are not freed by
suspending the job. The administrator can set up an action to be performed that
causes the license to be released before the job is suspended and re-acquired when
the job is resumed.
The administrator wants the job checkpointed before being:
Suspended when a run window closes
Killed when the RUNLIMIT is reached
A distributed parallel application must receive a catchable signal when the job is
suspended, resumed or terminated to propagate the signal to remote processes.
To override the default actions for the SUSPEND, RESUME, and TERMINATE job
controls, specify the JOB_CONTROLS parameter in the queue definition in
lsb.queues.
See the Platform LSF Reference for information about the
lsb.queues file.
JOB_CONTROLS parameter (lsb.queues)
The JOB_CONTROLS parameter has the following format:
Begin Queue
...
JOB_CONTROLS = SUSPEND[
signal
| CHKPNT |
command
]\
RESUME[
signal
|
command
]\
TERMINATE[
signal
| CHKPNT |
command
]
...
End Queue
When LSF needs to suspend, resume, or terminate a job, it invokes one of the following
actions as specified by SUSPEND, RESUME, and TERMINATE.
signal
A UNIX signal name (for example, SIGTSTP or SIGTERM). The specified signal is
sent to the job.
The same set of signals is not supported on all UNIX systems. To display a list of the
symbolic names of the signals (without the SIG prefix) supported on your system, use
the
kill -l command.
CHKPNT
Checkpoint the job. Only valid for SUSPEND and TERMINATE actions.
If the SUSPEND action is CHKPNT, the job is checkpointed and then stopped by
sending the SIGSTOP signal to the job automatically.
If the TERMINATE action is CHKPNT, then the job is checkpointed and killed
automatically.
command
A /bin/sh command line.
Do not quote the command line inside an action definition.
Do not specify a signal followed by an action that triggers the same signal (for
example, do not specify
JOB_CONTROLS=TERMINATE[bkill] or