Platform LSF Administration Guide Version 6.2
Administering Platform LSF
Automatically Checkpointing Jobs
To checkpoint jobs automatically, submit them to a queue that is configured for periodic
checkpointing. To configure the queue, edit lsb.queues and set the CHKPNT parameter to a
checkpoint directory followed by a checkpoint period. The checkpoint directory must
already exist; LSF will not create it. The checkpoint period is specified in minutes. All
jobs submitted to the queue are checkpointed automatically. For example, to configure a
queue to checkpoint jobs every 4 hours (240 minutes) to a directory named my_dir:
Begin Queue
...
CHKPNT=my_dir 240
DESCRIPTION=Auto chkpnt every 4 hrs (240 min) to my_dir
...
End Queue
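The two constraints on CHKPNT (the directory must already exist, and the period is given in
minutes) are easy to miss when a period is planned in hours. As a minimal sketch, the
shell function below checks that the directory exists and prints the matching CHKPNT line.
The name chkpnt_line is a hypothetical helper, not part of LSF. The sketch assumes the
directory is visible from the host where the function runs and that the period is a whole
number of hours.

```shell
# Hypothetical helper (not an LSF command): print a CHKPNT line for lsb.queues.
#   $1 = checkpoint directory (must already exist; LSF will not create it)
#   $2 = checkpoint period in whole hours (converted to minutes for CHKPNT)
chkpnt_line() {
    if [ ! -d "$1" ]; then
        echo "checkpoint directory '$1' does not exist; create it first" >&2
        return 1
    fi
    echo "CHKPNT=$1 $(($2 * 60))"
}
```

For example, once my_dir has been created, chkpnt_line my_dir 4 prints CHKPNT=my_dir 240,
which matches the queue definition above.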
Jobs submitted to a queue configured for checkpointing can also be checkpointed on demand
using bchkpnt. For jobs submitted or modified with the -k, -r, -p, or -kn options, those
options override the options configured for the queue.
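As an illustration of checkpointing on demand, the sketch below wraps bchkpnt in two small
shell functions. The function names, the lsf dry-run wrapper, and the LSF_RUN variable are
hypothetical and exist only for this example. The -p option of bchkpnt is assumed to take a
checkpoint period in minutes, as in the bchkpnt command reference. By default the commands
are only printed, so the sketch can be tried without a running cluster.

```shell
# Hypothetical dry-run wrapper: print the command unless LSF_RUN=1 is set,
# in which case the command is executed.
lsf() {
    if [ "${LSF_RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$@"
    fi
}

# Checkpoint a job immediately.  $1 = job ID
checkpoint_now() {
    lsf bchkpnt "$1"
}

# Change a job's checkpoint period (assumed: bchkpnt -p takes minutes).
#   $1 = job ID, $2 = new period in minutes
set_checkpoint_period() {
    lsf bchkpnt -p "$2" "$1"
}
```

With a hypothetical job ID of 1234, checkpoint_now 1234 prints bchkpnt 1234, and
set_checkpoint_period 1234 60 prints bchkpnt -p 60 1234. Setting LSF_RUN=1 runs the
commands instead of printing them.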