Platform LSF Reference Version 6.2

brestart
Platform LSF Reference
128
brestart
restarts checkpointed jobs
SYNOPSIS
brestart [bsub_options] [-f] checkpoint_dir [job_ID | "job_ID[index]"]
brestart [-h | -V]
OPTION LIST
-B
-f
-N
-x
-b
begin_time
-C core_limit
-c [hour:]minute[/host_name | /host_model]
-D data_limit
-E "pre_exec_command [argument ...]"
-F
file_limit
-m "host_name[+[pref_level]] | host_group[+[pref_level]] ..."
-G
user_group
-M mem_limit
-q "queue_name ..."
-S
stack_limit
-t term_time
-w dependency_expression
-W
run_limit[/host_name| /host_model]
checkpoint_dir [job_ID |
"job_ID[index]"]
DESCRIPTION
Restarts a checkpointed job using the checkpoint files saved in
checkpoint_dir/last_job_ID/. Only jobs that have been successfully checkpointed can
be restarted.
Jobs are re-submitted and assigned a new job ID. The checkpoint directory is renamed
using the new job ID, checkpoint_dir/new_job_ID/.
By default, jobs are restarted with the same output file and file transfer specifications,
job name, window signal value, checkpoint directory and period, and rerun options as
the original job.
To restart a job on another host, both hosts must be binary compatible, run the same
OS version, have access to the executable, have access to all open files (LSF must locate
them with an absolute path name), and have access to the checkpoint directory.
The environment variable LSB_RESTART is set to Y when a job is restarted.
LSF invokes the
erestart(8) executable found in LSF_SERVERDIR to perform the
restart.
Only the
bsub options listed here can be used with brestart.