LSF Version 7.3 - Administering Platform LSF

Understanding Job States
116 Administering Platform LSF
Exited jobs
An exited job ended with a non-zero exit status.
A job might terminate abnormally for various reasons. Job termination can happen
from any state. An abnormally terminated job goes into EXIT state. The situations
where a job terminates abnormally include:
The job is cancelled by its owner or the LSF administrator while pending, or
after being dispatched to a host.
The job is not able to be dispatched before it reaches its termination deadline,
and thus is aborted by LSF.
The job fails to start successfully. For example, the wrong executable is specified
by the user when the job is submitted.
The application exits with a non-zero exit code.
You can configure hosts so that LSF detects an abnormally high rate of job exit from
a host. See Handling Host-level Job Exceptions on page 98 for more information.
Post-execution states
Some jobs may not be considered complete until some post-job processing is
performed. For example, a job may need to exit from a post-execution job script,
clean up job files, or transfer job output after the job completes.
The DONE or EXIT job states do not indicate whether post-processing is complete,
so jobs that depend on processing may start prematurely. Use the
post_done and
post_err keywords on the bsub -w command to specify job dependency
conditions for job post-processing. The corresponding job states POST_DONE
and POST_ERR indicate the state of the post-processing.
After the job completes, you cannot perform any job control on the
post-processing. Post-processing exit codes are not reported to LSF.
See Chapter 36, “Pre-Execution and Post-Execution Commands” for more
information.