Platform LSF Administration Guide Version 6.2
LSF Job Termination Reason Logging
Administering Platform LSF
598
Termination reasons displayed by bacct
When LSF detects that a job is terminated, bacct -l displays one of the following
termination reasons:
See lsbatch.h for the mapping between the integer value logged to lsb.acct and
termination reason keyword.
Example output of bacct and bhist
Keyword Reason
TERM_ADMIN Job killed by root or LSF administrator
TERM_CHKPNT: Job killed after checkpointing
TERM_CPULIMIT Job killed after reaching LSF CPU usage limit
TERM_DEADLINE Job killed after deadline expires
TERM_EXTERNAL_SIGNAL Job killed by a signal external to LSF
TERM_FORCE_ADMIN Job killed by root or LSF administrator without time for
cleanup
TERM_FORCE_OWNER Job killed by owner without time for cleanup
TERM_LOAD Job killed after load exceeds threshold
TERM_MEMLIMIT Job killed after reaching LSF memory usage limit
TERM_OWNER Job killed by owner
TERM_PREEMPT Job killed after preemption
TERM_PROCESSLIMIT Job killed after reaching LSF process limit
TERM_REQUEUE_ADMIN Job killed and requeued by root or LSF administrator
TERM_REQUEUE_OWNER Job killed and requeued by owner
TERM_RUNLIMIT Job killed after reaching LSF run time limit
TERM_SLURM Job terminated abnormally in SLURM (node failure)
TERM_SWAP Job killed after reaching LSF swap usage limit
TERM_THREADLIMIT Job killed after reaching LSF thread limit
TERM_WINDOW Job killed after queue run window closed
TERM_ZOMBIE Job exited while LSF is not available
Termination cause Termination reason in bacct –l Example bhist output
bkill -s KILL
bkill job_ID
Completed <exit>; TERM_OWNER or
TERM_ADMIN
Thu Mar 13 17:32:05: Signal <KILL>
requested by user or administrator
<user2>;
Thu Mar 13 17:32:06: Exited by signal 2.
The CPU time used is 0.1 seconds;
bkill –r Completed <exit>; TERM_FORCE_ADMIN
or TERM_FORCE_OWNER when sbatchd is
not reachable.
Otherwise, TERM_USER or
TERM_ADMIN
Thu Mar 13 17:32:05: Signal <KILL>
requested by user or administrator
<user2>;
Thu Mar 13 17:32:06: Exited by signal 2.
The CPU time used is 0.1 seconds;