HP XC System Software Administration Guide Version 2.1

REQUEUE_EXIT_VALUES=122
12.10 LSF-HPC Monitoring
LSF-HPC i s monitored and controlled by N agio s using the check_lsf plug-in.
When LSF-HPC is down, the response of the check_l
sf plug-in depends on whether
LSF-HPC failo ver is enabled or disabled.
When L S F -HPC failover is disabled The check_lsf plug-in returns an imm ediate failure
notification to Nagios.
When LSF-HPC failover is enabled The check_lsf plug-in decides if LSF-HPC is
supposed to be running. If so, it acquires a list
of resou rce managem ent nodes and tries to restart
LSF-HPC on each of those nodes, in turn, until one
succeeds, or until the list is exhausted.
If successful, the check_lsf plug-in returns LSF OK
- restarted message.
If the restart procedure fails
,thecheck_lsf plug-in
returns a failur e notificatio
n.
The follo wing list s the Nagios
messages for LSF failover monitor status:
LSF OK - up
The LSF-H PC environm ent appears to be up and operational on t he HP XC system .
LSF OK - currently shut down
The LSF-HPC environment has not been started on the HP XC system.
LSF WARNING - restarted
The LSF-HPC environment was not running, and should have been; it is b eing restarted.
The m essage should change to LSF OK - up the next tim e Nagios is updated.
LSF CRITICAL - down
LSF-HPC was not foun d, and LSF-HPC failover is disabled.
LSF CRITICAL - {message}
An abnormal pro
blem occurred. The {message} provides useful diagnostic information.
12.10.1 LSF Execution Host Failure
Should the nod
e hosting LSF-HPC becom es unresponsive, the Nagios check_lsf plug-in
takes one of th
e following actions:
If LSF-HPC has been shut down cleanly, the message LSF OK - currently shut
down is disp layed.
If LSF-HPC failover is disabled, the message LSF CRITICAL - down is displayed.
If LSF-HPC fai
lover is enabled, check_lsf starts a new search for the LSF execution
host. The mes
sage LSF warning - restarted indicates success.
If LSF-HPC is down, the message LSF CRITICAL - message? is displayed.
12-12 LSF-HPC Administration