LSF Version 7.3 - Administering Platform LSF

Administering Platform LSF 705
Troubleshooting and Error Messages
Common LSF Problems
This section lists some other common problems with the LIM, RES, mbatchd,
sbatchd, and interactive applications.
Most problems are due to incorrect installation or configuration. Check the error
log files; often the log message points directly to the problem.
LIM dies quietly
1 Run the following command to check for errors in the LIM configuration files.
lsadmin ckconfig -v
This displays most configuration errors. If this does not report any errors,
check in the LIM error log.
LIM unavailable
Sometimes the LIM is up, but executing the lsload command prints the following
error message:
Communication time out.
If the LIM has just been started, this is normal, because the LIM needs time to get
initialized by reading configuration files and contacting other LIMs. If the LIM
does not become available within one or two minutes, check the LIM error log for
the host you are working on.
To prevent communication timeouts when starting or restarting the local LIM,
define the parameter LSF_SERVER_HOSTS in the
lsf.conf file. The client will
contact the LIM on one of the LSF_SERVER_HOSTS and execute the command,
provided that at least one of the hosts defined in the list has a LIM that is up and
running.
When the local LIM is running but there is no master LIM in the cluster, LSF
applications display the following message:
Cannot locate master LIM now, try later.
1 Check the LIM error logs on the first few hosts listed in the Host section of the
lsf.cluster.cluster_name file. If LSF_MASTER_LIST is defined in
lsf.conf, check the LIM error logs on the hosts listed in this parameter
instead.
Master LIM is down
Sometimes the master LIM is up, but executing the lsload or lshosts command
prints the following error message:
Master LIM is down; try later
If the /etc/hosts file on the host where the master LIM is running is configured
with the host name assigned to the loopback IP address (127.0.0.1), LSF client LIMs
cannot contact the master LIM. When the master LIM starts up, it sets its official