Platform LSF Administrator's Primer Version 6.2
Chapter 6
Troubleshooting LSF Problems
Platform LSF Administrator’s Primer
57
If lim has just been started, this is normal; lim needs time to read configuration files
and contact
lim daemons on other hosts. If lim does not respond within one or two
minutes, check the
lim error log (LSF_LOGDIR/lim.log.host_name) for the host
you are working on.
When the local
lim is running but there is no master lim in the cluster, LSF
applications display the following message:
Cannot locate master LIM now, try later.
lim problems can have several causes. Check:
◆
The host is defined in the configuration files:
The host must be listed in the HOSTNAME column of the Host section in
LSF_CONFDIR/lsf.cluster.cluster_name
❖
If the host is an LSF server:
The value in the SERVER column of the Host section in
LSF_CONFDIR/lsf.cluster.cluster_name must be 1.
❖
If the host is an LSF client:
✧
No LSF daemons should be running on the host
✧
The value in the SERVER column of the Host section in
LSF_CONFDIR/lsf.cluster.cluster_name must be 0.
✧
LSF_SERVER_HOSTS variable is set in LSF_CONFDIR/lsf.conf
◆
The LSF environment (LSF_ENVDIR) is set properly.
Use
cshrc.lsf or profile.lsf to set the LSF environment on the host.
◆
LSF licensing.
Run
lsadmin ckconfig -v and correct the problems shown in the command
output.
See problem “2 Host does not have a software license” on page 55 and Chapter 4,
“Working with LSF Licenses” for more information.
◆
Ownership of the LSF files and directories.
The LSF primary administrator should own all LSF directories and most files. The
LSF administration commands
lsadmin and badmin must be owned by root
and have the file permission mode
-rwsr-xr-x (user ID bit for the owner is
setuid).
See problem “5 lsadmin or badmin fails” on page 59 for more information.
◆
The ports for lim and res are not used by any other process, and are set correctly
in one of:
❖
LSF_CONFDIR/lsf.conf
❖
/etc/services
❖
NIS
A common mistake is a typo in port names in
LSF_CONFDIR/lsf.conf.
By default, the following ports are defined in
LSF_CONFDIR/lsf.conf:
❖
LSF_LIM_PORT=6879
❖
LSF_RES_PORT=6878
❖
LSB_MBD_PORT=6881