Platform LSF Administration Guide Version 6.2

Chapter 43
Error and Event Logging
Administering Platform LSF
593
Managing Error Logs
Error logs maintain important information about LSF operations. When you see any
abnormal behavior in LSF, you should first check the appropriate error logs to find out
the cause of the problem.
LSF log files grow over time. These files should occasionally be cleared, either by hand
or using automatic scripts.
Daemon error log
LSF log files are reopened each time a message is logged, so if you rename or remove a
daemon log file, the daemons will automatically create a new log file.
The LSF daemons log messages when they detect problems or unusual situations.
The daemons can be configured to put these messages into files.
The error log file names for the LSF system daemons are:
lim.log.host_name
res.log.host_name
pim.log.host_name
sbatchd.log.host_name
mbatchd.log.host_name
mbschd.log.host_name
LSF daemons log error messages in different levels so that you can choose to log all
messages, or only log messages that are deemed critical. Message logging is controlled
by the parameter LSF_LOG_MASK in
lsf.conf. Possible values for this parameter
can be any log priority symbol that is defined in
/usr/include/sys/syslog.h.
The default value for LSF_LOG_MASK is LOG_WARNING.
Error logging
If the optional LSF_LOGDIR parameter is defined in lsf.conf, error messages from
LSF servers are logged to files in this directory.
If LSF_LOGDIR is defined, but the daemons cannot write to files there, the error log
files are created in
/tmp.
If LSF_LOGDIR is not defined, errors are logged to the system error logs (
syslog)
using the LOG_DAEMON facility.
syslog messages are highly configurable, and the
default configuration varies widely from system to system. Start by looking for the file
/etc/syslog.conf, and read the man pages for syslog(3) and syslogd(1).
If the error log is managed by
syslog, it is probably already being automatically cleared.
If LSF daemons cannot find
lsf.conf when they start, they will not find the definition
of LSF_LOGDIR. In this case, error messages go to
syslog. If you cannot find any
error messages in the log files, they are likely in the
syslog.