LSF Version 7.3 - Administering Platform LSF
Administering Platform LSF 203
Managing LSF on Platform EGO
Logging and troubleshooting
LSF log files
LSF event and
account log
location
LSF uses directories for temporary work files, log files and transaction files and
spooling.
LSF keeps track of all jobs in the system by maintaining a transaction log in the
work subtree. The LSF log files are found in the directory
LSB_SHAREDIR/cluster_name/logdir.
The following files maintain the state of the LSF system:
lsb.events LSF uses the lsb.events file to keep track of the state of all jobs. Each job is a
transaction from job submission to job completion. LSF system keeps track of
everything associated with the job in the
lsb.events file.
lsb.events.n The events file is automatically trimmed and old job events are stored in
lsb.event.n files. When mbatchd starts, it refers only to the lsb.events file, not
the
lsb.events.n files. The bhist command can refer to these files.
LSF error log
location
If the optional LSF_LOGDIR parameter is defined in lsf.conf, error messages
from LSF servers are logged to files in this directory.
If LSF_LOGDIR is defined, but the daemons cannot write to files there, the error
log files are created in
/tmp.
If LSF_LOGDIR is not defined, errors are logged to the system error logs (
syslog)
using the LOG_DAEMON facility.
syslog messages are highly configurable, and
the default configuration varies widely from system to system. Start by looking for
the file
/etc/syslog.conf, and read the man pages for syslog(3) and
syslogd(1).
If the error log is managed by
syslog, it is probably already being automatically
cleared.
If LSF daemons cannot find
lsf.conf when they start, they will not find the
definition of LSF_LOGDIR. In this case, error messages go to
syslog. If you cannot
find any error messages in the log files, they are likely in the
syslog.
LSF daemon error
logs
LSF log files are reopened each time a message is logged, so if you rename or remove
a daemon log file, the daemons will automatically create a new log file.
The LSF daemons log messages when they detect problems or unusual situations.
The daemons can be configured to put these messages into files.
The error log file names for the LSF system daemons are:
◆ res.log.host_name
◆ sbatchd.log.host_name
◆ mbatchd.log.host_name
◆ mbschd.log.host_name
LSF daemons log error messages in different levels so that you can choose to log all
messages, or only log messages that are deemed critical. Message logging for LSF
daemons is controlled by the parameter LSF_LOG_MASK in
lsf.conf. Possible