Platform LSF Administration Guide Version 6.2
Chapter 44
Troubleshooting and Error Messages
Administering Platform LSF
619
Setting Daemon Timing Levels
The timing log level for LSF daemons is set in lsf.conf with the parameters
LSB_TIME_CMD, LSB_TIME_MBD, LSB_TIME_SBD, LSB_TIME_SCH,
LSF_TIME_LIM, LSF_TIME_RES.
The location of log files is specified with the parameter LSF_LOGDIR in
lsf.conf.
Timing is included in the same log files as messages.
To change the timing log level, you need to stop any running daemons, change
lsf.conf, and then restart the daemons.
It is useful to track timing to evaluate the performance of the LSF system. You can use
the
lsadmin and badmin commands to temporarily change the timing log level for
specific daemons such as LIM, RES,
mbatchd, sbatchd, and mbschd without
changing
lsf.conf.
How the timing
log level takes
effect
The timing log level you set will only be in effect from the time you set it until you
turn the timing log level off or the daemon stops running, whichever is sooner. If the
daemon is restarted, its timing log level is reset back to the value of the
corresponding parameter for the daemon (LSB_TIME_MBD, LSB_TIME_SBD,
LSF_TIME_LIM, LSF_TIME_RES). Timing log messages are stored in the same file as
other log messages in the directory specified with the parameter LSF_LOGDIR in
lsf.conf.
Limitations
When debug or timing level is set for RES with lsadmin resdebug, or lsadmin
restime
, the debug level only affects root RES. The root RES is the RES that runs
under the root user ID.
An application RES always uses
lsf.conf to set the debug environment. An
application RES is the RES that has been created by
sbatchd to service jobs and run
under the ID of the user who submitted the job.
This means that any RES that has been launched automatically by the LSF system will
not be affected by temporary debug or timing settings. The application RES will retain
settings specified in
lsf.conf.
Timing level commands for daemons
The total execution time of a function in the LSF system is recorded to evaluate
response time of jobs submitted locally or remotely.
The following commands set temporary timing options for LIM, RES,
mbatchd,
sbatchd, and mbschd.
lsadmin limtime
[
-l
timing_level
] [
-f
logfile_name
] [
-o
] [
host_name
]
lsadmin restime
[
-l
timing_level
] [
-f
logfile_name
] [
-o
] [
host_name
]
badmin mbdtime
[
-l
timing_level
] [
-f
logfile_name
] [
-o
]
badmin sbdtime
[
-l
timing_level
] [
-f
logfile_name
] [
-o
] [
host_name
]
badmin schdtime
[
-l
timing_level
] [
-f
logfile_name
] [
-o
]
For debug level examples, see “Setting Daemon Message Log to Debug Level” on
page 616.
For a detailed description of
lsadmin and badmin, see the Platform LSF Reference.