Platform LSF Administration Guide Version 6.2
Cluster Concepts
Administering Platform LSF
58
◆
badmin hrestart—Restarts sbatchd
Configuration
◆
Port number defined in lsf.conf
res
Remote Execution Server (RES) running on each server host. Accepts remote execution
requests to provide transparent and secure remote execution of jobs and tasks.
Commands
◆
lsadmin resstartup—Starts res
◆
lsadmin resshutdown—Shuts down res
◆
lsadmin resrestart—Restarts res
Configuration
◆
Port number defined in lsf.conf
lim
Load Information Manager (LIM) running on each server host. Collects host load and
configuration information and forwards it to the master LIM running on the master
host. Reports the information displayed by
lsload and lshosts.
Static indices are reported when the LIM starts up or when the number of CPUs
(
ncpus) change. Static indices are:
◆
Number of CPUs (ncpus)
◆
Number of disks (ndisks)
◆
Total available memory (maxmem)
◆
Total available swap (maxswp)
◆
Total available temp (maxtmp)
Dynamic indices for host load collected at regular intervals are:
◆
Hosts status (status)
◆
15 second, 1 minute, and 15 minute run queue lengths (r15s, r1m, and r15m)
◆
CPU utilization (ut)
◆
Paging rate (pg)
◆
Number of login sessions (ls)
◆
Interactive idle time (it)
◆
Available swap space (swp)
◆
Available memory (mem)
◆
Available temp space (tmp)
◆
Disk IO rate (io)
Commands
◆
lsadmin limstartup—Starts LIM
◆
lsadmin limshutdown—Shuts down LIM
◆
lsadmin limrestart—Restarts LIM
◆
lsload—View dynamic load values
◆
lshosts—View static host load values
Configuration
◆
Port number defined in lsf.conf.