Platform LSF Administration Guide Version 6.2
Cluster Concepts
Administering Platform LSF
54
Cluster Concepts
Clusters, jobs, and queues
Cluster
A group of computers (hosts) running LSF that work together as a single unit,
combining computing power and sharing workload and resources. A cluster provides a
single-system image for disparate computing resources.
Hosts can be grouped into clusters in a number of ways. A cluster could contain:
◆
All the hosts in a single administrative group
◆
All the hosts on one file server or sub-network
◆
Hosts that perform similar functions
Commands
◆
lshosts—View static resource information about hosts in the cluster
◆
bhosts—View resource and job information about server hosts in the cluster
◆
lsid—View the cluster name
◆
lsclusters—View cluster status and size
Configuration
◆
Define hosts in your cluster in lsf.cluster.cluster_name
The name of your cluster should be unique. It should not be the same as any host or
queue.
Job
A unit of work run in the LSF system. A job is a command submitted to LSF for
execution. LSF schedules, controls, and tracks the job according to configured policies.
Jobs can be complex problems, simulation scenarios, extensive calculations, anything
that needs compute power.
Commands
◆
bjobs—View jobs in the system
◆
bsub—Submit jobs
Compute Host
Compute Host
Compute Host
Commands
Submission Host
Master Host