LSF Version 7.3 - Using Platform LSF HPC

What Is Platform LSF?
Platform LSF™ HPC (“LSF”) is the distributed workload management solution for
maximizing the performance of High Performance Computing (HPC) clusters.
Platform LSF HPC is fully integrated with Platform LSF, the industry-standard workload
management software product, to provide load sharing in a distributed system and batch
scheduling for compute-intensive jobs. Platform LSF provides support for:
- Dynamic resource discovery and allocation (resource reservation) for parallel batch
  job execution
- Full job-level control of the distributed processes, so that no process becomes
  unmanaged. This reduces the risk of one parallel job severely disrupting an
  organization's computing service
- The standard MPI interface
- Full integration with Platform LSF, providing heterogeneous resource-based batch
  job scheduling, including job-level resource usage enforcement
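
As a rough illustration of resource reservation and job-level usage enforcement for a parallel batch job, the sketch below assembles a bsub command line. It assumes the standard LSF bsub options -n (number of processors), -R with an rusage[] string (resource reservation), -M (memory limit), and -W (run time limit). The function name build_bsub_command, its parameter names, and the application ./my_mpi_app are hypothetical, and the units of the memory values depend on how the cluster is configured.

```python
import shlex


def build_bsub_command(command, num_procs, mem_reserve=None,
                       mem_limit=None, run_limit_min=None):
    """Return an argv list that submits `command` as a parallel batch job.

    Hypothetical helper: it only builds the command line and does not run it.
    """
    argv = ["bsub", "-n", str(num_procs)]  # -n: processors requested for the job
    if mem_reserve is not None:
        # rusage[] in a resource requirement string asks LSF to reserve memory
        argv += ["-R", f"rusage[mem={mem_reserve}]"]
    if mem_limit is not None:
        argv += ["-M", str(mem_limit)]  # -M: memory usage limit for the job
    if run_limit_min is not None:
        argv += ["-W", str(run_limit_min)]  # -W: run time limit in minutes
    argv += shlex.split(command)  # the job command and its arguments
    return argv


# Example: 8 processors, reserve memory, and cap the run time at one hour.
argv = build_bsub_command("./my_mpi_app input.dat", 8,
                          mem_reserve=512, run_limit_min=60)
print(shlex.join(argv))
```

Building the command as a list rather than a single string keeps the resource requirement string intact when it is later passed to a process launcher such as subprocess.run.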

Advanced HPC scheduling policies
Platform LSF enhances the job management capability of your cluster through
advanced scheduling policies such as:
- Policy-based job preemption
- Advance reservation
- Memory and processor reservation
- Memory and processor backfill
- Cluster-wide resource allocation limits
- User and project-based fairshare scheduling
- Topology-aware scheduling
Resource collection agents run on every node to collect resource information such as
processor load, memory availability, interconnect states, and other host-specific and
cluster-wide resources. These agents coordinate to create a single system image of the
cluster.
The scheduler supports advanced HPC scheduling policies that match user demand with
resource supply.
Job control covers sequential and parallel jobs running on the same host and across
hosts: you can terminate, suspend, resume, and send signals to them, and you can
configure and monitor job-level and system-wide CPU, memory, swap, and other runtime
resource usage limits.
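
The job control actions listed above (terminate, suspend, resume, send signals) can be sketched as a small mapping onto LSF commands. This assumes the standard commands bkill, bstop, and bresume, and the bkill -s option for sending a named signal. The function job_control_command and its action names are hypothetical and exist only for illustration.

```python
def job_control_command(action, job_id, signal_name=None):
    """Return the argv list for a job control action on one LSF job.

    Hypothetical helper: it only builds the command line and does not run it.
    """
    job = str(job_id)
    if action == "terminate":
        return ["bkill", job]  # kill the job
    if action == "suspend":
        return ["bstop", job]  # suspend the job
    if action == "resume":
        return ["bresume", job]  # resume a suspended job
    if action == "signal":
        if not signal_name:
            raise ValueError("a signal name is required for the 'signal' action")
        return ["bkill", "-s", signal_name, job]  # send a specific signal
    raise ValueError(f"unknown action: {action!r}")


# Example: suspend job 1234, then resume it later.
print(job_control_command("suspend", 1234))
print(job_control_command("resume", 1234))
```

Because LSF tracks a parallel job as one unit, the same job ID is used whether the job's processes run on one host or on several.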

Application integration support
Packaged application integrations and tailored HPC configurations make Platform LSF
well suited to Industrial Manufacturing, Life Sciences, Government, and Research sites
that run large-scale parallel modeling and simulation applications involving large
amounts of data. Platform LSF helps Computer-Aided Engineering (CAE) users reduce the
cost of manufacturing and increase both engineer productivity and the quality of results.
Platform LSF is integrated to work out of the box with many HPC applications, such as
LSTC LS-Dyna, FLUENT, ANSYS, MSC Nastran, Gaussian, Lion Bioscience SRS, and
NCBI BLAST.