LSF Version 7.3 - Administering Platform LSF

Administering Platform LSF 501
Running Parallel Jobs
submits myjob as a parallel job. The job is started when 4 job slots are available.
Starting Parallel Tasks with LSF Utilities
For simple parallel jobs you can use LSF utilities to start parts of the job on other
hosts. Because LSF utilities handle signals transparently, LSF can suspend and
resume all components of your job without additional programming.
Running parallel tasks with lsgrun
The simplest parallel job runs an identical copy of the executable on every host. The
lsgrun command takes a list of host names and runs the specified task on each
host. The
lsgrun -p command specifies that the task should be run in parallel on
each host.
Example This example submits a job that uses lsgrun to run myjob on all the selected hosts
in parallel:
bsub -n 10 ’lsgrun -p -m "$LSB_HOSTS" myjob’
Job <3856> is submitted to default queue <normal>.
For more complicated jobs, you can write a shell script that runs lsrun in the
background to start each component.
Running parallel tasks with the blaunch distributed application framework
Most MPI implementations and many distributed applications use rsh and ssh as
their task launching mechanism. The
blaunch command provides a drop-in
replacement for
rsh and ssh as a transparent method for launching parallel and
distributed applications within LSF.
Similar to the
lsrun command, blaunch transparently connects directly to the
RES/SBD on the remote host, and subsequently creates and tracks the remote tasks,
and provides the connection back to LSF. There is no need to insert pam or
taskstarter into the
rsh or ssh calling sequence, or configure any wrapper scripts.
IMPORTANT: You cannot run blaunch directly from the command line.
blaunch
only works within an LSF job; it can only be used to launch tasks on
remote hosts that are part of a job allocation. It cannot be used as a standalone
command. On success
blaunch exits with 0.
blaunch is not supported on Windows.
See Using Platform LSF HPC for more information about using the blaunch
distributed applilcation framework.
Submitting jobs
with blaunch
Use bsub to call blaunch, or to invoke a job script that calls blaunch. The blaunch
command assumes that
bsub -n implies one remote task per job slot.
Submit a parallel job:
bsub -n 4 blaunch myjob
Submit a parallel job to launch tasks on a specific host:
bsub -n 4 blaunch hostA myjob