LSF Version 7.3 - Using Platform LSF HPC

Controlling Allocation and User Authentication for IBM POE Jobs
About POE authentication
Establishing authentication for POE jobs means ensuring that users are permitted to
run parallel jobs on the nodes they intend to use. POE supports two types of user
authentication:
AIX authentication (the default), which uses /etc/hosts.equiv or $HOME/.rhosts
DFS/DCE authentication
When interactive remote login to HPS execution nodes is not allowed, you can still run
parallel jobs under Parallel Environment (PE) through LSF. On a system with restricted
access to the execution nodes, PE jobs under LSF use two wrapper programs to allow
user authentication:
poe_w, a wrapper for the poe driver program
pmd_w, a wrapper for pmd (PE Partition Manager Daemon)
Enabling user authentication for POE jobs
To enable user authentication through the poe_w and pmd_w wrappers, you must set
LSF_HPC_EXTENSIONS="LSB_POE_AUTHENTICATION" in /etc/lsf.conf.
Enforcing node and CPU allocation for POE jobs
To enable POE allocation control, set LSF_HPC_EXTENSIONS="LSB_POE_ALLOCATION" in
/etc/lsf.conf.
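As a quick check of the two settings described above, the following Python sketch reads the text of an lsf.conf file and reports which of the two POE extensions are named in LSF_HPC_EXTENSIONS. It is an illustration only: the function name enabled_poe_extensions is hypothetical, and the sketch assumes that several extension names may be listed in one quoted, space-separated value.

```python
import re

# The two extension names described in this section.
POE_EXTENSIONS = ("LSB_POE_AUTHENTICATION", "LSB_POE_ALLOCATION")


def enabled_poe_extensions(conf_text):
    """Return the POE extension names found in LSF_HPC_EXTENSIONS.

    Hypothetical helper, not part of LSF. Assumes the value is a
    space-separated list, optionally wrapped in double quotes.
    """
    enabled = []
    for line in conf_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        match = re.match(r'^LSF_HPC_EXTENSIONS\s*=\s*"?([^"]*)"?$', line)
        if match:
            names = match.group(1).split()
            # If the parameter appears more than once, the last line wins here.
            enabled = [name for name in POE_EXTENSIONS if name in names]
    return enabled


sample = 'LSF_HPC_EXTENSIONS="LSB_POE_AUTHENTICATION LSB_POE_ALLOCATION"\n'
print(enabled_poe_extensions(sample))
```

To check a real installation, pass the contents of /etc/lsf.conf to the function instead of the sample string.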
poe_w enforces the LSF allocation decision from mbatchd.
For US jobs, swtbl_api and ntbl_api validate network table windows data files with
mbatchd. For IP and US jobs, the poe_w wrapper validates the POE host file with the
information from mbatchd. If the information in these files does not match the
information from mbatchd, the job is terminated.
When LSF_HPC_EXTENSIONS="LSB_POE_ALLOCATION" is set:
poe_w parses the POE host file and validates its contents with information from
mbatchd.
ntbl_api and swtbl_api parse the network table and switch table data files and
validate their contents with information from mbatchd.
Host names from data files must match host names as allocated by LSF.
The number of tasks per node cannot exceed the number of tasks per node as
allocated by LSF.
The total number of tasks cannot exceed the total number of tasks requested at job
submission (bsub -n).
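The three validation rules above can be illustrated with a short Python sketch. It is not the poe_w implementation: the function validate_poe_hosts and its data layout are hypothetical, and the sketch assumes the POE host file lists one host name per task.

```python
from collections import Counter


def validate_poe_hosts(host_file_hosts, allocation, requested_tasks):
    """Return a list of rule violations; an empty list means the file passes.

    host_file_hosts: host names from the POE host file, one per task (assumed)
    allocation:      dict of host name -> tasks allocated by LSF on that host
    requested_tasks: total tasks requested at job submission (bsub -n)
    """
    violations = []
    tasks_per_host = Counter(host_file_hosts)
    for host, count in tasks_per_host.items():
        if host not in allocation:
            # Rule 1: host names must match hosts allocated by LSF.
            violations.append(f"{host}: not allocated by LSF")
        elif count > allocation[host]:
            # Rule 2: tasks per node cannot exceed the LSF allocation.
            violations.append(
                f"{host}: {count} tasks exceed allocated {allocation[host]}"
            )
    if len(host_file_hosts) > requested_tasks:
        # Rule 3: total tasks cannot exceed the number requested.
        violations.append(
            f"total: {len(host_file_hosts)} tasks exceed requested {requested_tasks}"
        )
    return violations


allocation = {"hostA": 2, "hostB": 2}
print(validate_poe_hosts(["hostA", "hostA", "hostB"], allocation, 4))
print(validate_poe_hosts(["hostA", "hostA", "hostA", "hostC"], allocation, 4))
```

The first call prints an empty list. The second reports that hostA carries more tasks than allocated and that hostC was never allocated, which corresponds to the case where LSF would terminate the job.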