Product specifications
Table Of Contents
- Table of Contents
- 1 Introduction
- 2 Feature Overview
- 3 Step-by-Step Cluster Setup and MPI Usage Checklists
- 4 InfiniPath Cluster Setup and Administration
- Introduction
- Installed Layout
- Memory Footprint
- BIOS Settings
- InfiniPath and OpenFabrics Driver Overview
- OpenFabrics Drivers and Services Configuration and Startup
- Other Configuration: Changing the MTU Size
- Managing the InfiniPath Driver
- More Information on Configuring and Loading Drivers
- Performance Settings and Management Tips
- Host Environment Setup for MPI
- Checking Cluster and Software Status
- 5 Using QLogic MPI
- Introduction
- Getting Started with MPI
- QLogic MPI Details
- Use Wrapper Scripts for Compiling and Linking
- Configuring MPI Programs for QLogic MPI
- To Use Another Compiler
- Process Allocation
- mpihosts File Details
- Using mpirun
- Console I/O in MPI Programs
- Environment for Node Programs
- Environment Variables
- Running Multiple Versions of InfiniPath or MPI
- Job Blocking in Case of Temporary InfiniBand Link Failures
- Performance Tuning
- MPD
- QLogic MPI and Hybrid MPI/OpenMP Applications
- Debugging MPI Programs
- QLogic MPI Limitations
- 6 Using Other MPIs
- A mpirun Options Summary
- B Benchmark Programs
- C Integration with a Batch Queuing System
- D Troubleshooting
- Using LEDs to Check the State of the Adapter
- BIOS Settings
- Kernel and Initialization Issues
- OpenFabrics and InfiniPath Issues
- Stop OpenSM Before Stopping/Restarting InfiniPath
- Manual Shutdown or Restart May Hang if NFS in Use
- Load and Configure IPoIB Before Loading SDP
- Set $IBPATH for OpenFabrics Scripts
- ifconfig Does Not Display Hardware Address Properly on RHEL4
- SDP Module Not Loading
- ibsrpdm Command Hangs when Two Host Channel Adapters are Installed but Only Unit 1 is Connected to the Switch
- Outdated ipath_ether Configuration Setup Generates Error
- System Administration Troubleshooting
- Performance Issues
- QLogic MPI Troubleshooting
- Mixed Releases of MPI RPMs
- Missing mpirun Executable
- Resolving Hostname with Multi-Homed Head Node
- Cross-Compilation Issues
- Compiler/Linker Mismatch
- Compiler Cannot Find Include, Module, or Library Files
- Problem with Shell Special Characters and Wrapper Scripts
- Run Time Errors with Different MPI Implementations
- Process Limitation with ssh
- Number of Processes Exceeds ulimit for Number of Open Files
- Using MPI.mod Files
- Extending MPI Modules
- Lock Enough Memory on Nodes When Using a Batch Queuing System
- Error Creating Shared Memory Object
- gdb Gets SIG32 Signal Under mpirun -debug with the PSM Receive Progress Thread Enabled
- General Error Messages
- Error Messages Generated by mpirun
- MPI Stats
- E Write Combining
- F Useful Programs and Files
- G Recommended Reading
- Glossary
- Index

4–InfiniPath Cluster Setup and Administration
Host Environment Setup for MPI
4-28 IB6054601-00 H
S
2. Enter a passphrase for your key pair when prompted. Note that the key
agent does not survive X11 logout or system reboot:
$ ssh-add
3. The following command tells ssh that your key pair should let you in:
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
Edit the ~/.ssh/config file so that it reads like this:
Host*
ForwardAgent yes
ForwardX11 yes
CheckHostIP no
StrictHostKeyChecking no
This file forwards the key agent requests back to your desktop. When you
log into a front end node, you can use ssh to compute nodes without
passwords.
4. Follow your administrator’s cluster policy for setting up ssh-agent on the
machine where you will be running ssh commands. Alternatively, you can
start the ssh-agent by adding the following line to your
~/.bash_profile (or equivalent in another shell):
eval ‘ssh-agent‘
Use back quotes rather than single quotes. Programs started in your login
shell can then locate the
ssh-agent and query it for keys.
5. Finally, test by logging into the front end node, and from the front end node
to a compute node, as follows:
$ ssh frontend_node_name
$ ssh compute_node_name
For more information, see the man pages for ssh(1), ssh-keygen(1),
ssh-add(1), and ssh-agent(1).
Process Limitation with ssh
Process limitation with ssh is primarily an issue when using the mpirun option
-distributed=off. The default setting is now -distributed=on; therefore,
in most cases, ssh process limitations will not be encountered. This limitation for
the -distributed=off case is described in the following paragraph. See
“Process Limitation with ssh” on page D-20 for an example of an error message
associated with this limitation.
MPI jobs that use more than 10 processes per node may encounter an ssh
throttling mechanism that limits the amount of concurrent per-node connections
to 10. If you need to use more processes, you or your system administrator must
increase the value of MaxStartups in your /etc/ssh/sshd_config file.