Product specifications
Table Of Contents
- Table of Contents
- 1 Introduction
- 2 Feature Overview
- 3 Step-by-Step Cluster Setup and MPI Usage Checklists
- 4 InfiniPath Cluster Setup and Administration
- Introduction
- Installed Layout
- Memory Footprint
- BIOS Settings
- InfiniPath and OpenFabrics Driver Overview
- OpenFabrics Drivers and Services Configuration and Startup
- Other Configuration: Changing the MTU Size
- Managing the InfiniPath Driver
- More Information on Configuring and Loading Drivers
- Performance Settings and Management Tips
- Host Environment Setup for MPI
- Checking Cluster and Software Status
- 5 Using QLogic MPI
- Introduction
- Getting Started with MPI
- QLogic MPI Details
- Use Wrapper Scripts for Compiling and Linking
- Configuring MPI Programs for QLogic MPI
- To Use Another Compiler
- Process Allocation
- mpihosts File Details
- Using mpirun
- Console I/O in MPI Programs
- Environment for Node Programs
- Environment Variables
- Running Multiple Versions of InfiniPath or MPI
- Job Blocking in Case of Temporary InfiniBand Link Failures
- Performance Tuning
- MPD
- QLogic MPI and Hybrid MPI/OpenMP Applications
- Debugging MPI Programs
- QLogic MPI Limitations
- 6 Using Other MPIs
- A mpirun Options Summary
- B Benchmark Programs
- C Integration with a Batch Queuing System
- D Troubleshooting
- Using LEDs to Check the State of the Adapter
- BIOS Settings
- Kernel and Initialization Issues
- OpenFabrics and InfiniPath Issues
- Stop OpenSM Before Stopping/Restarting InfiniPath
- Manual Shutdown or Restart May Hang if NFS in Use
- Load and Configure IPoIB Before Loading SDP
- Set $IBPATH for OpenFabrics Scripts
- ifconfig Does Not Display Hardware Address Properly on RHEL4
- SDP Module Not Loading
- ibsrpdm Command Hangs when Two Host Channel Adapters are Installed but Only Unit 1 is Connected to the Switch
- Outdated ipath_ether Configuration Setup Generates Error
- System Administration Troubleshooting
- Performance Issues
- QLogic MPI Troubleshooting
- Mixed Releases of MPI RPMs
- Missing mpirun Executable
- Resolving Hostname with Multi-Homed Head Node
- Cross-Compilation Issues
- Compiler/Linker Mismatch
- Compiler Cannot Find Include, Module, or Library Files
- Problem with Shell Special Characters and Wrapper Scripts
- Run Time Errors with Different MPI Implementations
- Process Limitation with ssh
- Number of Processes Exceeds ulimit for Number of Open Files
- Using MPI.mod Files
- Extending MPI Modules
- Lock Enough Memory on Nodes When Using a Batch Queuing System
- Error Creating Shared Memory Object
- gdb Gets SIG32 Signal Under mpirun -debug with the PSM Receive Progress Thread Enabled
- General Error Messages
- Error Messages Generated by mpirun
- MPI Stats
- E Write Combining
- F Useful Programs and Files
- G Recommended Reading
- Glossary
- Index

5–Using QLogic MPI
QLogic MPI Details
5-20 IB6054601-00 H
S
Running Multiple Versions of InfiniPath or MPI
The variable MPICH_ROOT sets a root prefix for all InfiniPath-related paths. It is
used by mpirun to try to find the mpirun-ipath-ssh executable, and it also
sets up the LD_LIBRARY_PATH for new programs. Consequently, multiple
versions of the InfiniPath software releases can be installed on some or all nodes,
and QLogic MPI and other versions of MPI can be installed at the same time. It
may be set in the environment, in mpirun.defaults, or in an rcfile (such
as .mpirunrc, .bashrc, or .cshrc) that will be invoked on remote nodes.
If you have installed the software into an alternate location using the --prefix
option with rpm, --prefix would have been set to $MPICH_ROOT.
If MPICH_ROOT is not set, the normal PATH is used unless mpirun is invoked with
a full pathname.
Job Blocking in Case of Temporary InfiniBand Link Failures
By default, as controlled by mpirun’s quiescence parameter -q, an MPI job is
killed for quiescence in the event of an IB link failure (or unplugged cable). This
quiescence timeout occurs under one of the following conditions:
A remote rank’s process cannot reply to out-of-band process checks.
MPI is inactive on the IB link for more than 15 minutes.
PSM_SHAREDCONTEXTS_MAX This variable restricts the number of InfiniPath
contexts that are made available on each node of
an MPI job.
Default:
PSM_SHAREDCONTEXTS_MAX=8 (QHT7140)
PSM_SHAREDCONTEXTS_MAX=4 (QLE7140)
Up to 16 on (QLE7240 and QLE7280; set auto-
matically based on number of CPUs on node)
NOTE:
mpirun-ssh was renamed mpirun-ipath-ssh to avoid name conflicts
with other MPI implementations.
Table 5-7. Environment Variables (Continued)
Name Description