Product specifications
Table Of Contents
- Table of Contents
- 1 Introduction
- 2 Feature Overview
- 3 Step-by-Step Cluster Setup and MPI Usage Checklists
- 4 InfiniPath Cluster Setup and Administration
- Introduction
- Installed Layout
- Memory Footprint
- BIOS Settings
- InfiniPath and OpenFabrics Driver Overview
- OpenFabrics Drivers and Services Configuration and Startup
- Other Configuration: Changing the MTU Size
- Managing the InfiniPath Driver
- More Information on Configuring and Loading Drivers
- Performance Settings and Management Tips
- Host Environment Setup for MPI
- Checking Cluster and Software Status
- 5 Using QLogic MPI
- Introduction
- Getting Started with MPI
- QLogic MPI Details
- Use Wrapper Scripts for Compiling and Linking
- Configuring MPI Programs for QLogic MPI
- To Use Another Compiler
- Process Allocation
- mpihosts File Details
- Using mpirun
- Console I/O in MPI Programs
- Environment for Node Programs
- Environment Variables
- Running Multiple Versions of InfiniPath or MPI
- Job Blocking in Case of Temporary InfiniBand Link Failures
- Performance Tuning
- MPD
- QLogic MPI and Hybrid MPI/OpenMP Applications
- Debugging MPI Programs
- QLogic MPI Limitations
- 6 Using Other MPIs
- A mpirun Options Summary
- B Benchmark Programs
- C Integration with a Batch Queuing System
- D Troubleshooting
- Using LEDs to Check the State of the Adapter
- BIOS Settings
- Kernel and Initialization Issues
- OpenFabrics and InfiniPath Issues
- Stop OpenSM Before Stopping/Restarting InfiniPath
- Manual Shutdown or Restart May Hang if NFS in Use
- Load and Configure IPoIB Before Loading SDP
- Set $IBPATH for OpenFabrics Scripts
- ifconfig Does Not Display Hardware Address Properly on RHEL4
- SDP Module Not Loading
- ibsrpdm Command Hangs when Two Host Channel Adapters are Installed but Only Unit 1 is Connected to the Switch
- Outdated ipath_ether Configuration Setup Generates Error
- System Administration Troubleshooting
- Performance Issues
- QLogic MPI Troubleshooting
- Mixed Releases of MPI RPMs
- Missing mpirun Executable
- Resolving Hostname with Multi-Homed Head Node
- Cross-Compilation Issues
- Compiler/Linker Mismatch
- Compiler Cannot Find Include, Module, or Library Files
- Problem with Shell Special Characters and Wrapper Scripts
- Run Time Errors with Different MPI Implementations
- Process Limitation with ssh
- Number of Processes Exceeds ulimit for Number of Open Files
- Using MPI.mod Files
- Extending MPI Modules
- Lock Enough Memory on Nodes When Using a Batch Queuing System
- Error Creating Shared Memory Object
- gdb Gets SIG32 Signal Under mpirun -debug with the PSM Receive Progress Thread Enabled
- General Error Messages
- Error Messages Generated by mpirun
- MPI Stats
- E Write Combining
- F Useful Programs and Files
- G Recommended Reading
- Glossary
- Index

IB6054601-00 H Glossary-3
Glossary
latency — mpihosts file
A
latency
The delay inherent in processing network
data. In terms of MPI, it is the time
required to send a message from one
node to another, independent of message
size. Latency can be further split into
sender and receiver processing
overheads, as well as wire and switch
overhead.
launch node
Same as front end node
layered driver
A driver that does not directly manage any
target devices. The layered driver calls
another driver’s routines, which in turn
manage the target devices.
LID
Stands for Local Identifier. Assigned by the
Subnet Manager (SM) to each visible node
within a single InfiniBand fabric. It is similar
conceptually to an IP address for TCP/IP.
Lustre
Open source project to develop scalable
cluster file systems
MAC Address
Stands for Media Access Control Address.
It is a unique identifier attached to most
forms of networking equipment.
machines file
Same as mpihosts file
MADs
Stands for Management Datagrams.
Subnet Managers (SMs) and Subnet
Management Agents (SMAs) communi-
cate via MADs.
managed switch
A switch that can be configured to run an
embedded Subnet Manager (SM)
MGID
Stands for Multicast Group ID. An identifier
for a multicast group. This can be
assigned by the SM at multicast group
creation time, although frequently it is
chosen by the application or protocol
instead.
MLID
Stands for Multicast Local ID for InfiniBand
multicast. This is the identifier that a
member of a multicast group uses for
addressing messages to other members of
the group.
MPD
Stands for Multi-Purpose Daemon. An
alternative to mpirun to launch MPI jobs,
it provides support for MPICH. Developed
at Argonne National laboratory.
MPI
Stands for Message-Passing Interface.
MPI is a message-passing library or
collection of routines used in distrib-
uted-memory parallel programming. It is
used in data exchange and task synchroni-
zation between processes. The goal of
MPI is to provide portability and efficient
implementation across different platforms
and architectures.
MPICH
A freely available, portable implementation
of MPI
mpihosts file
A file containing a list of the hostnames of
the nodes in a cluster on which node
programs can be run. Also referred to as
node file, hosts file, or machines file.