D–Troubleshooting
QLogic MPI Troubleshooting
D-26 IB6054601-00 H
General Error Messages
The following message may be generated by ipath_checkout or mpirun:
PSM found 0 available contexts on InfiniPath device
The most likely cause is that processes already running on the cluster are using
all of the available PSM contexts.
Error Messages Generated by mpirun
The following sections describe the mpirun error messages. These messages
are in one of these categories:
Messages from the QLogic MPI (InfiniPath) library
MPI messages
Messages relating to the InfiniPath driver and InfiniBand links
Messages generated by mpirun follow this format:
program_name: message
function_name: message
Messages can also have different prefixes, such as ipath_ or psm_, which
indicate in which part of the software the errors are occurring.
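As a minimal sketch of how this format could be used when scanning mpirun output, the following Python splits a line at the first colon and reports whether the prefix begins with ipath_ or psm_. The function name parse_message and the component labels it returns are illustrative assumptions; they are not part of the QLogic software.

```python
def parse_message(line):
    """Split a 'name: message' line and guess the originating component.

    Hypothetical helper for illustration only; not a QLogic API.
    """
    prefix, sep, message = line.partition(":")
    if not sep:
        # The line does not follow the 'name: message' format.
        return None
    prefix = prefix.strip()
    if prefix.startswith("ipath_"):
        component = "ipath"
    elif prefix.startswith("psm_"):
        component = "psm"
    else:
        # program_name or function_name without a known prefix
        component = "other"
    return prefix, component, message.strip()
```

A line without a colon returns None, so free-form program output can be skipped without special handling.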
Messages from the QLogic MPI (InfiniPath) Library
Messages from the QLogic MPI (InfiniPath) library appear in the mpirun output.
The following messages indicate that a rank value received during connection
setup was higher than the number of ranks (as indicated in the mpirun startup code):
sender rank rank is out of range (notification)
sender rank rank is out of range (ack)
The following error messages indicate internal problems and must be
reported to Technical Support:
unknown frame type type
[n] Src lid error: sender: x, exp send: y
Frame receive from unknown sender. exp. sender = x, came from y
Failed to allocate memory for eager buffer addresses: str
The following error messages usually indicate a hardware or connectivity
problem:
Failed to get IB Unit LID for any unit
Failed to get our IB LID
Failed to get number of Infinipath units
In these cases, try rebooting. If the problem persists, contact Technical Support.
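The three groups of library messages in this section can be told apart mechanically. The following Python is a rough sketch, assuming the message texts appear in the mpirun output as listed here, with the variable fields (rank, type, x, y, str) replaced by actual values. The names RULES and triage and the category labels are hypothetical and chosen for illustration.

```python
import re

# Category labels (illustrative, not defined by the QLogic software).
RANK_RANGE = "rank out of range"
INTERNAL = "internal problem: report to Technical Support"
HARDWARE = "hardware or connectivity: try rebooting"

# Patterns are built from the message texts listed in this section.
RULES = [
    (re.compile(r"sender rank \S+ is out of range \((notification|ack)\)"), RANK_RANGE),
    (re.compile(r"unknown frame type"), INTERNAL),
    (re.compile(r"Src lid error"), INTERNAL),
    (re.compile(r"Frame receive from unknown sender"), INTERNAL),
    (re.compile(r"Failed to allocate memory for eager buffer addresses"), INTERNAL),
    (re.compile(r"Failed to get IB Unit LID for any unit"), HARDWARE),
    (re.compile(r"Failed to get our IB LID"), HARDWARE),
    (re.compile(r"Failed to get number of Infinipath units"), HARDWARE),
]

def triage(line):
    """Return the category of a known library message, or None."""
    for pattern, category in RULES:
        if pattern.search(line):
            return category
    return None
```

Because the patterns use search rather than a full-line match, they still apply when a message carries a program_name or function_name prefix.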