D–Troubleshooting
D-6 IB6054601-00 H
If the driver loaded but MPI or other programs are not working, check whether
problems were detected during driver and QLogic hardware initialization with
the command:
$ dmesg | grep -i ipath
This command may generate more than one screen of output.
Also, check the link status with the command:
$ cat /sys/class/infiniband/ipath*/device/status_str
These commands are normally executed by the ipathbug-helper script, but
running them separately may help locate the problem.
See also “status_str” on page F-15 and “ipath_checkout” on page F-7.
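The link status check above can be wrapped in a small loop so that each device's status string is printed next to the file it came from. This is a minimal sketch, not part of the InfiniPath tools: the function name show_ipath_status and its optional sysfs root argument are assumptions introduced here so the loop can also be pointed at a test directory.

```shell
#!/bin/sh
# Print the status string of every InfiniPath device found under a sysfs root.
# The function name and the optional root argument (default /sys) are
# hypothetical helpers for this sketch, not part of the InfiniPath software.
show_ipath_status() {
    root="${1:-/sys}"
    found=0
    for f in "$root"/class/infiniband/ipath*/device/status_str; do
        # Skip the unexpanded glob pattern when no device is present.
        [ -r "$f" ] || continue
        found=1
        printf '%s: %s\n' "$f" "$(cat "$f")"
    done
    if [ "$found" -eq 0 ]; then
        echo "no ipath status_str files found under $root" >&2
        return 1
    fi
}
```

Calling show_ipath_status with no argument reads from /sys. A non-zero return means no status_str file was found, which may suggest that the driver did not create the device entries.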
MPI Job Failures Due to Initialization Problems
If one or more nodes do not have the interconnect in a usable state, messages
similar to the following appear when the MPI program is started:
userinit: userinit ioctl failed: Network is down [1]: device init failed
userinit: userinit ioctl failed: Fatal Error in keypriv.c(520): device init failed
These messages may indicate that a cable is not connected, the switch is down,
the subnet manager (SM) is not running, or a hardware error occurred.
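When MPI output is captured to a file, the initialization failures can be picked out with grep. The sketch below assumes the job output was saved to a log file. The function name show_init_failures is hypothetical, and the match strings are taken from the sample messages above.

```shell
#!/bin/sh
# Print lines in a saved MPI job log that look like the initialization
# failures shown above. Returns 0 if at least one such line is found.
# The function name is a hypothetical helper for this sketch.
show_init_failures() {
    grep -e 'userinit ioctl failed' -e 'device init failed' "$1"
}
```

For example, show_init_failures mpi_job.log (where mpi_job.log is a placeholder name) prints any matching lines. If it prints nothing and returns non-zero, the log contains no messages of this kind.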
OpenFabrics and InfiniPath Issues
The following sections cover issues related to OpenFabrics (including OpenSM)
and InfiniPath.
Stop OpenSM Before Stopping/Restarting InfiniPath
OpenSM must be stopped before stopping or restarting InfiniPath. If OpenSM is
still running, the stop command fails with error messages like the following:
# /etc/init.d/openibd stop
Unloading infiniband modules: sdp cm umad uverbs ipoib sa ipath mad
coreFATAL:Module ib_umad is in use.
Unloading infinipath modules FATAL: Module ib_ipath is in use.
[FAILED]
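The required ordering can be sketched as a small wrapper that stops OpenSM first and only then stops the OpenFabrics and InfiniPath modules. This is a sketch under stated assumptions: the OpenSM init script is assumed to be named opensmd and to live alongside openibd, and the INIT_DIR variable and the function name stop_ib_stack are hypothetical. Verify the script names on your installation.

```shell
#!/bin/sh
# Stop OpenSM first, then the OpenFabrics/InfiniPath stack.
# Assumptions: the OpenSM init script is called "opensmd" and sits in the
# same directory as openibd; INIT_DIR and stop_ib_stack are hypothetical names.
stop_ib_stack() {
    init_dir="${INIT_DIR:-/etc/init.d}"
    # Stop OpenSM only if its init script exists on this node.
    if [ -x "$init_dir/opensmd" ]; then
        "$init_dir/opensmd" stop || return 1
    fi
    # With OpenSM stopped, ib_umad should no longer be held in use.
    "$init_dir/openibd" stop
}
```

Run stop_ib_stack as root. If stopping OpenSM fails, the function returns without attempting to unload the modules.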