HP XC System Software Administration Guide Version 3.1
20.4.3 InfiniBand System Interconnect Troubleshooting
The following troubleshooting information applies to InfiniBand system interconnects. Perform these
steps on any node where you suspect a problem to determine whether the HP XC system is configured
properly. If these tests pass but the problem persists, see Chapter 19: Using Diagnostic Tools
(page 215).
1. Run the /usr/mellanox/bin/vstat command to verify that the InfiniBand system interconnect
is connected and operating correctly:
# /usr/mellanox/bin/vstat
1 HCA found:
hca_id=InfiniHost0
vendor_id=0x02C9
vendor_part_id=0x5A44
hw_ver=0xA1
fw_ver=0x300000000
num_phys_ports=2
port=1
port_state=PORT_ACTIVE
sm_lid=0x0001
port_lid=0x0002
port_lmc=0x00
max_mtu=2048
port=2
port_state=PORT_DOWN
sm_lid=0x0000
port_lid=0x0000
port_lmc=0x00
max_mtu=2048
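The port check in this step can be scripted. The following is a minimal sketch, assuming that vstat prints the port= and port_state= lines in the format shown above; the function name port_states is hypothetical and is not part of the HP XC or Mellanox software.

```shell
#!/bin/sh
# port_states: read vstat-style output on standard input and print
# one "<port> <state>" line for each port.
# Assumption: lines look like "port=1" and "port_state=PORT_ACTIVE",
# as in the sample vstat output in this section.
port_states() {
    awk -F= '
        /^[ \t]*port=/       { port = $2 }         # remember the current port number
        /^[ \t]*port_state=/ { print port, $2 }    # report its state
    '
}
```

For example, `/usr/mellanox/bin/vstat | port_states` would print `1 PORT_ACTIVE` and `2 PORT_DOWN` for the sample output above. Appending `| grep PORT_ACTIVE` is one way to confirm that at least one port is active.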
2. Run the ib-setup command to verify that the configuration is correct. The output should be similar
to that in the following example:
# ib-setup
====== Voltaire HCA400 InfiniBand Stack Setup ======
Version: ibhost-v2.1.5_5_itapi: date on amt152.domain.
System: kernel version: 2.4.21-15.10hp.XCsmp, memory 3595MB.
Hostname: n3.
IB configuration: AutoStart: on, SM: VFM.
HCA400 Firmware version: 0x300000000.
HCA Ports status: 1 - PORT_ACTIVE, 2 - PORT_DOWN
            IPoIB (config)    IPoIB (active)
ip-addr:    172.22.0.3        172.22.0.3
netmask:    255.255.0.0       255.255.0.0
broadcast:  172.22.255.255    172.22.255.255
mtu:        2044              2044
1) Auto-start   4) Firmware-update   7) Exit
2) IPoIB        5) Start
3) Fabric       6) Stop
3. Enter 7 to exit.
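When you review the IPoIB values that ib-setup reports, the broadcast address should agree with the IP address and netmask. The following is a minimal sketch of that arithmetic for dotted-decimal IPv4 values; the function name broadcast_addr is hypothetical and is not part of the HP XC or Voltaire software.

```shell
#!/bin/sh
# broadcast_addr: print the IPv4 broadcast address for a given
# address and netmask (both in dotted-decimal notation).
# Each broadcast octet is the address octet with every host bit set,
# that is: address octet OR (255 - netmask octet).
broadcast_addr() {
    ip=$1
    mask=$2
    old_ifs=$IFS
    IFS=.
    # Split both values on "." so that $1..$4 hold the address octets
    # and $5..$8 hold the netmask octets.
    set -- $ip $mask
    IFS=$old_ifs
    echo "$(( $1 | (255 - $5) )).$(( $2 | (255 - $6) )).$(( $3 | (255 - $7) )).$(( $4 | (255 - $8) ))"
}
```

With the values from the sample output, `broadcast_addr 172.22.0.3 255.255.0.0` prints `172.22.255.255`, which matches the broadcast address shown in both the config and active columns.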
4. Make sure that you are running an HP XC kernel. The HP XC kernels are identified by the presence
of XC in the kernel name: