User Manual
Table Of Contents
- Mellanox WinOF VPI User Manual
- Table of Contents
- List of Tables
- Document Revision History
- About this Manual
- 1 Introduction
- 2 Firmware Upgrade
- 3 Driver Features
- 3.1 Hyper-V with VMQ
- 3.2 Header Data Split
- 3.3 Receive Side Scaling (RSS)
- 3.4 Port Configuration
- 3.5 Load Balancing, Fail-Over (LBFO) and VLAN
- 3.6 Ports TX Arbitration
- 3.7 RDMA over Converged Ethernet (RoCE)
- 3.8 Network Virtualization using Generic Routing Encapsulation
- 3.9 Differentiated Services Code Point (DSCP)
- 4 Deploying Windows Server 2012 and Above with SMB Direct
- 5 Driver Configuration
- 6 Performance Tuning
- 7 OpenSM - Subnet Manager
- 8 InfiniBand Fabric
- 8.1 Network Direct Interface
- 8.2 part_man - Virtual IPoIB Port Creation Utility
- 8.3 InfiniBand Fabric Diagnostic Utilities
- 8.3.1 Utilities Usage
- 8.3.2 ibdiagnet
- 8.3.3 ibportstate
- 8.3.4 ibroute
- 8.3.5 ibdump
- 8.3.6 smpquery
- 8.3.7 perfquery
- 8.3.8 ibping
- 8.3.9 ibnetdiscover
- 8.3.10 ibtracert
- 8.3.11 sminfo
- 8.3.12 ibclearerrors
- 8.3.13 ibstat
- 8.3.14 vstat
- 8.3.15 osmtest
- 8.3.16 ibaddr
- 8.3.17 ibcacheedit
- 8.3.18 iblinkinfo
- 8.3.19 ibqueryerrors
- 8.3.20 ibsysstat
- 8.3.21 saquery
- 8.3.22 smpdump
- 8.4 InfiniBand Fabric Performance Utilities
- 8.4.1 ib_read_bw
- 8.4.2 ib_read_lat
- 8.4.3 ib_send_bw
- 8.4.4 ib_send_lat
- 8.4.5 ib_write_bw
- 8.4.6 ib_write_lat
- 8.4.7 ibv_read_bw
- 8.4.8 ibv_read_lat
- 8.4.9 ibv_send_bw
- 8.4.10 ibv_send_lat
- 8.4.11 ibv_write_bw
- 8.4.12 ibv_write_lat
- 8.4.13 nd_write_bw
- 8.4.14 nd_write_lat
- 8.4.15 nd_read_bw
- 8.4.16 nd_read_lat
- 8.4.17 nd_send_bw
- 8.4.18 nd_send_lat
- 8.4.19 NTttcp
- 9 Software Development Kit
- 10 Troubleshooting
- 11 Documentation
- Appendix A: Windows MPI (MS-MPI)
- Appendix B: NVGRE Configuration Script Examples
Rev 4.60
Mellanox Technologies
10 Troubleshooting
10.1 InfiniBand Troubleshooting
Issue 1. The InfiniBand interfaces do not come up after the first reboot following
installation.
Suggestion: To troubleshoot this issue, follow the steps below:
1. Check that the InfiniBand driver is running on all nodes by using “vstat”. The
vstat utility, located at <installation_directory>\tools, displays the status and
capabilities of the network adapter card(s).
2. On the command line, enter “vstat” (use -h for options) to retrieve information
about one or more adapter ports. The port_state field will be one of:
• PORT_DOWN - when there is no InfiniBand cable ("no link");
• PORT_INITIALIZED - when the port is connected to some other port ("physical link");
• PORT_ARMED - when the port is connected to some other port ("physical link") and is in transition to the active state, but is not yet active;
• PORT_ACTIVE - when the port is connected and OpenSM is running ("logical link").
3. Run “sminfo” and verify that OpenSM is running. If OpenSM is not running, see the
OpenSM operating instructions in Section 7, “OpenSM - Subnet Manager”, on page 60.
4. Verify the status of the ports by using vstat. All connected ports should report
the "PORT_ACTIVE" state.
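The port check in steps 2 and 4 can be scripted when many nodes have to be verified. The following is a minimal Python sketch, not part of the WinOF package. It assumes that the text captured from vstat contains one "port=<n>" line per port followed by a "port_state=<STATE>" line; this layout and the function names are assumptions made for illustration, so adjust the patterns to the output of your vstat version.

```python
import re

# Assumed layout (not taken from this manual): each port is introduced by a
# "port=<n>" line and later described by a "port_state=<STATE>" line.
PORT_LINE = re.compile(r"\s*port\s*=\s*(\d+)")
STATE_LINE = re.compile(r"\s*port_state\s*=\s*(PORT_\w+)")


def parse_port_states(vstat_text):
    """Return (port_number, state) pairs in the order they appear."""
    states = []
    current_port = None
    for line in vstat_text.splitlines():
        port_match = PORT_LINE.match(line)
        if port_match:
            current_port = int(port_match.group(1))
            continue
        state_match = STATE_LINE.match(line)
        if state_match:
            states.append((current_port, state_match.group(1)))
    return states


def inactive_ports(vstat_text):
    """Return the ports whose state is anything other than PORT_ACTIVE."""
    return [(port, state)
            for port, state in parse_port_states(vstat_text)
            if state != "PORT_ACTIVE"]


if __name__ == "__main__":
    # Hypothetical sample text; replace it with text captured from vstat.
    sample = "port=1\nport_state=PORT_ACTIVE\nport=2\nport_state=PORT_INITIALIZED\n"
    for port, state in inactive_ports(sample):
        print(f"port {port} is not active: {state}")
```

A port listed as PORT_INITIALIZED or PORT_ARMED has a physical link but no logical link, which usually points back to step 3 (OpenSM not running).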
10.2 Ethernet Troubleshooting
Issue 1. The installation of WinOF VPI fails with the following error message:
"This installation package is not supported by this processor type. Contact your product vendor."
Suggestion: This message is displayed if you downloaded and attempted to install the
wrong driver package for your processor architecture, for example, a 64-bit driver on a
32-bit machine (or vice versa).
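Before downloading a package, the architecture of the target machine can be checked with a short script. The following Python sketch is an illustration only. It assumes that platform.machine() reports "AMD64" on 64-bit x86 Windows and "x86" on 32-bit Windows; the helper names os_bitness and package_matches are hypothetical.

```python
import platform


def os_bitness(machine):
    """Map a platform.machine() string to 32 or 64 (None if unrecognized)."""
    name = machine.lower()
    if name in ("amd64", "x86_64"):
        return 64
    if name in ("x86", "i386", "i686"):
        return 32
    return None  # Unknown architecture: do not guess.


def package_matches(machine, package_bits):
    """Return True if a package built for package_bits fits this machine."""
    return os_bitness(machine) == package_bits


if __name__ == "__main__":
    machine = platform.machine()
    print(f"machine={machine}, 64-bit package fits: {package_matches(machine, 64)}")
```

If the script reports that the 64-bit package does not fit, download the 32-bit driver package instead (or the reverse).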
Issue 2. The performance is low.
Suggestion: This can be due to a non-optimal system configuration. See Section 6,
“Performance Tuning”, to take full advantage of the Mellanox 40/10 GBit NIC performance.
Issue 3. The driver does not start.
Suggestion 1: This can happen due to an RSS configuration mismatch between the TCP stack
and the Mellanox adapter. To confirm this scenario, open the event log and look under
"System" for events from the "mlx4ethX" source. If such events are found, enable RSS as follows:
1. Run the following command: "netsh int tcp set global rss=enabled".
Suggestion 2: This is a less recommended suggestion, and will cause low performance. To
disable RSS on the adapter, run the following command: "netsh int tcp set global rss = no dynamic
balancing".
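To see whether RSS is currently enabled in the TCP stack before changing it, the output of "netsh int tcp show global" can be inspected. The following Python sketch is an illustration only. It assumes that the English-locale output contains a line of the form "Receive-Side Scaling State : <value>"; the function name rss_state and the sample text are hypothetical.

```python
import re

# Assumed line format (English-locale Windows):
#   Receive-Side Scaling State          : enabled
RSS_LINE = re.compile(r"Receive-Side Scaling State\s*:\s*(\S+)", re.IGNORECASE)


def rss_state(netsh_output):
    """Return the RSS state in lowercase, or None if the line is missing."""
    match = RSS_LINE.search(netsh_output)
    return match.group(1).lower() if match else None


if __name__ == "__main__":
    # Hypothetical sample; replace it with text captured from
    # "netsh int tcp show global".
    sample = (
        "TCP Global Parameters\n"
        "----------------------------------------------\n"
        "Receive-Side Scaling State          : disabled\n"
    )
    if rss_state(sample) != "enabled":
        print("RSS is not enabled in the TCP stack; see Suggestion 1.")
```

A result other than "enabled" is consistent with the mismatch described in Suggestion 1.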
Issue 4. The Ethernet driver fails to start. In the Event log, under the mlx4_bus source, the
following error message appears: RUN_FW command failed with error -22