User Manual
Table Of Contents
- Mellanox WinOF VPI User Manual
- Table of Contents
- List of Tables
- Document Revision History
- About this Manual
- 1 Introduction
- 2 Firmware Upgrade
- 3 Driver Features
- 3.1 Hyper-V with VMQ
- 3.2 Header Data Split
- 3.3 Receive Side Scaling (RSS)
- 3.4 Port Configuration
- 3.5 Load Balancing, Fail-Over (LBFO) and VLAN
- 3.6 Ports TX Arbitration
- 3.7 RDMA over Converged Ethernet (RoCE)
- 3.8 Network Virtualization using Generic Routing Encapsulation
- 3.9 Differentiated Services Code Point (DSCP)
- 4 Deploying Windows Server 2012 and Above with SMB Direct
- 5 Driver Configuration
- 6 Performance Tuning
- 7 OpenSM - Subnet Manager
- 8 InfiniBand Fabric
- 8.1 Network Direct Interface
- 8.2 part_man - Virtual IPoIB Port Creation Utility
- 8.3 InfiniBand Fabric Diagnostic Utilities
- 8.3.1 Utilities Usage
- 8.3.2 ibdiagnet
- 8.3.3 ibportstate
- 8.3.4 ibroute
- 8.3.5 ibdump
- 8.3.6 smpquery
- 8.3.7 perfquery
- 8.3.8 ibping
- 8.3.9 ibnetdiscover
- 8.3.10 ibtracert
- 8.3.11 sminfo
- 8.3.12 ibclearerrors
- 8.3.13 ibstat
- 8.3.14 vstat
- 8.3.15 osmtest
- 8.3.16 ibaddr
- 8.3.17 ibcacheedit
- 8.3.18 iblinkinfo
- 8.3.19 ibqueryerrors
- 8.3.20 ibsysstat
- 8.3.21 saquery
- 8.3.22 smpdump
- 8.4 InfiniBand Fabric Performance Utilities
- 8.4.1 ib_read_bw
- 8.4.2 ib_read_lat
- 8.4.3 ib_send_bw
- 8.4.4 ib_send_lat
- 8.4.5 ib_write_bw
- 8.4.6 ib_write_lat
- 8.4.7 ibv_read_bw
- 8.4.8 ibv_read_lat
- 8.4.9 ibv_send_bw
- 8.4.10 ibv_send_lat
- 8.4.11 ibv_write_bw
- 8.4.12 ibv_write_lat
- 8.4.13 nd_write_bw
- 8.4.14 nd_write_lat
- 8.4.15 nd_read_bw
- 8.4.16 nd_read_lat
- 8.4.17 nd_send_bw
- 8.4.18 nd_send_lat
- 8.4.19 NTttcp
- 9 Software Development Kit
- 10 Troubleshooting
- 11 Documentation
- Appendix A: Windows MPI (MS-MPI)
- Appendix B: NVGRE Configuration Scripts Examples
Rev 4.60
Mellanox Technologies
8.4.2 ib_read_lat
ib_read_lat calculates the latency of an RDMA read operation of message_size between a pair of
machines. One acts as a server and the other as a client. They perform a ping-pong benchmark in
which one side RDMA-reads the memory of the other side only after the other side has read its
memory. Each side samples the CPU clock each time it reads the other side's memory, in order to
calculate latency. RDMA read is available only in RC connection mode (as specified in the IB spec).
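The clock-sampling step described above can be illustrated with a short sketch. This is not the tool's implementation; it only assumes that latency is derived from the difference between consecutive clock samples, and that cycles are converted to microseconds using the CPU frequency. The function name latency_summary, its parameters, and the sample values are hypothetical.

```python
from statistics import median


def latency_summary(clock_samples, cpu_mhz, report_cycles=False, sort_results=True):
    """Summarize per-iteration latency from successive CPU clock samples.

    clock_samples: cycle-counter values, one taken per completed read
                   (hypothetical input, not produced by ib_read_lat itself).
    cpu_mhz:       CPU frequency in MHz, i.e. cycles per microsecond.
    """
    if len(clock_samples) < 2:
        raise ValueError("need at least 2 samples to form one latency delta")

    # Assumption: one latency value is the gap between two consecutive samples.
    deltas = [later - earlier
              for earlier, later in zip(clock_samples, clock_samples[1:])]

    if not report_cycles:
        # cycles / (cycles per microsecond) = microseconds
        deltas = [d / cpu_mhz for d in deltas]

    if sort_results:
        deltas = sorted(deltas)

    return {
        "min": min(deltas),
        "typical": median(deltas),  # median used here as the "typical" value
        "max": max(deltas),
        "all": deltas,
    }


if __name__ == "__main__":
    samples = [0, 2000, 4500, 6500, 9500]  # made-up cycle counts
    print(latency_summary(samples, cpu_mhz=1000.0))
```

The report_cycles and sort_results switches loosely mirror the idea behind the -C and -U flags (cycle units, unsorted output); the exact statistics the utility prints may differ.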
8.4.2.1 ib_read_lat Synopsis
ib_read_lat [-i(b_port) ib_port] [-m(tu) mtu_size] [-s(ize) message_size] [-t(x-depth) tx_size]
[-n iteration_num] [-p(ort) PDT_port] [-o(uts) outstanding reads] [-a(ll)] [-V(ersion)]
[-C report cycles] [-H report histogram] [-U report unsorted]
8.4.2.2 ib_read_lat Options
The table below lists the various flags of the command.
Table 33 - ib_read_lat Flags and Options
Flag                                 Description
-p, --port=<port>                    Listens on/connects to port <port> (default 18515)
-d, --ib-dev=<dev>                   Uses IB device <device guid> (default first device found)
-i, --ib-port=<port>                 Uses port <port> of IB device (default 1)
-m, --mtu=<mtu>                      The MTU size (default 1024)
-o, --outs=<num>                     The number of outstanding read/atomic operations (default 4)
-s, --size=<size>                    The size of message to exchange (default 65536)
-a, --all                            Runs sizes from 2 to 2^23
-t, --tx-depth=<dep>                 The size of the tx queue (default 100)
-n, --iters=<iters>                  The number of exchanges (at least 2, default 1000)
-C, --report-cycles                  Reports times in CPU cycle units (default microseconds)
-H, --report-histogram               Prints out all results (default print summary only)
-U, --report-unsorted (implies -H)   Prints out unsorted results (default sorted)
-V, --version                        Displays version number
Table 32 - ib_read_bw Flags and Options (continued)
Flag                                 Description
-b, --bidirectional                  Measures bidirectional bandwidth (default unidirectional)
-V, --version                        Displays version number
-g, --grh                            Uses GRH with packets (mandatory for RoCE)
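For ib_read_lat, the long-form flags of Table 33 can be assembled programmatically, for example when scripting repeated runs. The sketch below is a hypothetical helper (build_ib_read_lat_args is not part of WinOF) that only builds an argument list from the flags and defaults listed in Table 33 and enforces the documented minimum of 2 iterations. It does not launch the utility, and it leaves out how the client identifies the server, since the table does not cover that.

```python
def build_ib_read_lat_args(size=65536, iters=1000, mtu=1024, outs=4,
                           tx_depth=100, ib_port=1, port=18515,
                           all_sizes=False, histogram=False, unsorted=False):
    """Build an ib_read_lat argument list from the flags in Table 33.

    Hypothetical helper: defaults are copied from the table, nothing is executed.
    """
    if iters < 2:
        # Table 33: the number of exchanges must be at least 2.
        raise ValueError("--iters must be at least 2")

    args = [
        "ib_read_lat",
        f"--port={port}",
        f"--ib-port={ib_port}",
        f"--mtu={mtu}",
        f"--outs={outs}",
        f"--tx-depth={tx_depth}",
        f"--iters={iters}",
    ]

    if all_sizes:
        # --all sweeps the message sizes, so no single --size is passed.
        args.append("--all")
    else:
        args.append(f"--size={size}")

    if unsorted:
        # Table 33: --report-unsorted implies the histogram output.
        args.append("--report-unsorted")
    elif histogram:
        args.append("--report-histogram")

    return args


if __name__ == "__main__":
    print(" ".join(build_ib_read_lat_args(size=4096, iters=5000)))
```

Keeping the result as a list rather than a single string makes it straightforward to hand to a process launcher later without quoting issues.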