Product specifications

Table Of Contents

IB6054601-00 H B-1

B Benchmark Programs

Several MPI performance measurement programs are installed from the

mpi-benchmark RPM. This appendix describes these benchmarks and how to

run them. These programs are based on code from the group of Dr. Dhabaleswar

K. Panda at the Network-Based Computing Laboratory at the Ohio State

University. For more information, see: http://mvapich.cse.ohio-state.edu/

These programs allow you to measure the MPI latency and bandwidth between

two or more nodes in your cluster. Both the executables, and the source for those

executables, are shipped. The executables are shipped in the mpi-benchmark

RPM, and installed under /usr/bin. The source is shipped in the mpi-devel

RPM and installed under /usr/share/mpich/examples/performance.

The following examples are intended to show only the syntax for invoking these

programs and the meaning of the output. They are not representations of actual

InfiniPath performance characteristics.

Benchmark 1: Measuring MPI Latency Between

Two Nodes

In the MPI community, latency for a message of given size is the time difference

between a node program’s calling MPI_Send and the time that the corresponding

MPI_Recv in the receiving node program returns. The term latency, alone without

a qualifying message size, indicates the latency for a message of size zero. This

latency represents the minimum overhead for sending messages, due to both

software overhead and delays in the electronics of the fabric. To simplify the

timing measurement, latencies are usually measured with a ping-pong method,

timing a round-trip and dividing by two.

The program osu_latency, from Ohio State University, measures the latency for

a range of messages sizes from 0 to 4 megabytes. It uses a ping-pong method, in

which the rank zero process initiates a series of sends and the rank one process

echoes them back, using the blocking MPI send and receive calls for all

operations. Half the time interval observed by the rank zero process for each

exchange is a measure of the latency for messages of that size, as previously