White Papers
20 Dell HPC Omni-Path Fabric: Supported Architecture and Application Study June 2016
5 Performance Benchmarking Results
5.1 Latency
OSU Micro-benchmarks were used to determine latency. These latency tests were done in Ping-Pong
fashion. HPC applications need low latency and high throughput. As seen in the graph below, the back to
back latency is 0.77µs and switch latency is 0.9µs which is on par with industry standards.
Figure 10 OSU Latency values based on Intel
®
Xeon
®
CPU E5-2697 v4 processor
5.2 Bandwidth
Figure 11 shows the OSU uni-directional and bi-directional bandwidth results with OpenMPI-1.10-hfi
version. At 4MB uni-directional bandwidth is around 12.3 GB/s and bi-directional bandwidth is around 24.3
GB/s which is on par with the theoretical peak values.
0.9 0.9 0.9 0.9 0.9
1.03 1.03 1.03
1.05
1.09
1.16
0.78 0.78
0.77 0.77 0.77
0.9
0.91 0.91
0.93
0.96
1.03
0.6
0.7
0.8
0.9
1
1.1
1.2
0 1 2 4 8 16 32 64 128 256 512
Time(us)
Messages (Bytes)
OSU_Latency
Switch B2B