White Papers

Deep Learning Performance: Scale-up vs Scale-out
Architectures & Technologies Dell EMC | Infrastructure Solutions Group
distributed framework Horovod over IB/GPUDirect-RDMA. Figure 32 below shows the scaling
efficiency reached by the PowerEdge C4140:
Figure 32: Performance with distributed Horovod TensorFlow, connected by Mellanox ConnectX-5
network adapters at 100 Gbit/s over IPoIB, and GPUDirect RDMA
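
To make the term concrete, scaling efficiency is commonly computed as the measured multi-GPU throughput divided by the ideal linear throughput (number of GPUs times the single-GPU throughput). The sketch below assumes that definition; the function name `scaling_efficiency` and the throughput values are hypothetical and are not taken from Figure 32.

```python
def scaling_efficiency(throughput_n, throughput_1, n_gpus):
    """Return measured throughput as a fraction of ideal linear scaling.

    throughput_n: measured throughput on n_gpus GPUs (e.g. images/sec)
    throughput_1: measured throughput on a single GPU (same unit)
    n_gpus:       total number of GPUs across all nodes
    """
    if n_gpus < 1:
        raise ValueError("n_gpus must be at least 1")
    if throughput_1 <= 0:
        raise ValueError("throughput_1 must be positive")
    ideal = n_gpus * throughput_1  # perfect linear scaling
    return throughput_n / ideal


# Hypothetical illustration only: these numbers are NOT measurements
# from the white paper.
single_gpu = 100.0   # images/sec on 1 GPU (assumed)
eight_gpus = 720.0   # images/sec on 8 GPUs (assumed)
efficiency = scaling_efficiency(eight_gpus, single_gpu, 8)
print(f"Scaling efficiency: {efficiency:.1%}")
```

A value of 1.0 corresponds to perfect linear scaling; values below 1.0 reflect communication and synchronization overhead between GPUs and nodes.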