White Papers

Deep Learning Performance: Scale-up vs Scale-out

Architectures & Technologies Dell EMC | Infrastructure Solutions Group

distributed framework Horovod over IB/GPUDirect RDMA. Figure 32 below shows the scaling efficiency reached by the PowerEdge C4140:

Figure 32: Performance with distributed Horovod TensorFlow, connected by a Mellanox ConnectX-5 network adapter at 100 Gbit/s over IPoIB with GPUDirect RDMA
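
As a reading aid for Figure 32, the sketch below shows one common way to compute scaling efficiency: measured multi-GPU throughput divided by ideal linear scaling of the single-GPU throughput. This definition is an assumption, since this section does not state the formula used. The function name `scaling_efficiency` and the throughput values are hypothetical and are not measurements from this paper.

```python
def scaling_efficiency(throughput_single, throughput_multi, num_gpus):
    """Return measured throughput as a fraction of ideal linear scaling.

    Assumed definition (not taken from the paper):
        efficiency = throughput_multi / (num_gpus * throughput_single)
    """
    if throughput_single <= 0 or num_gpus <= 0:
        raise ValueError("throughput_single and num_gpus must be positive")
    ideal = throughput_single * num_gpus  # perfect linear scaling
    return throughput_multi / ideal


# Hypothetical throughputs in images/sec, for illustration only.
efficiency = scaling_efficiency(
    throughput_single=100.0,
    throughput_multi=360.0,
    num_gpus=4,
)
print(f"Scaling efficiency: {efficiency:.0%}")
```

A value near 1.0 would indicate that adding GPUs across nodes costs little in communication overhead; lower values suggest the interconnect or the all-reduce step is limiting throughput.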