Administrator Guide

6 Deep Learning Performance Scale-Out
Figure 1: Servers Logical Design. Source: Image adapted from
https://community.mellanox.com/docs/DOC-2971
Error! Reference source not found. 2 below shows how PowerEdge C4140-M is connected
via InifniBand fabric for multi-node testing.
Figure 2: Using Mellanox CX5 InfiniBand adapter to connect PowerEdge C4140 in multi-node
configuration
PowerEdge C4140-M Details
The Dell EMC PowerEdge C4140, an accelerator-optimized, high density 1U rack server, is used
as the compute node unit in this solution. The PowerEdge C4140 can support four NVIDIA V100
Tensor Core GPUs, both the V100-SXM2 (with high speed NVIDIA NVLink interconnect) as well
as the V100-PCIe models.
Figure 3: PowerEdge C4140 Server
The Dell EMC PowerEdge C4140 supports NVIDIA V100 with NVLink in topology ‘M’ with a high
bandwidth host to GPU communication is one of the most advantageous topologies for deep
learning. Most of the competitive systems, supporting either a 4-way, 8-way or 16-way NVIDIA