White Papers

Deep Learning Performance: Scale-up vs Scale-out
Architectures & Technologies Dell EMC | Infrastructure Solutions Group
41
Figure 36: Relative speed performance based on training time
After training the PowerEdge C4140 Configuration M with SXM2 in multi-node configuration,
we saw it reached the fastest training in 5.3 hours, overpassing the Non-Dell EMC SN_8x-V100-
16GB-SXM2 which completed the training time in 6.6 hours. See Figure 36
7.3.1 Elapsed Training Time for Several Models
Another aspect we wanted to explore was the accuracy convergence capacity for other models,
so we selected models with different depth network topology (vg199, ResNet50, and Inception-
v4) and ran the long tests on PowerEdge C4140 in multi-node configuration and non-Dell EMC 8x
V100 SXM2. The results are show in Figure 37 below.