White Papers

Deep Learning Performance: Scale-up vs Scale-out

Architectures & Technologies Dell EMC | Infrastructure Solutions Group

Figure 36: Relative speed performance based on training time

After training the PowerEdge C4140 – Configuration M with SXM2 in multi-node configuration,

we saw it reached the fastest training in 5.3 hours, overpassing the Non-Dell EMC SN_8x-V100-

16GB-SXM2 which completed the training time in 6.6 hours. See Figure 36

7.3.1 Elapsed Training Time for Several Models

Another aspect we wanted to explore was the accuracy convergence capacity for other models,

so we selected models with different depth network topology (vg199, ResNet50, and Inception-

v4) and ran the long tests on PowerEdge C4140 in multi-node configuration and non-Dell EMC 8x

– V100 SXM2. The results are show in Figure 37 below.