White Papers

Dell - Internal Use - Confidential
Conclusions and Future Work
In this blog, we presented and analyzed the performance of different applications on Dell PowerEdge
C4130 servers with P100-PCIe GPUs. In all of the tested applications, HPL, GROMACS and ANSYS
Mechanical benefit from the balanced CPU-GPU configuration in configuration G, because they do not
require P2P access among GPUs. However, LAMMPS, HOOMD-blue, Amber (and possibly RELION) rely on
P2P accesses. Therefore, with configuration G, they scale well up to 2 P100 GPUs, then scale weakly with
4 or more P100 GPUs. But with Configuration B, they scale better than G with 4 GPUs, so configuration B
is more suitable and recommended for applications implemented with P2P accesses.
In the future work, we will run these applications on P100-SXM2 and compare the performance difference
between P100-PCIe and P100-SXM2.