White Papers

Abaqus Performance

17 Dell EMC Ready Solution for HPC Digital Manufacturing—Dassault Systѐmes’ Simulia Abaqus Performance

These results demonstrate that while the performance increase is not linear with respect to the number of

cores per node, there can be a substantial benefit in using system with large numbers of cores, and typically

the best performance is obtained when using all of the cores available. The exception for these cases is the

modal analysis model S3D, which is only thread parallel. The MPI mode is required to take full advantage of

several cores per server.

Figure 8 shows the parallel performance improvement when running a single job in parallel across multiple

nodes. These benchmarks were carried out on a cluster with four 6252 based servers using all available

cores in each server.

The parallel speedup when running jobs across more than a single node is somewhat mixed. These datasets

are rather small by current production standards, and do not represent typical production sized simulations.

However, these models do show significant speedup across two nodes, and modest speedups up to four

nodes.

Many FEA applications such as Abaqus, have been modified to allow for GPU acceleration with appropriate

GPU enabled servers. Figure 9 contains performance information for the Abaqus standard benchmarks

models described above run on a Dell EMC PowerEdge C4140 server, equipped with dual Intel Xeon Gold

20-core 6148 processors and four NVIDIA V100 GPUs.

1.25

1.5

1.75

2.25

2.5

1-node 2-node 4-node

Performance (relative to a single node)

Figure 8: Multi Server Parallel Performance

S4B S4D S6 E1 E3 E5