Administrator Guide

7 RAPIDS Scaling on Dell EMC PowerEdge Servers

Figure 3. Average ranking of the ML algorithms.

Source: Nvidia/ https://arxiv.org/pdf/1708.05070.pdf

1.2 Dask for Parallel Computing

Dask is a distributed computation scheduler built to scale Python workloads from laptops to supercomputer

clusters. It takes advantage of hardware advancements using a communications framework called

OpenUCX to scale up and out with cuDF (Native integration with Dask + cuDF):

• For intranode data movement, utilizing NVLink and PCIe peer-to-peer communications

• For internode data movement, utilizing GPU RDMA over InfiniBand and RoCE

Figure 4. Dask for Parallel Computing. Source: Nvidia