White Papers

4 Retail Analytics with Malong RetailAI® on DELL EMC PowerEdge servers
Figure 2. Inference Flow.
Why NVIDIA T4 GPU?
The NVIDIA® Tesla® T4 is single-slot, low profile, PCIE Express Gen3 Deep learning accelerator card based
on the TU104 NVIDIA graphics processing unit (GPU). The NVIDIA T4 has 16GB GDDR6 memory and a
70W maximum power limit. It is a passively cooled board.
Tesla T4 is powered by NVIDIA Turing Tensor Cores to accelerate inference, video transcoding and
virtual desktops. Turing Tensor Core technology with multi-precision computing for AI powers
breakthrough performance from FP32 to FP16 to INT8, as well as INT4 precisions. It delivers up to 9.3X
higher performance than CPUs on training and up to 36X on inference.
Figure 3: NVIDIA Tesla T4 GPU
Dell EMC PowerEdge R7425 Server
Dell EMC PowerEdge R7425-T4-16GB server supports the latest GPU accelerator to speed results in data
analytics and AI applications, it enables fast workload performance on more cores for cutting edge
applications such Artificial Intelligence (AI), High Performance Computing (HPC), and scale up software
defined deployments. See Figure 4.