Deep Learning Performance: Scale-up vs Scale-out
Architectures & Technologies Dell EMC | Infrastructure Solutions Group
Figure 3 shows GPU performance at single precision, and Figure 4 shows GPU performance at
half precision. Most deep learning frameworks and models take advantage of half precision:
each value takes half the memory of a single-precision value, so they can work with larger
datasets in the available memory. It is important to look at the raw FLOPS numbers for a GPU,
since we want to extract the same level of performance once that GPU is installed in a system.
Figure 3 NVIDIA GPU Performance - Single precision
[7]
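
The two points in the paragraph above (half precision fits more data in the same memory, and raw FLOPS serve as the reference for in-system performance) can be illustrated with a little arithmetic. The following is a minimal sketch, not taken from the paper: the helper names `elements_that_fit` and `fraction_of_peak`, the 16 GiB memory budget, and the throughput figures are all hypothetical values chosen for illustration, not measurements of any specific GPU.

```python
# Bytes needed to store one value at each precision (IEEE 754 sizes).
BYTES_PER_VALUE = {"fp32": 4, "fp16": 2}


def elements_that_fit(memory_bytes: int, precision: str) -> int:
    """Return how many values of the given precision fit in memory_bytes."""
    return memory_bytes // BYTES_PER_VALUE[precision]


def fraction_of_peak(measured_tflops: float, peak_tflops: float) -> float:
    """Return the share of the GPU's raw peak throughput achieved in a system."""
    if peak_tflops <= 0:
        raise ValueError("peak_tflops must be positive")
    return measured_tflops / peak_tflops


# Hypothetical memory budget of 16 GiB (an assumption, not a quoted spec).
memory_budget = 16 * 1024**3

fp32_values = elements_that_fit(memory_budget, "fp32")
fp16_values = elements_that_fit(memory_budget, "fp16")
print(f"FP32 values that fit: {fp32_values}")
print(f"FP16 values that fit: {fp16_values}")
print(f"Capacity ratio FP16/FP32: {fp16_values / fp32_values}")

# Hypothetical throughput numbers, used only to show the comparison.
efficiency = fraction_of_peak(measured_tflops=12.0, peak_tflops=15.0)
print(f"Fraction of raw peak achieved: {efficiency:.2f}")
```

In this sketch the same memory budget holds twice as many half-precision values as single-precision values, and the second helper expresses how close a system gets to the raw FLOPS figure of the GPU it hosts.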