White Papers

Deep Learning Performance: Scale-up vs Scale-out

Architectures & Technologies Dell EMC | Infrastructure Solutions Group

Figure 3 shows GPU performance at single precision, and Figure 4 shows GPU performance at half precision. Most Deep Learning frameworks and models take advantage of half precision: each value occupies half the memory of a single-precision value, so larger datasets fit in the available memory. It is important to look at the raw FLOPS numbers for a GPU, since we want to extract the same level of performance when that GPU is put into a system.

Figure 3 NVIDIA GPU Performance - Single precision

[7]
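
The memory argument for half precision can be illustrated with a small calculation. The following is a minimal sketch, not a measurement: it assumes IEEE 754 storage sizes (4 bytes for single precision, 2 bytes for half precision) and a hypothetical 16 GiB memory budget chosen only for illustration. The helper name `values_that_fit` is likewise an assumption and does not come from any framework.

```python
import struct

# Bytes per value, taken from Python's struct format codes:
# "f" is IEEE 754 single precision, "e" is IEEE 754 half precision.
BYTES_FP32 = struct.calcsize("f")
BYTES_FP16 = struct.calcsize("e")


def values_that_fit(memory_bytes: int, bytes_per_value: int) -> int:
    """Return how many values of the given width fit in memory_bytes."""
    return memory_bytes // bytes_per_value


# Hypothetical memory budget of 16 GiB, chosen only for illustration.
MEMORY_BYTES = 16 * 1024**3

fp32_values = values_that_fit(MEMORY_BYTES, BYTES_FP32)
fp16_values = values_that_fit(MEMORY_BYTES, BYTES_FP16)

print(f"FP32 values that fit: {fp32_values:,}")
print(f"FP16 values that fit: {fp16_values:,}")
print(f"Ratio: {fp16_values / fp32_values:.1f}x")
```

In practice GPU memory also holds model weights, activations, and framework workspace, so the usable gain for a given workload may be smaller than the raw ratio suggests.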