White Papers
Figure 1: Performance for Intel Broadwell processors with WRF
Figure 1 compares the performance of five Broadwell processors on the small and large WRF benchmark datasets.
WRF was compiled in “sm + dm” mode (shared memory plus distributed memory, that is, hybrid OpenMP and MPI). The
combinations of MPI processes and OpenMP threads that were used are listed in Table 2.
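As a rough illustration of what a hybrid “sm + dm” run has to choose, the sketch below lists every way to split a node's cores into MPI processes and OpenMP threads so that their product equals the core count. The function name `hybrid_layouts` and the value of 24 cores are assumptions made for this example only; they are not taken from Table 2, and the combinations actually benchmarked may differ.

```python
def hybrid_layouts(total_cores):
    """Return (mpi_processes, omp_threads) pairs whose product is total_cores."""
    return [
        (procs, total_cores // procs)
        for procs in range(1, total_cores + 1)
        if total_cores % procs == 0
    ]


# Hypothetical node size, chosen only for illustration.
layouts = hybrid_layouts(24)
for procs, threads in layouts:
    print(f"{procs:2d} MPI processes x {threads:2d} OpenMP threads")
```

Each pair keeps every core busy without oversubscription; which pair runs fastest depends on the dataset and the memory layout of the node, so it is normally found by measurement.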
The “X” value on top of each bar in the graph shows the performance relative to the 12-core Broadwell processor, which
is set as the baseline (1.0). For the small dataset, CONUS12km, the top-bin processor performs 26% better than the
12-core processor. For the larger CONUS2.5km dataset, the improvement rises to as much as 30%, because the larger
problem can make more efficient use of a larger number of cores. The performance increase from 20 to 22 cores is less
significant because memory bandwidth per core is lower, as explained in the STREAM results of the first blog.
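The relative values described above can be reproduced with a few lines of Python. This is a minimal sketch that assumes a higher-is-better performance score for each processor. The helper `relative_performance` and the scores are hypothetical: the numbers are placeholders chosen only so that the ratio matches the 26% figure quoted in the text, and they are not measurements read from Figure 1.

```python
def relative_performance(scores, baseline):
    """Divide every score by the baseline score (higher is better)."""
    base = scores[baseline]
    return {name: round(value / base, 2) for name, value in scores.items()}


# Placeholder scores, NOT measured data; the baseline is the 12-core part.
scores = {"12-core": 100.0, "22-core": 126.0}
rel = relative_performance(scores, baseline="12-core")
for name, ratio in rel.items():
    print(f"{name}: {ratio}X")
```

If the benchmark reports time per step instead of a rate, the ratio has to be inverted (baseline time divided by measured time) so that larger values still mean better performance.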