
Comparing Haswell processor models for HPC applications
Garima Kochhar, September 2014
This blog evaluates four Haswell processor models (Intel® Xeon® E5-2600 v3 Product Family), comparing
them for performance and energy efficiency on HPC applications. This is part three of a three-part series.
Blog one provided HPC results and performance comparisons across server generations, comparing Ivy
Bridge (E5-2600 v2), Sandy Bridge (E5-2600) and Westmere (X5600) to Haswell. The second blog
discussed the performance and energy efficiency implications of BIOS tuning options available on the
new Dell Haswell servers.
In this study we evaluate processor models with different core counts, CPU frequencies and Thermal
Design Power (TDP) ratings and analyze the differences in performance and power. Focusing on HPC
applications, we ran two benchmarks and four applications on our server. The server in question is part
of Dell’s PowerEdge 13th generation (13G) server line-up. These servers support DDR4 memory at up to
2133 MT/s and Intel’s latest E5-2600 v3 series processors (architecture code-named Haswell). Haswell is
a new micro-architecture compared to the previous-generation Sandy Bridge/Ivy Bridge.
Haswell-based processors use a 22nm process technology, so there’s no process shrink this time around.
Note the “v3” in the Intel product name: that is what distinguishes a processor as one based on the
Haswell micro-architecture. You’ll recall that “E5-2600 v2” processors are based on the Ivy Bridge
micro-architecture, and plain E5-2600 series processors with no explicit version are Sandy Bridge based.
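The naming convention above can be captured in a small lookup. This is only a sketch: the function name `e5_2600_microarch` and the model strings in the usage note are illustrative and are not necessarily the parts tested in this study. It covers only the three generations named in the text.

```python
import re

# Version suffix -> micro-architecture, as described in the text.
# None stands for "no explicit version" (the original E5-2600 series).
MICROARCH_BY_VERSION = {
    None: "Sandy Bridge",
    "v2": "Ivy Bridge",
    "v3": "Haswell",
}


def e5_2600_microarch(model: str) -> str:
    """Return the micro-architecture for a model string such as 'E5-2697 v3'."""
    # Accept 'E5-26xx', an optional single-letter suffix, and an optional ' vN'.
    match = re.fullmatch(r"E5-26\d\d[A-Z]?(?:\s+(v\d))?", model.strip())
    if match is None:
        raise ValueError(f"not an E5-2600 family model name: {model!r}")
    version = match.group(1)
    if version not in MICROARCH_BY_VERSION:
        raise ValueError(f"version {version!r} is not covered by this sketch")
    return MICROARCH_BY_VERSION[version]
```

For example, `e5_2600_microarch("E5-2697 v3")` returns `"Haswell"`, while a model string with no version suffix maps to `"Sandy Bridge"`.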
Haswell processors require a new server/new motherboard and DDR4 memory. The platform we used is
a standard dual-socket rack server with two Haswell-EP based processors. Each socket has four memory
channels and can support up to 3 DIMMs per channel (DPC).
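The platform numbers above translate into a DIMM slot count and a theoretical peak memory bandwidth. The sketch below is a back-of-the-envelope calculation, not a measurement. It assumes a 64-bit (8-byte) data bus per DDR4 channel and assumes every channel actually runs at 2133 MT/s, which the text gives only as an upper limit. The function names are made up for illustration.

```python
# Platform parameters taken from the text.
SOCKETS = 2
CHANNELS_PER_SOCKET = 4
DIMMS_PER_CHANNEL = 3
TRANSFER_RATE_MT_S = 2133  # "up to" rate; real configurations may run slower

# Assumption: each DDR4 channel moves 8 bytes (64 bits) per transfer.
BYTES_PER_TRANSFER = 8


def max_dimm_slots(sockets=SOCKETS, channels=CHANNELS_PER_SOCKET,
                   dpc=DIMMS_PER_CHANNEL):
    """Maximum number of DIMMs the platform can hold."""
    return sockets * channels * dpc


def peak_bandwidth_gb_s(sockets=SOCKETS, channels=CHANNELS_PER_SOCKET,
                        mt_s=TRANSFER_RATE_MT_S, width=BYTES_PER_TRANSFER):
    """Theoretical peak memory bandwidth in GB/s (10^9 bytes per second)."""
    return sockets * channels * mt_s * 1e6 * width / 1e9


if __name__ == "__main__":
    print("DIMM slots:", max_dimm_slots())
    print("Peak bandwidth (GB/s):", peak_bandwidth_gb_s())
```

Under these assumptions the server has 2 × 4 × 3 = 24 DIMM slots and a theoretical peak of roughly 136.5 GB/s across both sockets. Measured Stream results will be lower than this ceiling.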
Configuration
Table 1 below details the applications we used and Table 2 describes the test configuration on the new
13G server.
Table 1 - Applications and benchmarks
Application    Domain                              Version            Benchmark
-------------  ----------------------------------  -----------------  ---------------------------------
Stream         Memory bandwidth                    v5.9               Triad
HPL            Computation - solve a dense         From Intel MKL     Problem size 90% of total memory
               system of linear equations
Ansys Fluent   Computational fluid dynamics        v15.0              truck_poly_14m
LS-DYNA        Finite element analysis             v7_0_0_79069       car2car with endtime=0.02
WRF            Weather Research and Forecasting    v3.5.1             Conus 2.5km
MILC           Quantum chromodynamics              v7.7.3, v7.7.11    Input data file from Intel
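As a rough illustration of what "problem size 90% of total memory" means for HPL, the sketch below estimates the matrix dimension N for a given amount of memory. It assumes an N × N matrix of 8-byte double-precision values. The function name, the 128 GiB memory size in the usage note and the block size of 192 are hypothetical values chosen for illustration; they are not the settings used in this study.

```python
import math

BYTES_PER_DOUBLE = 8  # HPL works on double-precision matrix elements


def hpl_problem_size(total_memory_gib, memory_fraction=0.9, block_size=192):
    """Estimate the HPL matrix dimension N.

    The N x N matrix of doubles should fill about `memory_fraction` of
    memory, so N is roughly sqrt(fraction * bytes / 8). The result is
    rounded down to a multiple of `block_size` (a hypothetical NB value).
    """
    usable_bytes = total_memory_gib * 2**30 * memory_fraction
    n = int(math.sqrt(usable_bytes / BYTES_PER_DOUBLE))
    return n - n % block_size


if __name__ == "__main__":
    # Hypothetical server with 128 GiB of memory.
    print("Estimated N:", hpl_problem_size(128))
```

Rounding N down to a multiple of the block size is a common convenience rather than a requirement, and it keeps the matrix within the chosen memory budget.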
