Performance Study of the Four-Socket PowerEdge R940 Server with Intel Skylake Processors
Author: Somanath Moharana, Dell EMC HPC Innovation Lab, August 2017
This blog explores the performance of the four-socket Dell EMC PowerEdge R940 server with
Intel Skylake processors. The latest Dell EMC 14th generation servers support the new Intel®
Xeon® Processor Scalable Family (processor architecture codenamed “Skylake”), and the
increased number of cores and higher memory speed benefit a wide variety of HPC applications.
The PowerEdge R940 is Dell EMC’s latest 4-socket, 3U rack server designed to run complex
workloads. It supports up to 6 TB of DDR4 memory and up to 122 TB of storage. The
system features the Intel® Xeon® Scalable Processor Family, 48 DDR4 DIMMs, up to 13 PCI
Express® (PCIe) 3.0 enabled expansion slots and a choice of embedded NIC technologies. It is a
general-purpose platform capable of handling demanding workloads and applications, such as
data warehouses, ecommerce, databases, and high-performance computing (HPC). Its
increased storage capacity makes the PowerEdge R940 well suited to data-intensive
applications that require more storage.
This blog also describes the impact of BIOS tuning options on HPL, STREAM, and the scientific
applications ANSYS Fluent and WRF, and compares the performance of the new PowerEdge R940
to the previous-generation PowerEdge R930 platform. It also analyses performance with the Sub
NUMA Cluster (SNC) modes (SNC=Enabled and SNC=Disabled). With SNC enabled, a four-socket
PowerEdge R940 exposes eight NUMA nodes to the OS. Each NUMA node can communicate with
seven remote NUMA nodes: six on the other three sockets and one within the same socket.
NUMA domains on different sockets communicate over the UPI interconnect.
For more details on BIOS options, see the “BIOS characteristics of Skylake processor” blog.
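The NUMA node counts described above can be checked with a short enumeration. This is only an illustrative sketch: the `(socket, domain)` labels, the function names, and the assumption of two SNC domains per socket (inferred from eight NUMA nodes on four sockets) are not taken from the study, and the numbering the OS actually assigns may differ.

```python
SOCKETS = 4           # four-socket PowerEdge R940
NODES_PER_SOCKET = 2  # assumed: SNC=Enabled splits each socket into two NUMA domains


def numa_nodes(sockets=SOCKETS, per_socket=NODES_PER_SOCKET):
    """Return hypothetical (socket, domain) labels for every NUMA node."""
    return [(s, d) for s in range(sockets) for d in range(per_socket)]


def classify_peers(node, nodes):
    """Split the other NUMA nodes into same-socket and other-socket peers."""
    same_socket = [n for n in nodes if n != node and n[0] == node[0]]
    other_socket = [n for n in nodes if n[0] != node[0]]  # reached over UPI
    return same_socket, other_socket


nodes = numa_nodes()
same, other = classify_peers(nodes[0], nodes)
print(f"NUMA nodes exposed: {len(nodes)}")
print(f"Remote peers of node {nodes[0]}: {len(same)} same-socket, {len(other)} over UPI")
```

Setting `NODES_PER_SOCKET = 1` models SNC=Disabled under the same assumptions: four NUMA nodes, each with three remote peers, all reached over UPI.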
Table 1 lists the server configuration and the application details used for this study.
Table 1: Details of Server and HPC Applications used for R940 analysis
| Platform | PowerEdge R930 | PowerEdge R930 | PowerEdge R940 |
|---|---|---|---|
| Processor | 4 x Intel Xeon E7-8890 v3 @ 2.5GHz (18 cores), 45MB L3 cache, 165W, Codename=Haswell-EX | 4 x Intel Xeon E7-8890 v4 @ 2.2GHz (24 cores), 60MB L3 cache, 165W, Codename=Broadwell-EX | 4 x Intel Xeon Platinum 8180 @ 2.5GHz, 10.4GT/s (Cross-bar connection), Codename=Skylake |
| Memory | 1024 GB = 64 x 16GB DDR4 @ 1866MHz | 1024 GB = 32 x 32GB DDR4 @ 1866MHz | 384 GB = 24 x 16GB DDR4 @ 2666MT/s |
| CPU Interconnect | Intel QuickPath Interconnect (QPI) 8GT/s | Intel QuickPath Interconnect (QPI) 8GT/s | Intel Ultra Path Interconnect (UPI) |
BIOS Settings
