SEL Troubleshooting Guide
System Event Log Troubleshooting Guide for PCSD
Platforms Based on Intel
®
Xeon
®
Processor E5
4600/2600/2400/1600/1400 Product Families Table of Contents
Revision 1.2 Intel order number G90620-003 v
5.2.1 Threshold-based Temperature Sensors ............................................................... 49
5.2.2 Thermal Margin Sensors ...................................................................................... 51
5.2.3 Processor Thermal Control Sensors ..................................................................... 53
5.2.4 Processor DTS Thermal Margin Sensors ............................................................. 54
5.2.5 Discrete Thermal Sensors .................................................................................... 55
5.2.6 DIMM Thermal Trip Sensors ................................................................................ 57
5.3 System Air Flow Monitoring Sensor ...................................................................... 58
6. Processor Subsystem ...................................................................................................... 59
6.1 Processor Status Sensor ...................................................................................... 59
6.2 Catastrophic Error Sensor .................................................................................... 60
6.3 CPU Missing Sensor ............................................................................................ 62
6.3.1 CPU Missing Sensor – Next Steps ....................................................................... 62
6.4 Quick Path Interconnect Sensors ......................................................................... 62
6.4.1 QPI Link Width Reduced Sensor .......................................................................... 63
6.4.2 QPI Correctable Error Sensor .............................................................................. 64
6.4.3 QPI Fatal Error and Fatal Error #2 ........................................................................ 64
6.5 Processor ERR2 Timeout Sensor ......................................................................... 67
6.5.1 Processor ERR2 Timeout – Next Steps................................................................ 68
6.6 Processor MSID Mismatch Sensor ....................................................................... 68
6.6.1 Processor MSID Mismatch Sensor – Next Steps .................................................. 69
7. Memory Subsystem .......................................................................................................... 70
7.1 Memory RAS Configuration Status ....................................................................... 70
7.2 Memory RAS Mode Select ................................................................................... 72
7.3 Mirroring Redundancy State ................................................................................. 73
7.3.1 Mirroring Redundancy State Sensor – Next Steps ............................................... 74
7.4 Sparing Redundancy State ................................................................................... 74
7.4.1 Sparing Redundancy State Sensor – Next Steps ................................................. 76
7.5 ECC and Address Parity ...................................................................................... 76
7.5.1 Memory Correctable and Uncorrectable ECC Error .............................................. 76
7.5.2 Memory Address Parity Error ............................................................................... 78
8. PCI Express* and Legacy PCI Subsystem ...................................................................... 81
8.1 PCI Express* Errors ............................................................................................. 81
8.1.1 Legacy PCI Errors ................................................................................................ 81
8.1.2 PCI Express* Fatal Errors and Fatal Error #2 ....................................................... 82
8.1.3 PCI Express* Correctable Errors .......................................................................... 84
9. System BIOS Events ........................................................................................................ 87
9.1 System Events ..................................................................................................... 87
9.1.1 System Boot ......................................................................................................... 87
9.1.2 Timestamp Clock Synchronization ....................................................................... 87
9.2 System Firmware Progress (Formerly Post Error) ................................................ 89