HP NetServer AA 6200 Solution Release 3.0 Messages
211
Chapter 13 MtcMon Messages
IOPn.MtcMon FRU: Uncorrectable memory error on CEx
Va r ia b le s x – the ID of the CE that has been removed from the Endurance server
Severity Error
Description The specified CE has Error Correction Code (ECC) memory, and the memory controller has
detected but failed to correct one or more double-bit memory errors. This indicates that the
system memory has been corrupted, and the CE has been removed.
Related Messages
IOPn.MtcMon FRU: Uncorrectable platform error on CEx
IOPn.MtcMon Fault Handler status: CEx has reported n memory error(s).
IOPn.MtcMon Fault Handler status: CEx has reported n platform error(s).
Hardware/Software Double-bit memory errors are most likely caused by faulty memory or memory with marginal
timing characteristics. However, they can also be caused by memory controller or other
motherboard problems.
Action • Run any memory diagnostics supplied with your system to verify the correct functioning
of the memory subsystem.
• If the problem can be isolated to a particular memory component (DIMM or SIMM),
replace the memory.
• ECC errors can be caused by marginal timing characteristics of the memory components
(DIMM, SIMM, or memory controller) and can be exacerbated by the addition of more
DIMMs or SIMMs. If your memory slots are fully-populated, try removing memory from
one or more of the slots to see if this alleviates the problem. You must shut down the CE
operating system in order to remove the memory.
• If the problem persists, replace the CE.