Dell OpenManage Server Administrator Version 6.
Notes and Cautions NOTE: A NOTE indicates important information that helps you make better use of your computer. CAUTION: A CAUTION indicates potential damage to hardware or loss of data if instructions are not followed. ____________________ Information in this document is subject to change without notice. © 2011 Dell Inc. All rights reserved. Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc. is strictly forbidden.
Contents 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . What’s New in this Release . . . . . . . . . . . . . . . . Sample Event Message Text 8 . . . . . . . . . . . . . 8 . . . . . . . . . . . . Viewing Alerts and Event Messages . . . . . . . . . . Logging Messages to a Unicode Text File . . . . . Viewing Events in Windows Server 2003 and Windows Server 2008 . . . . . . . . . . . . Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server . . . . .
Chassis Intrusion Messages . . . . . . . . . . . . . . 35 Redundancy Unit Messages . . . . . . . . . . . . . . . 38 . . . . . . . . . . . . . . . . 42 Power Supply Messages Memory Device Messages Fan Enclosure Messages . . . . . . . . . . . . . . . 46 . . . . . . . . . . . . . . . . 47 AC Power Cord Messages . . . . . . . . . . . . . . . . Hardware Log Sensor Messages . . . . . . . . . . . . 50 Processor Sensor Messages . . . . . . . . . . . . . . 52 Pluggable Device Messages . . . .
Voltage Sensor Events Fan Sensor Events . . . . . . . . . . . . . . . . . 222 . . . . . . . . . . . . . . . . . . . . 223 . . . . . . . . . . . . . . . . 225 . . . . . . . . . . . . . . . . . . 226 . . . . . . . . . . . . . . . . . . . 229 Processor Status Events Power Supply Events Memory ECC Events . . . . . . . . . . . . . . . . . 229 . . . . . . . . . . . . . . . . . . . . . 230 BMC Watchdog Events Memory Events . . . . . . . . . . . . . . 231 . . . . . . . . . . . . . . . . . . . . .
Contents
1 Introduction Dell OpenManage Server Administrator generates event messages stored primarily in the operating system or Server Administrator event logs and sometimes in Simple Network Management Protocol (SNMP) traps. This document describes the event messages that are created by Server Administrator version 6.5 and displayed in the Server Administrator alert log. Server Administrator creates events in response to sensor status changes and other monitored parameters.
What’s New in this Release No new alerts have been added. The existing alerts 2081, 2347, and 2388 are modified to include additional information. Messages Not Described in This Guide This guide describes only event messages logged by Server Administrator and Storage Management that are displayed in the Server Administrator alert log.
Server Administrator generates events based on status changes in the following sensors: • Temperature Sensor — Helps protect critical components by alerting the systems management console when temperatures become too high inside a chassis; also monitors the temperature in a variety of locations in the chassis and in attached system(s). • Fan Sensor — Monitors fans in various locations in the chassis and in attached system(s).
• Pluggable Device Sensor — Monitors the addition, removal, or configuration errors for some pluggable devices, such as memory cards. • Battery Sensor — Monitors the status of one or more batteries in the system. • SD Card Device Sensor — Monitors instrumented Secure Digital (SD) card devices in the system. Sample Event Message Text The following example shows the format of the event messages logged by Server Administrator.
The location of the event log file depends on the operating system you are using. • On systems running the Microsoft Windows operating systems, event messages are logged in the operating system event log and the Server Administrator event log. The Server Administrator event log file is named dcsys32.xml and is located in the \omsa\log directory. The default install_path is C:\Program Files\Dell\SysMgt.
restart command to restart the Server Administrator Event Manager service and enable the setting. This also restarts the Server Administrator Data Manager and SNMP services. The Server Administrator Unicode text event log file is named dcsys.log where xx is 32 or 64 bit depending on the operating system and is located in the /opt/dell/srvadmin/var/log/ openmanage directory.
Feb 6 14:20:51 server01 Server Administrator: Instrumentation Service EventID: 1000 Server Administrator starting Feb 6 14:20:51 server01 Server Administrator: Instrumentation Service EventID: 1001 Server Administrator startup complete Feb 6 14:21:21 server01 Server Administrator: Instrumentation Service EventID: 1254 Chassis intrusion detected Sensor location: Main chassis intrusion Chassis location: Main System Chassis Previous state was: OK (Normal) Chassis intrusion state: Open Feb 6 14:21:51 server01 S
• Source — The software that logged the event. • Category — The classification of the event by the event source. • Event ID — The number identifying the particular event type. • Description — A description of the event. The format and contents of the event description vary, depending on the event type. Understanding the Event Description Table 1-2 lists in alphabetical order each line item that may appear in the event description. Table 1-2.
Table 1-2.
Table 1-2.
Table 1-2.
Introduction
Server Management Messages 2 The following tables lists in numerical order each event ID and its corresponding description, along with its severity and cause. NOTE: For corrective actions, see the appropriate documentation. Server Administrator General Messages The messages in Table 2-1 indicate that certain alert systems are up and working. Table 2-1. Server Administrator General Messages Event Description ID Severity 0000 Information User cleared the log from Server Administrator.
Table 2-1. Server Administrator General Messages (continued) Event Description ID Severity Cause 1002 A system BIOS update Information The user has chosen to update has been scheduled for the flash basic input/output the next reboot system (BIOS). 1003 A previously scheduled Information The user decides to cancel the system BIOS update has flash BIOS update, or an error been canceled occurs during the flash.
Table 2-1. Server Administrator General Messages (continued) Event Description ID Severity Cause 1007 User initiated host system control action Action requested was: Information User requested a host system control action to reboot, power off, or power cycle the system. Alternatively, the user had indicated protective measures to be initiated in the event of a thermal shutdown. 1008 Systems Management Data Manager Started Information Systems Management Data Manager services were started.
Table 2-1. Server Administrator General Messages (continued) Event Description ID Severity Cause 1013 System Peak Power detected new peak value Peak value (in Watts): Information The system peak power sensor detected a new peak value in power consumption. The new peak value in Watts is provided.
Table 2-2. Temperature Sensor Messages Event Description ID Severity Cause 1050 Temperature sensor has failed Error A temperature sensor on the backplane board, system board, or the carrier in the specified system failed. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Table 2-2. Temperature Sensor Messages (continued) Event Description ID Severity Cause 1052 Temperature sensor returned to a normal value Information A temperature sensor on the backplane board, Sensor location: drive carrier in the Chassis location: returned to a valid range after crossing Previous state was: a failure threshold.
Table 2-2. Temperature Sensor Messages (continued) Event Description ID Severity Cause 1054 Temperature sensor detected a failure value Error A temperature sensor on the backplane board, system board, or drive carrier in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Cooling Device Messages The cooling device sensors listed in Table 2-3 monitor how well a fan is functioning. Cooling device messages provide status and warning information for fans in a particular chassis. Table 2-3. Cooling Device Messages Event Description ID Severity Cause 1100 Fan sensor has failed Error A fan sensor in the specified system is not functioning. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Table 2-3. Cooling Device Messages (continued) Event Description ID Severity 1102 Fan sensor returned to a normal value Information A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Table 2-3. Cooling Device Messages (continued) Event Description ID Severity Cause 1105 Fan sensor detected a non-recoverable value Error A fan sensor detected an error from which it cannot recover. The sensor location, chassis location, previous state, and fan sensor value information is provided.
Voltage Sensor Messages The voltage sensors listed in Table 2-4 monitor the number of volts across critical components. Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis. Table 2-4. Voltage Sensor Messages Event Description ID Severity Cause 1150 Voltage sensor has failed Error A voltage sensor in the specified system failed. The sensor location, chassis location, previous state, and voltage sensor value information is provided.
Table 2-4. Voltage Sensor Messages (continued) Event Description ID Severity 1152 Voltage sensor returned to a normal value Information A voltage sensor in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided.
Table 2-4. Voltage Sensor Messages (continued) Event Description ID Severity Cause 1154 Voltage sensor detected a failure value Error A voltage sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and voltage sensor value information is provided. Error A voltage sensor in the specified system detected an error from which it cannot recover.
Current Sensor Messages The current sensors listed in Table 2-5 measure the amount of current (in amperes) that is traversing critical components. Current sensor messages provide status and warning information for current sensors in a particular chassis. Table 2-5. Current Sensor Messages Event Description ID Severity Cause 1200 Current sensor has failed Error A current sensor in the specified system failed. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1201 Current sensor value unknown Error A current sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal current sensor value information is provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1203 Current sensor detected a warning value Warning A current sensor in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and current sensor value are provided. Error A current sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued) Event Description ID Severity Cause 1205 Current sensor detected a non-recoverable value Error A current sensor in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-6. Chassis Intrusion Messages Event Description ID Severity Cause 1250 Error A chassis intrusion sensor in the specified system failed. The sensor location, chassis location, previous state, and chassis intrusion state are provided. Error A chassis intrusion sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and chassis intrusion state are provided.
Table 2-6. Chassis Intrusion Messages (continued) Event Description ID Severity Cause 1253 Warning A chassis intrusion sensor in the specified system detected that a system cover is currently being opened and the system is operating. The sensor location, chassis location, previous state, and chassis intrusion state information is provided. Critical A chassis intrusion sensor in the specified system detected that the system cover was opened while the system was operating.
Redundancy Unit Messages Redundancy means that a system chassis has more than one of certain critical components. Fans and power supplies, for example, are so important for preventing damage or disruption of a computer system that a chassis may have “extra” fans or power supplies installed. Redundancy allows a second or nth fan to keep the chassis components at a safe temperature when the primary fan has failed. Redundancy is normal when the intended number of critical components are operating.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity 1302 Redundancy not applicable Information A redundancy sensor in the specified system detected that a unit was not redundant. The redundancy location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity 1304 Redundancy regained Information A redundancy sensor in the specified system detected that a “lost” redundancy device has been reconnected or replaced; full redundancy is in effect. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy information is provided.
Table 2-7. Redundancy Unit Messages (continued) Event Description ID Severity Cause 1306 Redundancy lost Error A redundancy sensor in the specified system detected that one of the components in the redundant unit has been disconnected, has failed, or is not present. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy are provided.
Power Supply Messages The power supply sensors monitor how well a power supply is functioning. The power supply messages listed in Table 2-8 provide status and warning information for power supplies present in a particular chassis. Table 2-8. Power Supply Messages Event Description ID Severity Cause 1350 Error A power supply sensor in the specified system failed.
Table 2-8. Power Supply Messages (continued) Event Description ID 1351 Severity Cause Information A power supply sensor in the specified system could not Sensor location:
Table 2-8. Power Supply Messages (continued) Event Description ID Severity Cause 1353 Warning A power supply sensor reading in the specified system exceeded a user-definable warning threshold. The sensor location, chassis location, previous state, power supply type, additional power supply status, and configuration error type information are provided. Error A power supply has been disconnected or has failed.
Table 2-8. Power Supply Messages (continued) Event Description ID 1355 Severity Power supply sensor detected Error a non-recoverable value Sensor location: Chassis location: Previous state was: Power Supply type: Cause A power supply sensor in the specified system detected an error from which it cannot recover.
Memory Device Messages The memory device messages listed in Table 2-9 provide status and warning information for memory modules present in a particular system. Memory devices determine health status by monitoring the ECC memory correction rate and the type of memory events that have occurred. NOTE: A critical status does not always indicate a system failure or loss of data. In some instances, the system has exceeded the ECC correction rate.
Fan Enclosure Messages Some systems are equipped with a protective enclosure for fans. Fan enclosure messages listed in Table 2-10 monitor whether foreign objects are present in an enclosure and how long a fan enclosure is missing from a chassis. Table 2-10. Fan Enclosure Messages Event Description ID Severity Cause 1450 Critical/ Failure / Error The fan enclosure sensor in the specified system failed. The sensor and chassis location information is provided.
Table 2-10. Fan Enclosure Messages (continued) Event Description ID Severity Cause 1454 Error A fan enclosure has been removed from the specified system for a user-definable length of time. The sensor and chassis location information is provided. Error A fan enclosure sensor in the specified system detected an error from which it cannot recover. The sensor and chassis location are provided.
AC Power Cord Messages The AC power cord messages listed in Table 2-11 provide status and warning information for power cords that are part of an AC power switch, if your system supports AC switching. Table 2-11. AC Power Cord Messages Event Description ID Severity 1500 Critical/ An AC power cord sensor in Failure/ Error the specified system failed. The AC power cord status cannot be monitored. The sensor and chassis location information is provided.
Table 2-11. AC Power Cord Messages (continued) Event Description ID Severity Cause 1503 AC power has been lost Critical/ Power supply is disrupted to Sensor location: Failure/ Error the AC power cord or an AC power cord is not transmitting power, but there is sufficient Chassis location: redundancy to classify this as a warning. The sensor and chassis location information is provided.
Table 2-12. Hardware Log Sensor Messages Event Description ID Severity Cause 1550 Warning A hardware log sensor in the specified system is disabled. The log type information is provided. Log monitoring has been disabled Log type: 1551 Log status is unknown Information A hardware log sensor in the specified system could not Log type: obtain a reading. The log type information is provided.
Processor Sensor Messages The processor sensors monitor how well a processor is functioning. Processor messages listed in Table 2-13 provide status and warning information for processors in a particular chassis. Table 2-13. Processor Sensor Messages Event Description ID Severity Cause 1600 Critical/ Failure/ Error A processor sensor in the specified system is not functioning. The sensor location, chassis location, previous state and processor sensor status information is provided.
Table 2-13. Processor Sensor Messages (continued) Event Description ID Severity 1602 Information A processor sensor in the specified system transitioned back to a normal state. The sensor location, chassis location, previous state and processor sensor status are provided.
Table 2-13. Processor Sensor Messages (continued) Event Description ID Severity Cause 1604 Error A processor sensor in the specified system is disabled, has a configuration error, or experienced a thermal trip. The sensor location, chassis location, previous state and processor sensor status are provided. Error A processor sensor in the specified system has failed. The sensor location, chassis location, previous state and processor sensor status are provided.
Pluggable Device Messages The pluggable device messages listed in Table 2-14 provide status and error information when some devices, such as memory cards, are added or removed. Table 2-14. Pluggable Device Messages Event Description ID 1650 Severity Cause Information A pluggable device event message of unknown type was received. The device location, chassis Device location: location, and additional event
Table 2-14. Pluggable Device Messages (continued) Event Description ID Severity 1652 Information A device was removed from the specified system. The device location, chassis location, and additional event details, if available, are provided.
Battery Sensor Messages The battery sensors monitor how well a battery is functioning. The battery messages listed in Table 2-15 provide status and warning information for batteries in a particular chassis. Table 2-15.
Table 2-15. Battery Sensor Messages (continued) Event Description ID 1702 Battery sensor returned to a normal value 1703 Battery sensor detected a warning value Severity Information A battery sensor in the specified system detected that a Sensor Location: back to a normal Chassis Location:
Table 2-15. Battery Sensor Messages (continued) Event Description ID Severity Cause 1705 Error A battery sensor in the specified system could not retrieve a value. The sensor location, chassis location, previous state, and battery sensor status information is provided.
Table 2-16. SD Card Device Messages Event ID Description 1751 Information An SD card device sensor in the specified system could not Sensor location: sensor location, chassis Chassis location: and SD card device type information is Previous state was: provided. The SD card state is provided if an SD card device type: the SD card device.
Table 2-16. SD Card Device Messages Event ID Description Severity Cause 1753 SD card device detected a warning Warning An SD card device sensor in the specified system detected a warning condition. The sensor location, chassis location, previous state, and SD card device type information is provided. The SD card state is provided if an SD card is present in the SD card device. Error An SD card device sensor in the specified system detected an error.
Table 2-16. SD Card Device Messages Event ID Description 1755 SD card device sensor Error detected a non-recoverable value Sensor location: Chassis location: Previous state was: SD card device type: SD card state: 62 Server Management Messages Severity Cause An SD card device sensor in the specified system detected an error from which it cannot recover.
Chassis Management Controller Messages The Alerts sent by Dell M1000e Chassis Management Controller (CMC) are organized by severity. That is, the event ID of the CMC trap indicates the severity (informational, warning, critical, or non-recoverable) of the alert. Each CMC alert includes the originating system name, location, and event message text. The alert message text matches the corresponding Chassis Event Log message text that is logged by the sending CMC for that event. Table 2-17.
Server Management Messages
Storage Management Message Reference 3 The Dell OpenManage Server Administrator Storage Management’s alert or event management features let you monitor the health of storage resources such as controllers, enclosures, physical disks, and virtual disks. Alert Monitoring and Logging The Storage Management Service performs alert monitoring and logging. By default, the Storage Management service starts when the managed system starts up.
Alert Message Format with Substitution Variables When you view an alert in the Server Administrator alert log, the alert identifies the specific components such as the controller name or the virtual disk name to which the alert applies. In an actual operating environment, a storage system can have many combinations of controllers and disks as well as user-defined names for virtual disks and other components. Each environment is unique in its storage configuration and user-defined names.
NOTE: A, B, C and X, Y, Z in the following examples are variables representing the storage object name or number. Table 3-2. Message Format with Variables for Each Storage Object Storage Object Message Variables Controller Message Format: Controller A (Name) Message Format: Controller A For example, 2326 A foreign configuration has been detected: Controller 1 (PERC 5/E Adapter) NOTE: The controller name is not always displayed.
Table 3-2. Message Format with Variables for Each Storage Object (continued) Storage Object Message Variables SAS Power Supply Message Format: Power Supply X Controller A, Connector B, Enclosure C For example, 2312 A power supply in the enclosure has an AC failure: Power Supply 1, Controller 1, Connector 0, Enclosure 2 SCSI Temperature Probe Message Format: Temperature Probe X Controller A, Connector B, Target ID C where C is the SCSI ID number of the EMM managing the temperature probe.
Alert Message Change History The following table describes the changes made to the Storage Management alerts from the previous release of Storage Management to the current release. Table 3-3. Alert Message Change History Storage Management 3.5 Product Versions to which changes apply Storage Management 3.5.0 Server Administrator 4.5.0 Dell OpenManage 6.5.0 New Alerts None Deleted Alerts None Modified Alerts 2388, 2347, 2081 Storage Management 3.
Table 3-3. Alert Message Change History (continued) Storage Management 3.2 Product Versions to which changes apply Storage Management 3.2.0 Server Administrator 4.2.0 Dell OpenManage 6.2.0 New Alerts 2387, 2388, 2389, 2390, 2392, 2393 Deleted Alerts None Modified Alerts None Alert Descriptions and Corrective Actions The following sections describe alerts generated by the RAID or SCSI controllers supported by Storage Management.
Table 3-4. Storage Management Messages Event ID Description Severity Cause and Action 2048 Device failed Critical / Cause: A storage Failure / Error component such as a physical disk or an enclosure has failed. The failed component may have been identified by the controller while performing a task such as a rescan or a check consistency. Action: Replace the failed component. You can identify which disk has failed by locating the disk that has a red “X” for its status.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2049 Physical disk removed Warning / Non-critical Cause: A physical disk has been removed from the disk group. This alert can also be caused by loose or defective cables or by problems with the enclosure. Clear Alert Number: 2052.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2050 Physical disk offline Warning / Non-critical Cause: A physical disk in the disk group is offline. The user may have manually put the physical disk offline. Clear Alert Number: 2158. 903 Action: Perform a rescan. You can also select the offline disk and perform a Make Online operation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2052 Physical disk inserted OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes. None Action: None Related Alert Information SNMP Trap Numbers 901 Related Alert Number: 2065, 2305, 2367 LRA Number: None 2053 Virtual disk created OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2056 Virtual disk failed Critical / Cause: One or more Failure / Error physical disks included in the virtual disk have failed. If the virtual disk is non-redundant (does not use mirrored or parity data), then the failure of a single physical disk can cause the virtual disk to fail.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2057 Virtual disk degraded Warning / Non-critical Cause 1: This alert message occurs when a physical disk included in a redundant virtual disk fails. Because the virtual disk is redundant (uses mirrored or parity information) and only one physical disk has failed, the virtual disk can be rebuilt.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Cause 2: A physical disk in the disk group has been removed. 2057 contd. Action 2: If a physical disk was removed from the disk group, either replace the disk or restore the original disk. You can identify which disk has been removed by locating the disk that has a red “X” for its status. Perform a rescan after replacing the disk.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2060 Copy of data OK / Normal Cause: This alert is for started on /Informationa informational purposes. physical disk 1 l Action: None from physical disk 2.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2063 Virtual disk OK / Normal / Cause: This alert is for Clear Alert 1201 reconfiguratio Informational informational purposes. Number: n started 2090. Action: None Related Alert Number: None LRA Number: None 2064 Virtual disk OK / Normal / Cause: This alert is for Clear Alert rebuild started Informational informational purposes. Number: 2091.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2067 Virtual disk check consistency cancelled OK / Normal / Cause: The check Informational consistency operation was cancelled because a physical disk in the array has failed or because a user cancelled the check consistency operation. Action: If the physical disk failed, then replace the physical disk. You can identify which disk failed by locating the disk that has a red “X” for its status.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2070 Virtual disk initialization cancelled OK / Normal / Cause: The virtual disk Informational initialization cancelled because a physical disk included in the virtual disk has failed or because a user cancelled the virtual disk initialization. Action: If a physical disk failed, then replace the physical disk. You can identify which disk has failed by locating the disk that has a red “X” for its status.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2075 Copy of data completed on physical disk %2 from physical disk %1 OK / Normal / Cause: This alert is Clear Alert Informational provided for Number: informational purposes. None Action: None Related Alert Information SNMP Trap Numbers 901 Related Alert Number: 2060.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2077 Virtual disk format failed Critical / Cause: A physical disk Failure / Error included in the virtual disk failed. Action: Replace the failed physical disk. You can identify which physical disk has failed by locating the disk that has a red X for its status. Rebuild the physical disk. When finished, restart the virtual disk format operation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2081 Virtual disk Critical / Hardware RAID: reconfiguratio Failure / Error Cause: A physical disk n failed included in the virtual disk has failed or is corrupt. A user may also have cancelled the reconfiguration. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that dispalys a red X in the status field.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2081 Virtual disk Critical / Software RAID: contd. reconfiguratio Failure / Error • Perform a backup n failed with the Verify option. Related Alert Information SNMP Trap Numbers Clear Alert Number: None 1204 Related Alert Number: • If the file backup fails, try to restore the None failed file from a LRA Number: previous backup.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2083 Physical disk rebuild failed Critical / Cause: A physical disk Failure / Error included in the virtual disk has failed or is corrupt. A user may also have cancelled the rebuild. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that has a red “X” for its status. Rebuild the virtual disk rebuild.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2087 Copy of data resumed from physical disk %2 to physical disk %1 OK / Normal / Cause: This alert is for Clear Alert Informational informational purposes. Status: None Virtual disk initialization completed OK / Normal / Cause: This alert is for Clear Alert 1201 Informational informational purposes. Status: Alert 2088 is a clear Action: None alert for alerts 2061 and 2136.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2090 Virtual disk OK / Normal / Cause: This alert is for Clear Alert 1201 reconfiguration Informational informational purposes. Status: Alert 2090 is a clear completed Action: None alert for alert 2063.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2094 Predictive Failure reported. Warning / Non-critical Cause: The physical disk is predicted to fail. Many physical disks contain Self Monitoring Analysis and Reporting Technology (SMART). When enabled, SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk.
Table 3-4. Storage Management Messages (continued) Event ID Description 2094 cond. Severity Cause and Action Related Alert Information SNMP Trap Numbers If this disk is a hot spare, then unassign the hot spare; perform the Prepare to Remove task on the disk; replace the disk; and assign the new disk as a hot spare. CAUTION: If this disk is part of a nonredundant disk, back up your data immediately. If the disk fails, you cannot recover the data. 2095 SCSI sense data.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2099 Global hot spare unassigned OK / Normal / Cause: A user has Informational unassigned a physical disk as a global hot spare. This alert is for informational purposes. Action: None 2100 Temperature exceeded the maximum warning threshold Warning / Non-critical Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature.
Table 3-4. Storage Management Messages (continued) Event ID Description 2101 Temperature Warning / dropped below Non-critical the minimum warning threshold 2102 Temperature exceeded the maximum failure threshold Severity Cause and Action Related Alert Information SNMP Trap Numbers Cause: The physical disk enclosure is too cool. Clear Alert Number: 2353. 1053 Action: Check if the thermostat setting is too low and if the room temperature is too cool.
Table 3-4. Storage Management Messages (continued) Event ID Description 2103 Temperature Critical / Cause: The physical dropped below Failure / Error disk enclosure is too the minimum cool. failure Action: Check if the threshold thermostat setting is too low and if the room temperature is too cool.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2106 SMART FPT exceeded Warning / Non-critical Cause: A disk on the specified controller has received a SMART alert (predictive failure) indicating that the disk is likely to fail in the near future. Clear Alert Number: None 903 Related Alert Number: None LRA Number: Action: Replace the 2070 disk that has received the SMART alert.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2107 SMART configuration change Critical / Cause: A disk has Failure / Error received a SMART alert (predictive failure) after a configuration change. The disk is likely to fail in the near future. Related Alert Information SNMP Trap Numbers Clear Alert Number: None 904 Related Alert Number: None Action: Replace the LRA Number: disk that has received 2071 the SMART alert.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2108 SMART warning Warning / Non-critical Cause: A disk has received a SMART alert (predictive failure). The disk is likely to fail in the near future. Clear Alert Number: None 903 Related Alert Number: None Action: Replace the disk that has received LRA Number: the SMART alert.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2109 SMART warning temperature Warning / Non-critical Cause: A disk has reached an unacceptable temperature and received a SMART alert (predictive failure). The disk is likely to fail in the near future. Clear Alert Number: None 903 Action 1: Determine why the physical disk has reached an unacceptable temperature.
Table 3-4. Storage Management Messages (continued) Event ID 2109 contd Description Severity Cause and Action Make sure the enclosure has enough ventilation and that the room temperature is not too hot. See the physical disk enclosure documentation for more diagnostic information. Action 2: If you cannot identify why the disk has reached an unacceptable temperature, then replace the disk. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2110 SMART warning degraded Warning / Non-critical Cause: A disk is degraded and has received a SMART alert (predictive failure). The disk is likely to fail in the near future. Clear Alert Number: None 903 Related Alert Number: None Action: Replace the LRA Number: disk that has received 2070 the SMART alert.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2112 Enclosure was Critical / Cause: The physical shut down Failure / Error disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range. Related Alert Information SNMP Trap Numbers Clear Alert Number: None 854 Related Alert Number: None Action: Check for LRA Number: factors that may cause 2091 overheating or excessive cooling.
Table 3-4. Storage Management Messages (continued) Event ID Description 2114 A consistency OK / Normal / check on a Informational virtual disk has been paused (suspended) 2115 Severity A consistency OK / Normal / check on a Informational virtual disk has been resumed Cause and Action Related Alert Information SNMP Trap Numbers Cause: The check consistency operation on a virtual disk was paused by a user. Clear Alert Number: 2115.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2116 A virtual disk OK / Normal / Cause: A user has caused and its mirror Informational a mirrored virtual disk to have been split be split. When a virtual disk is mirrored, its data is copied to another virtual disk in order to maintain redundancy. After being split, both virtual disks retain a copy of the data although the mirror is no longer intact.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2118 Change write policy OK / Normal / Cause: A user has Informational changed the write policy for a virtual disk. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2121 Device returned to normal OK / Normal / Cause: A device that Informational was previously in an error state has returned to a normal state. For example, if an enclosure became too hot and subsequently cooled down, you may receive this alert. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2122 Redundancy degraded Warning / Non-critical Cause: One or more of Clear Alert 1305 the enclosure Status: 2124. components has failed. Related Alert Number: 2048 For example, a fan or power supply may have LRA Number: failed. Although the 2090 enclosure is currently operational, the failure of additional components could cause the enclosure to fail.
Table 3-4. Storage Management Messages (continued) Event ID 2122 contd. Description Severity Cause and Action The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. See the enclosure documentation for information on replacing enclosure components and for other diagnostic information.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2123 Redundancy lost Warning / Non-critical Cause: A virtual disk or an enclosure has lost data redundancy. In the case of a virtual disk, one or more physical disks included in the virtual disk have failed. Due to the failed physical disk or disks, the virtual disk is no longer maintaining redundant (mirrored or parity) data.
Table 3-4. Storage Management Messages (continued) Event ID 2123 contd. Description Severity Cause and Action The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component. Click the controller that displays a Warning or Failed status. This action displays the controller Health subtab which displays the status of the individual controller components.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2124 Redundancy normal OK / Normal / Cause: Data Informational redundancy has been restored to a virtual disk or an enclosure that previously suffered a loss of redundancy. This alert is for informational purposes. Action: None Related Alert Information SNMP Trap Numbers Clear Alert 1304 Number: Alert 2124 is a clear alert for alerts 2122 and 2123.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2126 SCSI sense Warning / sector reassign Non-critical Cause and Action Related Alert Information SNMP Trap Numbers Cause: A sector of the physical disk is corrupted and data cannot be maintained on this portion of the disk. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2127 Background initialization (BGI) started OK / Normal / Cause: BGI of a virtual Informational disk has started. This alert is for informational purposes. Action: None Related Alert Information SNMP Trap Numbers Clear Alert Status: 2130. 1201 Related Alert Number: None LRA Number: None 2128 BGI cancelled OK / Normal / Cause: BGI of a virtual Informational disk has been cancelled.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2131 Firmware version mismatch Warning / Non-critical Cause: The firmware on Clear Alert the controller is not a Number: None supported version. Action: Install a supported version of the firmware. If you do not have a supported version of the firmware available, you can download it from support.dell.com or check with your support provider for information on how to obtain the most current firmware.
Table 3-4. Storage Management Messages (continued) Event ID Description 2135 Array Manager Warning / is installed on Non-critical the system NOTE: This is not supported on Dell OpenManage Server Administrator version 6.0.1. 2136 Virtual disk initialization Severity Cause and Action Related Alert Information SNMP Trap Numbers Cause: Storage Management has been installed on a system that has an Array Manager installation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2137 Communication timeout Warning / Non-critical Cause: The controller is unable to communicate with an enclosure. There are several reasons why communication may be lost. For example, there may be a bad or loose cable. An unusual amount of I/O may also interrupt communication with the enclosure.
Table 3-4. Storage Management Messages (continued) Event ID 2137 contd. 2138 Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Clear Alert Number: None 851 Action: Check for problems with the cables. See the online help for more information on checking the cables. You should also check to see if the enclosure has degraded or failed components. To do so, select the enclosure object in the tree view and click the Health subtab.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2139 Enclosure OK / Normal / Cause: A user has alarm disabled Informational disabled the enclosure alarm.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2142 Controller rebuild rate has changed OK / Normal / Cause: A user has Informational changed the controller rebuild rate. This alert is for informational purposes. Action: None Related Alert Information SNMP Trap Numbers Clear Alert Number: None 751 Related Alert Number: None LRA Number: None 2143 Controller OK / Normal / Cause: A user has alarm enabled Informational enabled the controller alarm.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2146 Bad block replacement error Warning / Non-critical Cause: A portion of a physical disk is damaged. Clear Alert: None 753 Action: See the Dell OpenManage Server Administrator Storage Management online help for more information. 2147 Bad block sense error Warning / Non-critical Cause: A portion of a physical disk is damaged.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2149 Bad block Warning / extended sense Non-critical error Cause and Action Related Alert Information SNMP Trap Numbers Cause: A portion of a physical disk is damaged. Clear Alert: None 753 Action: See the Dell OpenManage Server Administrator Storage Management online help for more information. 2150 Bad block extended medium error Warning / Non-critical Cause: A portion of a physical disk is damaged.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2153 Service tag changed OK / Normal / Cause: An enclosure Informational service tag was changed. In most circumstances, this service tag should only be changed by Dell support or your service provider. Related Alert Information SNMP Trap Numbers Clear Alert: None 851 Related Alert: None LRA Number: None Action: Ensure that the tag was changed under authorized circumstances.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2157 Controller OK / Normal / Cause: A user has reset configuration Informational the controller has been reset configuration. See the online help for more information. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2159 Virtual disk renamed OK / Normal / Cause: A user has Informational renamed a virtual disk. When renaming a virtual disk on a PERC 3/SC, 3/DCL, 3/DC, 3/QC, 4/SC, 4/DC, 4e/DC, 4/Di, CERC ATA100/4ch, PERC 5/E, PERC 5/i or SAS 5/iR controller, this alert displays the new virtual disk name.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2163 Rebuild completed with errors Cause and Action Related Alert Information SNMP Trap Numbers Critical / Cause: You might be Failure / Error attempting a RAID configuration that is not supported by the controller. Clear Alert: None 904 Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2165 The RAID Warning / controller Non-critical firmware and driver validation was not performed. The configuration file cannot be opened. Cause and Action Related Alert Information SNMP Trap Numbers Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers. This situation may occur for a variety of reasons.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2167 The current Warning / kernel version Non-critical and the non-RAID SCSI driver version are older than the minimum required levels. See readme.txt for a list of validated kernel and driver versions. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The version of the kernel and the driver do not meet the minimum requirements.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2168 The non-RAID Warning / SCSI driver Non-critical version is older than the minimum required level. See readme.txt for the validated driver version. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The version of the driver does not meet the minimum requirements.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2170 The controller OK / Normal / Cause: This alert is for Clear Alert: 1151 battery charge Informational informational purposes. None level is normal. Action: None Related Alert: None LRA Number: None 2171 The controller Warning / battery Non-critical temperature is above normal.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2173 Unsupported configuration detected. The SCSI rates of the enclosure management modules (EMMs) are not the same. EMM0%1 EMM1%2 Warning / Non-critical Cause: The EMMs in the enclosure have a different SCSI rate. This is an unsupported configuration. All EMMs in the enclosure should have the same SCSI rate.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2174 The controller Warning / battery has Non-critical been removed. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The controller cannot communicate with the battery. The battery may be removed, or the contact point between the controller and the battery may be burnt or corroded.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2176 The controller OK / Normal / Cause: This alert is for battery Learn Informational informational purposes. cycle has Action: None started. Related Alert Information SNMP Trap Numbers Clear Alert Number: 2177. 1151 Related Alert: None LRA Number: None 2177 The controller OK / Normal / Cause: This alert is for battery Learn Informational informational purposes. cycle has Action: None completed.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2178 The controller Warning / battery Learn Non-critical cycle has timed out. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The controller battery must be fully charged before the Learn cycle can begin. The battery may be unable to maintain a full charge causing the Learn cycle to timeout.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2180 The controller OK / Normal / battery Learn Informational cycle will start in %1 days. Cause and Action Related Alert Information SNMP Trap Numbers Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2183 Replace member operation failed on physical disk %1 904 Cause: The physical Clear Alert: Critical / Failure / Error disk being replaced has None failed. Related Alert Action: None Number: 2060. Replace member operation cancelled on physical disk OK / Normal / Cause: User cancelled Informational the replace member operation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2186 The controller Warning / cache has been Non-critical discarded. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The controller has flushed the cache and any data in the cache has been lost. This may happen if the system has memory or battery problems that cause the controller to distrust the cache.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2188 The controller OK / Normal / write policy Informational has been changed to Write Through. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The controller battery is unable to maintain cached data for the required period of time. For example, if the required period of time is 24 hours, the battery is unable to maintain cached data for 24 hours.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2190 The controller OK / Normal / has detected a Informational hot-plugged enclosure. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The SAS controller with firmware version 6.1 or later has detected a hot-plugged enclosure. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2192 The virtual disk Check Consistency has made corrections. OK / Normal / Cause: The virtual disk Informational Check Consistency has identified errors and made corrections. For example, the Check Consistency may have encountered a bad disk block and remapped the disk block to restore data consistency. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2195 Dedicated hot OK / Normal / Cause: This alert is for spare assigned. Informational informational purposes. Physical disk None %1 Related Alert Information SNMP Trap Numbers Clear Alert Number: 2196. 1201 Related Alert: None LRA Number: None 2196 Dedicated hot OK / Normal / Cause: This alert is for spare Informational informational purposes. unassigned.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2199 The virtual disk cache policy has changed. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2201 A global hot spare failed. Warning / Non-critical Cause: The controller is not able to communicate with a disk that is assigned as a dedicated hot spare. The disk may have been removed. There may also be a bad or loose cable.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2203 A dedicated hot spare failed. Warning / Non-critical Cause: The controller is unable to communicate with a disk that is assigned as a dedicated hot spare. The disk may have failed or been removed. There may also be a bad or loose cable.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2205 A dedicated hot spare has been automatically unassigned. OK / Normal / Cause: The hot spare is Informational no longer required because the virtual disk it was assigned to has been deleted. None Related Alert Information SNMP Trap Numbers Clear Alert: None 901 Related Alert Number: 2098, 2161, 2196 LRA Number: None 2210 2211 Battery Warning / requires Non-critical reconditioning.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2212 The controller OK / Normal / Cause: This alert is for battery Informational informational purposes. temperature is Action: None above normal.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2217 The battery OK / Normal / Cause: This alert is for learn mode has Informational informational purposes. changed to Action: None warn. Related Alert Information SNMP Trap Numbers Clear Alert: None 1151 Related Alert: None LRA Number: None 2218 2219 144 None of the Controller Property are changed. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description 2220 2221 2222 Related Alert Information SNMP Trap Numbers Allow OK / Normal / Cause: This alert is for Revertible Hot Informational informational purposes. Spare and Action: None Replace Member, Auto Replace Member operation on Predictive Failure, and Load balance changed. Clear Alert: None 751 Auto Replace OK / Normal / Cause: This alert is for Member Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description 2223 2224 2225 Severity Cause and Action Related Alert Information SNMP Trap Numbers Abort Check OK / Normal / Consistency on Informational Error, Allow Revertible Hot Spare and Replace Member, and Load balance changed. Cause: This alert is generated due to user initiated change in controller properties. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description 2227 2228 2229 Severity Cause and Action Related Alert Information SNMP Trap Numbers Abort Check OK / Normal / Consistency on Informational Error, Allow Revertible Hot Spare and Replace Member, and Auto Replace Member Operation on Predictive Failure changed. Cause: This alert is generated due to user initiated change in controller properties. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2230 Auto Replace Member operation on Predictive Failure changed. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes. None 2231 2232 Cause and Action Action: None Related Alert Information SNMP Trap Numbers 751 Related Alert: None LRA Number: None Allow OK / Normal / Cause: This alert is for Revertible Hot Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2235 The Check Consistency rate has changed. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes. None Action: None Related Alert Information SNMP Trap Numbers 751 Related Alert: None LRA Number: None 2236 2237 Allow OK / Normal / Cause: This alert is for Revertible Hot Informational informational purposes. Spare and Action: None Replace Member property changed.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2240 A foreign configuration has been imported. OK / Normal / Cause: The user has Informational attempted to import a foreign configuration. This alert is for informational purposes. Action: None 2241 The Patrol OK / Normal / Cause: The controller Read mode has Informational has changed the patrol changed. read mode. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2244 A virtual disk OK / Normal / Cause: This alert is for Clear Alert: 1201 blink has been Informational informational purposes. None initiated. Action: None Related Alert: None LRA Number: None 2245 A virtual disk blink has ceased. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2247 1151 The controller OK / Normal / Cause: This alert is for Clear Alert battery is Informational informational purposes. Number: 2358. charging. Action: None Related Alert: None LRA Number: None 2248 The controller OK / Normal / Cause: This alert is for battery is Informational informational purposes. executing a Action: None Learn cycle.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2251 The physical disk blink has initiated. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes. None Action: None Related Alert Information SNMP Trap Numbers 901 Related Alert: None LRA Number: None 2252 The physical disk blink has ceased. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2255 The physical disk has been started. OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2260 An enclosure blink has ceased OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes. None None Related Alert Information SNMP Trap Numbers 851 Related Alert: None LRA Number: None 2261 A global rescan OK / Normal / Cause: This alert is for Clear Alert: has initiated. Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information 2264 A device is missing. Warning / Non-critical Cause: The controller cannot communicate with a device. The device may be removed. There may also be a bad or loose cable. Clear Alert: None Action: Check if the device is in and not removed. If it is in, check the cables. You should also check the connection to the controller battery and the battery health.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information 2265 A device is in an unknown state. Warning / Non-critical Cause: The controller cannot communicate with a device. The state of the device cannot be determined. There may be a bad or loose cable. The system may also be experiencing problems with the application programming interface (API). There could also be a problem with the driver or firmware.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2266 Controller log OK / Normal / Cause: The %1 file entry: %1 Informational indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description 2268 %1, Storage Cause: Storage Critical / Management Failure / Error Management has lost has lost communication with a communicatio controller. This may n with the conoccur if the controller troller. An driver or firmware is immediate experiencing a problem. reboot is The %1 indicates a strongly substitution variable. recommended The text for this to avoid substitution variable is further displayed with the alert problems.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2270 The physical disk Clear operation failed. Critical / Cause: A Clear task was Failure / Error being performed on a physical disk but the task was interrupted and did not complete successfully. The controller may have lost communication with the disk. The disk may have been removed or the cables may be loose or defective.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2272 Patrol Read found an uncorrectable media error. Critical / Cause: The Patrol Read Failure / Error task has encountered an error that cannot be corrected. There may be a bad disk block that cannot be remapped. Action: Back up your data. If you are able to back up the data successfully, then fully initialize the disk and then restore from back up.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2273 A block on the Critical / Cause: The controller physical disk Failure / Error encountered an has been unrecoverable medium punctured by error when attempting the controller. to read a block on the physical disk and marked that block as invalid.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2276 The dedicated Warning / hot spare is too Non-critical small. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The dedicated hot spare is not large enough to protect all virtual disks that reside on the disk group. Clear Alert: None 903 Action: Assign a larger disk as the dedicated hot spare. 2277 The global hot Warning / spare is too Non-critical small.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2278 The controller OK / Normal / battery charge Informational level is below a normal threshold. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The battery is discharging. A battery discharge is a normal activity during the battery Learn cycle. Before completing, the battery Learn cycle recharges the battery. You should receive alert 2179 when the recharge occurs.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2280 A disk media OK / Normal / Cause: A disk media error has been Informational error was detected corrected. while the controller was completing a background task. A bad disk block was identified. The disk block has been remapped. Related Alert Information SNMP Trap Numbers Clear Alert: None 1201 Related Alert: None LRA Number: None Action: Consider replacing the disk.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2282 Hot spare SMART polling failed. Critical / Cause: The controller Failure / Error firmware attempted a SMART polling on the hot spare but was unable to complete it. The controller has lost communication with the hot spare. Related Alert Information SNMP Trap Numbers Clear Alert: None 904 Related Alert: None LRA Number: 2071 Action: Check the health of the disk assigned as a hot spare.
Table 3-4. Storage Management Messages (continued) Event ID Description 2286 A Learn cycle OK / Normal / Cause: This alert is for start is pending Informational informational purposes. while the Action: None battery charges. 2287 Protection policy for %1 has changed. Severity Cause and Action Related Alert Information SNMP Trap Numbers Clear Alert: None 1151 Related Alert: None LRA Number: None OK / Normal / Cause: This alert is for Clear Alert: Informational informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2289 Multi-bit ECC Critical / Cause: An error error. Failure / Error involving multiple bits has been encountered during a read or write operation. The error correction algorithm recalculates parity data during read and write operations. If an error involves only a single bit, it may be possible for the error correction algorithm to correct the error and maintain parity data.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2290 Single-bit ECC error. Warning / Non-critical Cause: An error involving a single bit has been encountered during a read or write operation. The error correction algorithm has corrected this error. Clear Alert: None 753 Related Alert: None LRA Number: 2060 Action: None 2291 An EMM has been discovered.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2293 The EMM has Critical / Cause: The failure may failed. Failure / Error be caused by a loss of power to the EMM. The EMM self test may also have identified a failure. There could also be a firmware problem or a multi-bit error. Related Alert Information SNMP Trap Numbers Clear Alert: None 854 and 954 Related Alert: None LRA Number: 2091 Action: Replace the EMM.
Table 3-4. Storage Management Messages (continued) Event ID Description 2297 An EMM has Critical / Cause: An EMM has been removed. Failure / Error been removed. 2298 Severity There is a bad Warning / sensor on an Non-critical enclosure. Cause and Action Related Alert Information SNMP Trap Numbers Clear Alert: None 954 Action: Reinsert the EMM. See the hardware documentatio n for information on replacing the EMM. Related Alert: None Cause: The enclosure has a bad sensor.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2299 Bad PHY %1 Critical / Cause: There is a Failure / Error problem with a physical connection or PHY. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation. Action: Contact Dell technical support.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2300 The enclosure Critical / Cause: The controller is is unstable. Failure / Error not receiving a consistent response from the enclosure. There could be a firmware problem or an invalid cabling configuration. If the cables are too long, they degrade the signal.
Table 3-4. Storage Management Messages (continued) Event ID Description 2301 Severity Cause and Action Related Alert Information SNMP Trap Numbers The enclosure Critical / Cause: The enclosure or has a hardware Failure / Error an enclosure error. component is in a Failed or Degraded state. Clear Alert: None 854 The enclosure Critical / Cause: The enclosure or is not Failure / Error an enclosure responding. component is in a Failed or Degraded state.
Table 3-4. Storage Management Messages (continued) Event ID Description 2304 An attempt to OK / Normal / Cause: This alert is for hot plug an Informational informational purposes. EMM has been Action: None detected. This type of hot plug is not supported. 2305 The physical disk is too small to be used for a rebuild. Severity Warning / Non-critical Cause and Action Cause: The physical disk is too small to rebuild the data.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2306 Bad block table Warning / is 80% full. Non-critical Cause and Action Related Alert Information SNMP Trap Numbers Cause: The bad block table is used for remapping bad disk blocks. This table fills, as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped, and disk errors can no longer be corrected. At this point, data loss can occur. The bad block table is now 80% full.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2307 Bad block table Critical / Cause: The bad block is full. Unable Failure / Error table is used for to log block %1 remapping bad disk blocks. This table fills, as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped and disk errors can no longer be corrected. At this point, data loss can occur. The %1 indicates a substitution variable.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2309 A physical disk Warning / is Non-critical incompatible. Cause and Action Related Alert Information SNMP Trap Numbers Cause: You have attempted to replace a disk with another disk that is using an incompatible technology. For example, you may have replaced one side of a mirror with a SAS disk when the other side of the mirror is using SATA technology.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2311 The firmware Warning / on the EMMs Non-critical is not the same version. EMM0 %1 EMM1 %2 Cause and Action Related Alert Information SNMP Trap Numbers Cause: The firmware on the EMM modules is not the same version. It is required that both modules have the same version of the firmware. This alert may be caused if you attempt to insert an EMM module that has a different firmware version than an existing module.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2313 A power supply Warning / in the Non-critical enclosure has a DC failure. Cause and Action Related Alert Information Cause: The power Clear Alert supply has a DC failure. Number: 2323. Action: Replace the power supply. SNMP Trap Numbers 1003 Related Alert Number: 2122, 2322.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2315 Diagnostic message %1 OK / Normal / Cause: The %1 Informational indicates a substitution variable. The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description 2318 2319 182 Severity Cause and Action Related Alert Information SNMP Trap Numbers Problems with Warning / the battery or Non-critical the battery charger have been detected. The battery health is poor. Cause: The battery or the battery charger is not functioning properly. Clear Alert: None 1154 Action: Replace the battery pack. LRA Number: 2100 Single-bit ECC error. The DIMM is degrading.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2320 Single-bit ECC error. The DIMM is critically degraded. Single-bit ECC error. The DIMM is critically degraded. There will be no further reporting. 2321 Cause and Action Related Alert Information SNMP Trap Numbers Cause: The DIMM is Critical / Failure / Error malfunctioning. Data loss or data corruption may be imminent. Clear Alert: None 754 Cause: The DIMM is Critical / Failure / Error malfunctioning.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2322 The DC power Critical / Cause: The power supply is Failure / Error supply unit is switched switched off. off. Either a user switched off the power supply unit or it is defective. Related Alert Information SNMP Trap Numbers Clear Alert Number: 2323. 1004 Related Alert: None LRA Number: Action: Check if the 2091 power switch is turned off. If it is turned off, turn it on.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2324 The AC power Critical / Cause: The power cable supply cable Failure / Error may be pulled out has been or removed. The power removed. cable may also have overheated and become warped and nonfunctional. Action: Replace the power cable. 2325 The power supply cable has been inserted. Related Alert Information SNMP Trap Numbers Clear Alert Number: 2325.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2326 A foreign configuration has been detected. OK / Normal / Cause: This alert is for Informational informational purposes. The controller has physical disks that were moved from another controller. These physical disks contain virtual disks that were created on the other controller.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2327 The NVRAM has corrupted data. The controller is reinitializing the NVRAM. Warning / Non-critical Cause: The nonvolatile random access memory (NVRAM) is corrupt. This may occur after a power surge, a battery failure, or for other reasons. The controller is reinitializing the NVRAM.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2329 SAS port report: %1 Warning / Non-critical Cause: The text for this alert is generated by the controller and can vary depending on the situation. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2330 SAS port report: %1 OK / Normal / Cause: The %1 Informational indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text can vary depending on the situation. This alert is for informational purposes.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2332 A controller OK / Normal / Cause: This alert is for Clear Alert: 751 hot plug has Informational informational purposes. None been detected. Action: None Related Alert: None LRA Number: None 2334 Controller event log: %1 OK / Normal / Cause: The %1 Informational indicates a substitution variable.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2335 Controller event log: %1 Warning / Non-critical Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2336 Controller event log: %1 Critical / Cause: The %1 Failure / Error indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2337 The controller is Critical / Cause: The controller unable to Failure / Error was unable to recover recover cached data from the cache. data from the This may occur when battery backup the system is without unit (BBU). power for an extended period of time when the battery is discharged.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2340 The BGI com- Critical / Cause: The BGI task pleted with Failure / Error encountered errors that uncorrectable cannot be corrected. errors. The virtual disk contains physical disks that have unusable disk space or disk errors that cannot be corrected.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2342 The Check Consistency found inconsistent parity data. Data redundancy may be lost. Warning / Non-critical Cause: The data on a source disk and the redundant data on a target disk is inconsistent. Clear Alert: None 1203 The Check Consistency logging of inconsistent parity data is disabled.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2346 Error occurred: Warning / %1 Non-critical Cause and Action Related Alert Information SNMP Trap Numbers Cause: A physical device may have an error. The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2347 The rebuild Hardware RAID: Critical / failed due to Failure / Error Cause: You are errors on the attempting to rebuild source physical data that resides on a disk. defective disk. Action: Replace the source disk and restore from backup.
Table 3-4. Storage Management Messages (continued) Event ID Description 2348 2349 Related Alert Information SNMP Trap Numbers The rebuild Cause: You are Critical / failed due to Failure / Error attempting to rebuild errors on the data on a disk that is target physical defective. disk. Action: Replace the target disk. If a rebuild does not automatically start after replacing the disk, initiate the Rebuild task. You may need to assign the new disk as a hot spare to initiate the rebuild.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2351 901 A physical disk OK / Normal / Cause: This alert is for Clear Alert is marked as Informational informational purposes. Number: 2352. missing. Action: None. Related Alert: None LRA Number: None 2352 A physical disk OK / Normal / Cause: This alert is for that was Informational informational purposes. marked as Action: None. missing has been replaced.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2356 SAS SMP Critical / Cause: The %1 communicatio Failure / Error indicates a substitution ns error %1 variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation. The reference to SMP in this text refers to SAS Management Protocol. Action: There may be a SAS topology error.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2357 SAS expander error: %1 Critical / Cause: The %1 Failure / Error indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the alert log. This text can vary depending on the situation.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2359 The physical disk is not certified. Warning / Non-critical Cause: The physical disk does not comply with the standards set by Dell and is not supported. Clear Alert: None 903 Action: Replace the physical disk with a physical disk that is supported.
Table 3-4. Storage Management Messages (continued) Event ID Description 2362 2364 2366 Severity Cause and Action Related Alert Information SNMP Trap Numbers Physical OK / Normal / Cause: This alert is for disk(s) have Informational informational purposes. been removed Action: None. from a virtual disk. The virtual disk will be in Failed state during the next system reboot.
Table 3-4. Storage Management Messages (continued) Event ID Description 2367 Rebuild is not Warning / possible Non-critical because mixing of different media type (SSD/HDD) and bus protocols (SATA/SAS) is not supported on the same virtual disk. 2368 204 Severity Cause and Action Related Alert Information SNMP Trap Numbers Cause: The physical disk is using an incompatible technology.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2369 Virtual Disk Redundancy has been degraded. OK / Normal / Cause: A physical disk Informational in a RAID 6 virtual disk has either failed or been removed. Action: Replace the missing or failed physical disk. Related Alert Information SNMP Trap Numbers Clear Alert Number: 2121.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2373 Attempted OK / Normal / import of Informational unsupported Virtual Disk type RAID %1 Cause and Action Related Alert Information SNMP Trap Numbers Cause: This alert is provided for informational purposes. User is attempting to import a foreign virtual disk with unsupported RAID level on the controller. Clear Alert: None 751 Related Alert: None LRA Number: None Action: None.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2377 Attempted import of an orphan drive OK / Normal / Cause: User is Informational attempting to import an orphan drive. This alert is provided for informational purposes. Action: None. 2378 Attempted import of an incompatible physical drive OK / Normal / Cause: User is Informational attempting to import an incompatible physical drive. This alert is provided for informational purposes. Action: None.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2381 Controller preserved cache is recovered. OK / Normal / Cause: This alert is Clear Alert: 751 None Informational provided for informational purposes. Related Alert: Action: None None LRA Number: None 2382 An unWarning / supported Non-critical configuration was detected.
Table 3-4. Storage Management Messages (continued) Event ID Description 2384 The Warning Warning / level set for the Non-critical hot spare protection policy is violated for the Virtual Disk. 2385 2386 Severity Cause and Action Related Alert Information SNMP Trap Numbers Cause: The number of physical disks you specified for the hot spare protection policy is violated.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2387 A virtual disk Critical / Cause: Virtual disk bad bad block error Failure / Error blocks are due to is detected. presence of unrecoverable bad blocks on one or more member physical disks. Action: 1 Perform a backup of the virtual disk with the Verify option selected. One of the following can occur: • Backup operation fails. In this case, restore the file from a previous backup.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2387 contd. 2388 Cause and Action Related Alert Information SNMP Trap Numbers Clear Alert: None 751 2 To clear these bad blocks, execute the Clear Virtual Disk Bad Blocks task. 3 Run Patrol Read to ensure no new bad blocks are found. The Controller OK / Normal / Encryption Informational Key is destroyed. Cause: The Controller Encryption Key is destroyed. Action: None.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2393 The virtual disk is encrypted. OK / Normal / Cause: The Encrypted Informational virtual disk operation on normal virtual disk (created using Selfencrypting disks only) is successful. Action: None 2394 Persistent Hot OK / Normal / Cause: The Persistent Spare is Informational Hot Spare option is enabled. enabled.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2397 The Check Consistency completed with uncorrectable errors Cause: The Check Critical / Failure / Error Consistency task detected uncorrectable multiple errors. Clear Alert: None 1204 2398 The Manage Physical Disk Power property(s) changed OK / Normal / Cause: The Manage Clear Alert: 901 None Informational Physical Disk Power properties are changed.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity 2400 Physical disk Warning / configuration Non-critical data updated as it was stale. Cause and Action Related Alert Information SNMP Trap Numbers Cause: The physical disk configuration data is updated because it was outdated. Clear Alert: None 903 Action: None LRA Number: None Related Alert: None 2401 Configuration Failure / Error command could not be committed to disk. Configuration has to be re applied.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2404 Virtual Disk is OK / Normal / Cause: The operating not available Informational system does not detect the newly created virtual disk. Action: Wait for some time. 2405 Command timeout on physical disk Informational Cause: The spundown physical disks take more time than the timeout period and the configuration commands are timed out.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2409 Controller Informational Cause: Using Manage DKM Encryption Key Encryption key operations, encryption is changed key is changed. Action: None 2410 2411 Controller Encryption mode is changed to LKM Informational Cause: Encryption mode is changed to LKM. Action: None Controller Informational Cause: Using Manage LKM Encryption Key Encryption key operations, encryption is changed key is changed.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2414 Controller CacheCade is deleted Informational Cause: This alert is Clear Alert: 1201 None provided for informational purposes. Related Alert: None Action: None LRA Number: None 2415 Controller battery is discharging Informational Cause: The battery learn cycle has started.
Table 3-4. Storage Management Messages (continued) Event ID Description Severity Cause and Action 2417 There is an unrecoverable medium error detected on virtual disk Cause: Unrecoverable Critical / Failure / Error medium error found on one or more member physical disks of a virtual disk. • If the consistency check is successful, no further action is required. • If the consistency check finds and unrecoverable medium error, it means that the medium error is located in non-user data.
Table 3-4. Storage Management Messages (continued) Event ID 2417 cntd. 2418 Description Severity Cause and Action Related Alert Information SNMP Trap Numbers NOTE: If the unrecoverable medium error has not been corrected, it may be reported again by the system. This error can be fixed by writing data on the affected area or deleting and recreating the Virtual Disk as demonstrated in the following procedure. 1 Back up the data. 2 Delete the Virtual Disk.
Storage Management Message Reference
System Event Log Messages for IPMI Systems 4 The tables in this chapter list the system event log (SEL) messages, their severity, and cause. NOTE: For corrective actions, see the appropriate documentation. Temperature Sensor Events The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis.
Table 4-1. Temperature Sensor Events (continued) Event Message Severity Warning temperature sensor returned to warning state . temperature sensor returned to normal state . Cause Temperature of the backplane board, system board, or the carrier in the specified system returned from critical state to non-critical state.
Table 4-2. Voltage Sensor Events (continued) Event Message Severity Cause voltage sensor detected a warning . Warning Voltage of the monitored entity exceeded the warning threshold. voltage sensor returned to normal . Information The voltage of a previously reported is returned to normal state. Fan Sensor Events The cooling device sensors monitor how well a fan is functioning.
Table 4-3. Fan Sensor Events (continued) Event Message Severity Warning Fan sensor detected a warning . Cause The speed of the specified fan may not be sufficient to provide enough cooling to the system. Information The fan specified by may have failed and hence, the redundancy has been degraded. redundancy degraded.
Processor Status Events The processor status messages monitor the functionality of the processors in a system. These messages provide processor health and warning information of a system. Table 4-4. Processor Status Events Event Message Severity Cause status Critical processor sensor IERR, where is the processor that generated the event. For example, PROC for a single processor system and PROC # for multiprocessor system.
Table 4-4. Processor Status Events (continued) Event Message Severity Cause thermal tripped was deasserted. Information This event is generated when the processor has recovered from an earlier thermal condition. configuration error was asserted. Critical configuration error was deasserted. Information This event is generated when the earlier processor configuration error was corrected. throttled was asserted.
Table 4-5. Power Supply Events (continued) Event Message Severity Cause PS Redundancy sensor redundancy degraded. Information Power supply redundancy is degraded if one of the power supply sources is removed or failed. PS Redundancy sensor redundancy lost. Critical PS Redundancy sensor redundancy regained. Information This event is generated if the power supply has been reconnected or replaced.
Table 4-5. Power Supply Events (continued) Event Message Severity Cause PS 1 Status: Power supply Information This event is generated when the sensor for PS 1, failure power supply has recovered from was deasserted an earlier failure event. PS 1 Status: Power supply Warning sensor for PS 1, predictive failure was asserted This event is generated when the power supply is about to fail.
Memory ECC Events The memory ECC event messages monitor the memory modules in a system. These messages monitor the ECC memory correction rate and the type of memory events that occurred. Table 4-6. Memory ECC Events Event Message Severity Cause ECC error correction detected on Bank # DIMM [A/B]. Information This event is generated when there is a memory error correction on a particular Dual Inline Memory Module (DIMM). ECC uncorrectable error detected on Bank # [DIMM].
Table 4-7. BMC Watchdog Events (continued) Event Message Severity Cause BMC OS Watchdog Critical performed system power off. This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power off. Critical This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power cycle.
Table 4-8. Memory Events (continued) Event Message Severity Cause Memory Mirrored redundancy lost. Critical This event is generated when redundancy is lost in a mirrored memory configuration. Memory Mirrored redundancy regained. Information This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration. Memory Spared redundancy degraded. Warning This event is generated when there is a memory failure in a spared memory configuration.
Drive Events The drive event messages monitor the health of the drives in a system. These events are generated when there is a fault in the drives indicated. Table 4-10. Drive Events Event Message Severity Cause Drive asserted fault state. Critical This event is generated when the specified drive in the array is faulty. Drive de-asserted fault state. Information This event is generated when the specified drive recovers from a faulty condition.
Table 4-10. Drive Events (continued) Event Message Severity Cause Drive Critical This event is generated when the drive is placed in critical array. in critical array was asserted in critical array was deasserted Informational This event is generated when the drive is removed from critical array. Drive Critical Drive in failed array was asserted This event is generated when the drive is placed in the fail array.
Table 4-11. Intrusion Events (continued) Event Message Severity Cause sensor intrusion was asserted while system was ON This event is generated when the intrusion sensor detects an intrusion while the system is on. sensor intrusion was asserted while system was OFF This event is generated when the intrusion sensor detects an intrusion while the system is off.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause POST Err Critical This event is generated when an error occurs during system boot. See the system documentation for more information on the error code. POST fatal error # Critical or This event is generated when a fatal error occurs during system boot. See Table 4-13 for more information. Memory Spared Critical This event is generated when memory spare is no longer redundant.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause Information This event is generated when memory is removed from the (BANK# DIMM#) presence was system. asserted Memory Removed Memory Cfg Err Critical configuration error (BANK# DIMM#) was asserted This event is generated when memory configuration is incorrect for the system. redundancy regained Information This event is generated when memory redundancy is regained.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Hdwr version err hardware Critical incompatibility (BMC/iDRAC Firmware and CPU mismatch) was asserted Cause This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.
Table 4-12. BIOS Generated System Events (continued) Event Message Severity Cause LinkT/FlexAddr: Link Tuning sensor, device option ROM failed to support link tuning or flex address (Mezz XX) was asserted Critical This event is generated when the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature. LinkT/FlexAddr: Link Tuning sensor, failed to program virtual MAC address () was asserted.
POST Code Table Table 4-13 lists the POST Code errors that are generated when a fatal error occurs during system boot. Table 4-13. POST Code Errors Fatal Error Description Code Cause 80 No memory detected This error code implies that no memory is installed. 81 Memory detected but is not configurable This error code indicates memory configuration error that could be a result of bad memory, mismatched memory or bad socket. 82 Memory configured but not usable.
Table 4-13. POST Code Errors (continued) Fatal Error Description Code Cause C0 Shutdown test failure This error code indicates a shutdown test failure. C1 POST Memory test failure This error code indicates bad memory detection. C2 RAC configuration failure Check screen for the actual error message C3 CPU configuration failure Check screen for the actual error message C4 Incorrect memory configuration Memory population order not correct.
Cable Interconnect Events The cable interconnect messages in Table 4-15 are used for detecting errors in the hardware cabling. Table 4-15. Cable Interconnect Events Description Severity Cause Cable sensor Critical This event is generated when the cable is not connected or is incorrectly connected. Information This event is generated when the earlier cable connection error was corrected. Configuration error was asserted. Cable sensor Connection was asserted.
Power And Performance Events The power and performance events are used to detect degradation in system performance with change in power supply. Table 4-17. Power And Performance Events Description Severity Cause System Board Power Normal Optimized: Performance status sensor for System Board, degraded, was deasserted This event is generated when system performance was restored.
Table 4-17. Description Power And Performance Events Severity Cause System Board Power Warning Optimized: Performance status sensor for System Board, degraded, user defined power capacity was asserted This event is generated when a change in power supply degrades system performance. System Board Power Normal Optimized: Performance status sensor for System Board, degraded, user defined power capacity was deasserted This event is generated when the system performance is restored.
Entity Presence Events The entity presence messages are used for detecting different hardware devices. Table 4-18. Entity Presence Events Description Severity Cause Information This event is generated when the device was detected. Critical This event is generated when the device was not detected.
Table 4-19. Miscellaneous Events Mezz C Critical Status: Add-in Card sensor for Mezz C, install error was asserted This event is generated when an incorrect Mezzanine card is installed for I/O fabric. Hdwar version err: Critical Version Change sensor, hardware incompatibility was asserted This event is generated when an incompatible hardware is detected.
Table 4-19. Miscellaneous Events LinkT/FlexAddr: Critical Link Tuning sensor, failed to program virtual MAC address (Bus # Device # Function #) was asserted This event is generated when Flex address can be programmed for this device. LinkT/FlexAddr: Critical Link Tuning sensor, device option ROM failed to support link tuning or flex address (Mezz ) was asserted This event is generated when ROM does not support Flex address or link tuning.
Index A C AC power cord messages, 49 cable interconnect messages, 241 AC power cord sensor, 9 AC power cord sensor has failed, 232 Change write policy, 103 chassis intrusion messages, 35 Asset name changed, 119 Chassis intrusion sensor, 226 Asset tag changed, 119 chassis intrusion sensor, 9 Communication regained, 122 B Background initialization, 111 Bad block extended medium error, 119 Bad block extended sense error, 119 Communication timeout, 114 Controller event log %1, 190-192 Controller rebu
E Hot spare SMART polling, 166 Enclosure alarm, 115 Enclosure firmware mismatch, 103 entity presence messages, 242 Error occurred %1, 196 event description reference, 14 I Intrusion Events, 233 intrusion messages, 233 L Log monitoring, 233 F fan enclosure messages, 47 fan enclosure sensor, 9 fan sensor, 9 Fan Sensor Events, 223 Fan sensor has failed, 222 fan sensor messages, 223 Firmware version mismatch, 112 G Global hot spare, 91 H hardware log sensor, 9 Hardware Log Sensor Events, 231 hardware log
fan sensor, 223 hardware log sensor, 231 intrusion, 233 memory device, 46 memory ECC, 229 memory modules, 230 pluggable device, 55, 234 power supply, 42, 226 processor sensor, 52 processor status, 225 r2 generated system, 239 redundancy unit, 38 Server Administrator General, 19 storage management, 71 temperature sensor, 22, 221 voltage sensor, 29, 222 Multi-bit ECC error.
temperature, 9 voltage, 9 viewing events in Windows operating systems, 12 Service tag changed, 120 Virtual disk initialization, 113 Single-bit ECC error limit, 134 Virtual disk renamed, 122 Single-bit ECC error.