Dell Server PRO Management Pack 3.
Notes, Cautions, and Warnings NOTE: A NOTE indicates important information that helps you make better use of your computer. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death. © 2013 Dell Inc.
Contents Notes, Cautions, and Warnings...................................................................................................2 1 Introduction..................................................................................................................................5 What's New in This Release.....................................................................................................................................5 Overview..........................................................
Introduction 1 This document is intended for system administrators who use the Dell Server PRO Management Pack (Dell PRO Pack) to monitor Dell systems and take remedial action when an inefficient system is identified. The Dell PRO Pack 3.
Overview Operations Manager uses PRO-enabled Management Pack to collect and store information on Dell hardware along with a description of their health status. Dell PRO Pack works with Operations Manager and VMM 2012 to manage Dell physical devices and their hosted virtual machines (VMs) using this available health information.
– Restrict and migrate: In this mode, it is recommended that all running VMs are migrated from an unhealthy server to a healthy server to prevent loss of service. For more information, see Implementing Recovery Actions. Understanding PRO Tip Management This section explains a typical Dell PRO Pack setup and the sequence of events involved in PRO tip management. Figure 1.
For more information on the type of events and the associated remedial actions, see Alerts and Recovery Actions. Supported Operating Systems The Dell PRO Pack supported operating systems on the managed system and management station are as follows: Managed system: The managed system for PRO Pack is a Virtual Machine Manager Server. For more information, see technet.microsoft.com/en-us/library/gg610649.aspx.
Using Dell Performance Resource Optimization Pack 2 This chapter suggests steps to use PRO Pack. Planning The Environment For PRO Tips You can plan for enabling the PRO Monitors that are relevant for the environment. By default, all the PRO Monitors are disabled in the Dell PRO Pack. For the list of alerts and the recovery actions, see Alerts and Recovery Actions. Select the alerts that you want to enable.
Alternatively, if you select the Show this window when new PRO Tips are created option in the PRO Tip window, the window automatically opens on the VMM console when a PRO Tip is generated. The PRO Tip window displays information such as source, tip, and state of the PRO Tip in a tabular format. The window also displays description of the problem that triggered the alert, the cause, and the suggested remedial action for recovery.
PRO Tip implementation of moving VMs can fail if no other healthy hosts are available in the host group or host cluster. In such a case, the PRO Tip window displays the state of the corresponding PRO Tip as Failed, and the reason is elaborated in the Error section. The status of the corresponding entry in the Jobs section on the VMM console is also display as Failed. NOTE: In the PRO Tip window the failure message is updated dynamically.
Alerts View Displays Dell PRO specific alerts in a tabular format with information on the severity level, source, name, resolution state, and, date and time of creation. To access the Alert View: 1. Launch the Operations Manager console. 2. Select the Monitoring tab. 3. From Dell Server PRO Pack, select Dell Server PRO Alerts. The alerts are displayed on the right-side of the screen, as shown in the following figure. State View Displays the discovered Dell system objects in a tabular format.
To manually reset the alert: 1. On the Actions menu, click Health Explorer . 2. Right-click the alert that you want to close. 3. Select Reset Health. The alert disappears from the PRO Tip window. Overriding Recovery Actions PRO Pack 3.0 supports two recovery actions. The following flag values trigger the respective recovery action: • 1: For migration • 2: For placing the server in restricted mode You can override the default recovery action by changing the default recovery action flag value.
Figure 2. Override Properties Alerts and Recovery Actions The following table lists the alerts and the recommended remedial actions: Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause 1053 Temperature sensor detected a warning value Warning A temperature sensor Restrict on the backplane board, system board, CPU, or drive carrier in the specified system exceeded its warning threshold value.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause 1204 Current sensor detected a failure value Error A current sensor in the Restrict and Migrate specified system exceeded its failure threshold value. 1305 Redundancy degraded Warning A power supply sensor Restrict reading in the specified system exceeded a warning threshold. 1306 Redundancy lost A power supply has been disconnected or has failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action controller while performing a task such as a rescan or a check consistency. 2056 Virtual Disk Failed Critical One or more physical Restrict and Migrate disks included in the virtual disk have failed. 2057 Virtual Disk Degraded Warning Warning This alert message occurs when a physical disk included in a redundant virtual disk fails.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2103 Temperature dropped below the Minimum Failure Threshold Critical The physical disk enclosure is too cool. Restrict and Migrate 2112 Enclosure shutdown Critical The physical disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause 2174 The controller battey has been removed Warning The controller cannot Restrict and Migrate communicate with the battery. The battery may be removed or the contact point maye degraded 2178 The controller battery Learn cycle has timed out Warning The controller battery Restrict must be fully charged before the Learn cycle can begin.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2246 The controller battery is degraded Warning The temperature of the Restrict the battery is high. This maybe due to the battery being charged. 2264 A device is missing Warning The controller cannot communicate with a device. The device may be removed. Restrict 2265 A device is in an unknown state Warning The controller cannot communicate with a device.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action connected to the same enclosure. 2289 Multi-bit ECC error on controller DIMM 2290 An error involving multiple bits has been encountered during a read or write operation. Restrict and Migrate Single-bit ECC error on Warning controller DIMM An error involving a single bit has been encountered during a read or write operation.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM 2306 Severity Alert Cause Dell PRO Tip Recommended Remedial Action Bad block table is 80% Warning full The bad block table is the table used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. Restrict 2307 Bad block table is full. The bad block table is the table used for remapping bad disk blocks.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM 2321 Single-bit ECC error. Critical The controller DIMM is nonfunctional. There will be no further reporting The dual in-line Restrict and Migrate memory module (DIMM) is malfunctioning. Data loss or data corruption is eminent. 2322 The DC power supply is switched off Critical The power supply unit Restrict and Migrate is switched off. Either a user switched off the power supply unit or it is defective.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action that cannot be corrected. 2342 The Check Warning Consistency found inconsistent parity data. Data redundancy may be lost The data on a source Restrict and Migrate disk and the redundant data on a target disk is inconsistent.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2397 The Check Consistency completed with uncorrectable errors Critical Medium errors in the physical drives. Restrict and Migrate 2416 Disk medium error detected Warning Disk medium error detected Restrict 2417 There is an Critical unrecoverable medium error detected on virtual disk Unrecoverable Restrict and Migrate medium error detected on virtual disk.
Related Documentation and Resources 3 This chapter gives the details of documents and resources to help you work with the Pro Pack 3.0. Security Considerations Operations Console access privileges are handled internally by Operations Manager. You can setup this using the User Roles option under Administration Security feature on the Operations Manager console. The profile of the role assigned to you determines what actions you can perform and which objects you are able to manage.