Dell Server PRO Management Pack 3.
Notes, Cautions, and Warnings NOTE: A NOTE indicates important information that helps you make better use of your computer. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death. © 2012 Dell Inc.
Contents Notes, Cautions, and Warnings...................................................................................................2 1 Introduction..................................................................................................................................5 What's New in This Release.....................................................................................................................................5 Overview..........................................................
Introduction 1 This document is intended for system administrators who use the Dell Server PRO Management Pack (Dell PRO Pack) to monitor Dell systems and take remedial action when an inefficient system is identified. The Dell PRO Pack 3.
Features and Functionalities Understanding PRO Tip Management Alerts and Recovery Actions Related Terms • • A managed system is a Dell system running the Dell OpenManage Server Administrator (OMSA), which is monitored and managed using Operations Manager and VMM. It is managed locally or remotely using supported tools. A management station or managing station is a Microsoft Windows based Dell system that has the Operations Manager and VMM installed to manage virtual workloads.
Figure 1. Interaction of Components In the figure, a group of PowerEdge systems act as the managed systems and two PowerEdge systems act as management stations hosting the Operations Manager and VMM. OMSA generates alerts with corresponding severity when there is a transition to an unhealthy state. Dell PRO Pack monitors the same alerts for PRO. Dell PRO Pack maps the OMSA alerts with its remedial action. The following table describes the sequence of events that occur in PRO Tip management.
The managed system for PRO Pack is a Virtual Machine Manager Server. For more information, see technet.microsoft.com/en-us/library/gg610649.aspx. Management station: For the list of supported configurations of Operations Manager and VMM, see the following: 8 • System Center 2012 Operations Manager - technet.microsoft.com/en-us/library/hh205990.aspx • System Center Operations Manager 2007 R2 - technet.microsoft.com/en-us/library/bb309428.
Using Dell Performance Resource Optimization Pack 2 This chapter suggests steps to use PRO Pack. Monitoring Using VMM You can manage the health of the virtualized environment using PRO Tips displayed on the VMM console. To see the PRO Tip window, click the PRO menu on the toolbar, as shown in the following figure. Alternatively, if you select the Show this window when new PRO Tips are created option in the PRO Tip window, the window automatically opens on the VMM console when a PRO Tip is generated.
The requirements for identifying a healthy system and moving the VMs are as follows: • Hardware requirements — Requirements that a host must meet to run VMs. For example, sufficient memory and storage. • Software requirements — Requirements if met by the host, allows a virtual machine to perform more optimally. For example, CPU allocation, network bandwidth, network availability, disk IO bandwidth, and free memory. VMM assigns a star rating to hosts in a range of zero to five.
For more information, see Using Health Explorer to Reset Alerts. VM Live Migration As a connected user, during live migration, you can migrate a VM from one node of a Windows Server 2008 R2 failover cluster to another node in the same cluster without any downtime or interruption. The difference in quick migration and live migration is that there is a downtime in quick migration whereas, there is no downtime in live migration. NOTE: Windows Server 2008 Hyper-V supports quick migration.
Using Health Explorer to Reset Alerts Health Explorer enables you to view and take action on alerts. When you select Dismiss in the PRO Tip window, the alert is removed from it. To manually reset the alert: 1. On the Actions menu, click Health Explorer . 2. Right-click the alert that you want to close. 3. Select Reset Health. The alert disappears from the PRO Tip window. Overriding Recovery Actions PRO Pack 3.0 supports two recovery actions.
9. Click Apply CAUTION: Saving the settings in the default management pack, creates a dependency between PRO Pack and the management pack. When you remove or delete PRO Pack, you must delete the default management pack as well, as it contains default settings for Operations Manager. Hence, it is recommended that you save settings using a new MP. 10. Click OK . 11. Generate an alert and PRO Tip. 12. Select Implement PRO Tip. This verifies that the overridden recovery action is successful. Figure 2.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action detected the failure of one or more fans. 1154 Voltage sensor detected a failure value Error A voltage sensor in the Restrict and Migrate specified system exceeded its failure threshold value. 1203 Current sensor detected a warning value Warning A current sensor in the Restrict specified system exceeded its warning threshold value.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 1703 Battery sensor detected a warning value Warning A battery sensor in the Restrict specified system detected that a battery is in a predictive failure state. 2048 Device Failed Error Critical A storage component Restrict and Migrate such as a physical disk or an enclosure has failed.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action excessive temperature. 2101 Temperature dropped below Minimum Warning Threshold Warning The physical disk enclosure is too cool. Restrict 2102 Temperature exceeded Maximum Failure Threshold Critical The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2169 The controller battery needs to be replaced Critical The controller battery Restrict and Migrate cannot recharge. The battery may have been already recharged the maximum number of times. In addition, the battery charger may not be working. 2171 The controller battery temperature is above normal Warning The room temperature may be too hot.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM 2207 Alert Cause Dell PRO Tip Recommended Remedial Action The only hot spare Warning available is a SAS disk. SAS disks cannot replace SATA disks The only physical disk available to be assigned as a hot spare is using SAS technology. Restrict 2213 Recharge count maximum exceeded Warning A virtual disk or an Restrict enclosure has lost data redundancy.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity punctured by the controller Alert Cause Dell PRO Tip Recommended Remedial Action error when attempting to read a block on the physical disk and marked that block as invalid. 2282 Hot spare SMART polling failed Critical The controller firmware attempted to do SMART polling on the hot spare but was not able to complete the SMART polling.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action 2299 Bad PHY Critical There is a problem with a physical connection or PHY. Restrict 2300 Unstable Enclosure Failure Critical The controller is not receiving a consistent response from the enclosure. Restrict and Migrate 2301 Enclosure Hardware Error Critical The enclosure or an enclosure component is in a Failed or Degraded state.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action Warning The battery or the battery charger is not functioning properly. Restrict monitoring is not possible. 2318 Problems with the battery or the battery charger have been detected. The battery health is poor. 2319 Single-bit ECC error on Warning controller DIMM. The dual in-line Restrict and Migrate memory module (DIMM) is beginning to malfunction.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause Dell PRO Tip Recommended Remedial Action controller is unable to correct the situation 2329 SAS port report Warning The text for this alert is Restrict and Migrate generated by the controller and can vary depending on the situation. 2337 The controller is unable to recover cached data from the battery backup unit (BBU) Critical The controller was Restrict unable to recover data from the cache.
Dell Event ID Alert Description on Operations Manager and PRO Tip in VMM Severity Alert Cause 2357 SAS expander error Critical There may be a Restrict problem with the enclosure. Verify the health of the enclosure and its components.
Related Documentation and Resources 3 This chapter gives the details of documents and resources to help you work with the Pro Pack 3.0. Security Considerations Operations Console access privileges are handled internally by Operations Manager. You can setup this using the User Roles option under Administration Security feature on the Operations Manager console. The profile of the role assigned to you determines what actions you can perform and which objects you are able to manage.