HP StorageWorks 1000 Modular Smart Array maintenance and service guide This guide provides procedures and diagnostics needed for the maintenance, service, and troubleshooting of the HP StorageWorks 1000 Modular Smart Array (MSA1000 or MSA).
Legal and notice information © Copyright 1999, 2002–2006-NaN Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor’s standard commercial license. The information contained herein is subject to change without notice.
Contents About this guide . . . . . . . . . . . . . . . . . . . . . . . . . . Intended audience . . . . . . . . Prerequisites . . . . . . . . . . Related documentation . . . . . . Document conventions and symbols HP technical support . . . . . . . Customer self repair . . . . . . . Subscription service . . . . . . . HP websites . . . . . . . . . . . Documentation feedback . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6 Customer replaceable components . . . . . . . . . . . . . . . . . Procedural overview . . . . . . . . . . . . . . . . . . Customer self repair . . . . . . . . . . . . . . . . . . Parts-only warranty service . . . . . . . . . . . . . Procuring the spare component . . . . . . . . . . . Returning the defective component . . . . . . . . . . Recommended tools . . . . . . . . . . . . . . . . . . Warnings and precautions . . . . . . . . . . . . . . . Electrostatic discharge information . . . . . . . . . .
Replacing a fan module . . . . . Before you begin . . . . . . Verifying component failure . . Removing the component . . . Installing the component . . . Verifying proper operation . . Replacing the SCSI I/O module . . Before you begin . . . . . . Verifying component failure . . Removing the component . . . Installing the component . . . Verifying proper operation . . Replacing the power button module Before you begin . . . . . . Verifying component failure . . Removing the component . . .
Canadian notice (avis Canadien) . Class A equipment . . . . . . Class B equipment . . . . . . European Union notice . . . . . . BSMI notice . . . . . . . . . . . Japanese notice . . . . . . . . . Korean notices . . . . . . . . . Safety notices . . . . . . . . . . . . Battery replacement notice . . . . Taiwan battery recycling notice . . Power cords . . . . . . . . . . . Japanese power cord notice . . . . Electrostatic discharge . . . . . . Preventing electrostatic damage Grounding methods . . . . . . . . .
Tables 1 ..Document conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 ..LCD panel messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 ..Component hot-pluggability . . . . . . . . . . . . . . . . . . . . . . . . . . .
About this guide This guide provides information about: • Maintaining the MSA • Servicing the MSA WARNING! To reduce the risk of personal injury from electric shock and hazardous energy levels, do not exceed the level of repairs specified in these procedures. Troubleshooting and repair procedures included in this document are detailed to allow only subassembly/module-level repair.
User documentation is available in the following locations: • MSA1000/1500 documentation CD • Technical documentation page of the MSA1000 website: http://www.hp.com/go/msa1000. • Manuals page of the HP Business Support Center website: http://www.hp.com/support/manuals. In the Storage section, click Storage array systems and then, in the Modular Array Systems, click HP StorageWorks 1000 Modular Smart Array.
HP technical support For worldwide technical support information, see the HP support website: http://www.hp.com/support. Before contacting HP, collect the following information: • • • • • • Product model names and numbers Technical support registration number (if applicable) Product serial numbers Error messages Operating system type and revision level Detailed questions Customer self repair HP customer self repair (CSR) programs allow you to repair your StorageWorks product.
About this guide
1 Illustrated parts list 2 8 7 6 3 1 4 5 15490 Item Description Service replacement module part number Additional/optional module part number Customer Self Repair Standard shipping configuration items: 1 Chassis (with backplane) 229198–001 n/a Optional 2 Blanking panel, MSA controller 229208–001 n/a Mandatory 3 MSA1000 controller with 256MB cache module 229203–001 218231-B22 Optional 4 Power button module 229201–001 n/a Optional 5 Power supply module 212398–001 n/a Option
Other available items: SCSI hard drives Contact HP Support Contact your HP Authorized Reseller Optional MSA1000 controller with 128MB cache 314718–001 n/a Optional Cache module (128MB) with battery 171387–001 n/a Optional Cache module (256MB) with battery 262012–001 254786-B21 Optional Cache battery kit 120978–001 n/a Optional MSA Fabric Switch 6 (embedded option — obsolete) 218681–001 n/a Optional MSA SAN Switch 2/8 (embedded option) 309503–001 288247-B21 Optional MSA Hub 2/3 (e
2 Specifications This chapter provides operating and performance specifications for the MSA1000. Included in this section: • Chassis specifications • Memory specifications • Power supply module specifications • SCSI I/O module specifications NOTE: For information on the components and their part numbers, see the Illustrated parts list.
Chassis specifications Item Description Height 17.5 cm (6.9 in) Width 52.1 cm (20.5 in) Depth 48.3 cm (19.0 in) Weight (no hard drives installed, single power supply) 22.7 kg (50 lb) International input voltage requirements Rated input voltage 100 to 240 VAC Rated input frequency 50 Hz to 60 Hz Rated input current 7.35 A Max Input power (max) 641 W* U.S. input voltage requirements Rated input voltage 100 VAC to 240 VAC Rated input frequency 50 Hz to 60 Hz Rated input current 7.
Memory specifications Item Description Speed 100 MHz minimum Width 80 bits NOTE: Use only HP battery-backed cache accelerator modules. Power supply module specifications Item Description Height 12.7 cm (4.5 in) Width 15.9 cm (6.25 in) Depth 24.1 cm (9.5 in) Weight 2.9 kg (6.4 lb) NOTE: The MSA1000 System Unit power supply specifications are calculated without the fan. SCSI I/O module specifications Item Description Height 11.4 cm (4.5 in) Width 3.5 cm (1.375 in) Depth 24.1 cm (9.
Specifications
3 System components and LEDs This chapter includes figures and tables that identify system components and describe chassis and module LEDs patterns.
Rear view 8 7 6 5 4 1 3 2 4 3 2 15578 1 Fibre Channel blanking panel 2 Power supply/blower assemblies 3 Power supply/blower assembly LEDs 4 AC power cord connectors 5 SCSI port A connector (enclosures connected to this port are labelled box 2) 6 SCSI port B connector (enclosures connected to this port are labelled box 3) 7 SCSI I/O module with integrated EMU 8 Fibre Channel I/O module Chassis and component LEDs The MSA chassis and its modular components are equipped with LEDs.
Chassis LEDs The chassis LEDs provide fundamental status information about the MSA. 1 2 3 15582 Item Name Condition Color Meaning 1 Heartbeat Blinking Green Normal Off/Solid Amber Power off or fault On Green Power on Off — Power off or system in Standby state On Amber Fault detected in one or more subsystems Off — Normal — no faults detected 2 3 Power Fault NOTE: System power does not shut off completely with the power button.
Controller LEDs During normal runtime, the array controller has 18 LEDs that indicate controller activity or malfunction. 15 14 13 12 7 6 5 4 11 10 3 9 8 17 1 0 16 2 15588 Item Name Condition Meaning 1 Drive failure On A configured hard drive has failed. 2 Cache activity On Cache active. Off No cache activity. Blinking Cache transfer pending. 3 SCSI bus 1 activity On Activity on the bus. 4 SCSI bus 0 activity On Activity on the bus.
Hard drive LEDs When a hard drive is configured as a part of an array and attached to a powered-on controller, the status of the hard drive can be determined from the illumination pattern of its LEDs. IMPORTANT: For additional information on replacing a hard drive, see Replacing a hard drive and Hard drive failure and faulted LUNs. SCSI hard drive LEDs Each SCSI hard drive has three LEDs. The following figure and table describe their locations and meanings.
On or Off‘ Blinking Off The hard drive is being accessed in one of the following ways: • The hard drive is a replacement hard drive and is being rebuilt. • If the online LEDs of all of the other hard drives in the array are blinking, an expansion is occurring. Do not remove the hard drive. Removing a hard drive at this time can cause data loss. Off Off On The hard drive has failed and has been placed offline. It is safe to replace the hard drive.
Fibre Channel I/O module LEDs The Fibre Channel I/O module has three LEDs. The following figure and table describe their locations and meanings.
Power supply/fan assembly LEDs Each power supply/fan assembly has one LED. The LED is illuminated when both the power supply and the fan are operational. When a power supply or fan fault occurs, the LED goes off. If the LED is off, AC power is not present or there is a complete power supply failure. 15583 The power-supply-mounted fans cool the chassis by circulating air through the enclosure. The rate at which air moves (the airflow) determines the amount of cooling.
SCSI I/O module LEDs The SCSI I/O module with integrated EMU has three LEDs. The following figure and table describe their locations and meanings.
System components and LEDs
4 Available diagnostic tools The following is a partial list of the tools that are supported for use with the MSA: • Array Configuration Utility (ACU) overview • Array Configuration Utility - Command Line Interface overview • MSA Command Line Interface (MSA-CLI) overivew • Array Diagnostics Utility (ADU) overview • HP Systems Insight Manager (HP-SIM) overview • HP Storage Management Initiative Specification (SMIS) overview • System Event Analyzer (SEA) overview Array Configuration Utility (ACU) overview Th
• ACU website: http://h18000.www1.hp.com/products/servers/priliantstorage/ softwaer-management/acumatrix/index.html. Array Configuration Utility Command Line Interface (ACU-CLI) overview The Array Configuration Utility Command Line Interface (ACU-CLI) is a command line user interface used to configure an array controller and its storage.
data for reporting and graphing. HP-SIM also provides base-level management of HP clients, printers, and selected storage. HP recommends using HP-SIM to monitor the performance of your Windows, Linux, and NetWare systems. IMPORTANT: You must install and use HP-SIM to benefit from the Pre-Failure Warranty for processors, SAS, SCSI, and SATA hard drives, and memory modules.
• Manual and automatic (real-time) analysis capabilities on local or remote machines • Filtering capabilities on error logs • Automatic notification (customer or HP Support Center) • Rule-based (allowing more frequent updates) • Proactive indictment with the operating system • Web and command line interfaces For more information, go to the WEBES website: http://h18023.www1.hp.com/support/svctools/ webes/?jumpid=reg_R1002_USEN.
5 LCD panel and message descriptions • LCD panel overview • LCD message types • LCD message descriptions NOTE: Some messages may specify a box number. For the MSA1000, the following box numbers are defined: Pre-defined box number Physical connection to MSA SCSI I/O module Box 1 This is the MSA1000 chassis Box 2 Port A of the SCSI I/O module Box 3 Port B of the SCSI I/O module LCD panel overview Each array controller contains an integrated Liquid Crystal Display (LCD).
1 2 7 3 4 5 6 15576 1 Fault LED—indicates an internal MSA component failure or external storage enclosure failure 2 Two-line, twenty-column text message LCD display panel 3 Left navigation button 4 Up navigation button 5 Right navigation button 6 Down navigation button 7 Redundancy Link LED (dual-controller configurations only) — indicates that both array controllers and their corresponding Fibre Channel I/O modules are in a redundant mode of operation LCD message types The display modu
User input messages User input messages indicate that the system has encountered a situation that allows user input. The user can select from a number of choices. If the user does not select one of the choices in a set amount of time, the system selects the default setting. These user input messages only occur during system power on, and not during runtime.
Message Type Description Action Yes = the volume is enabled (regardless of the issue.) The volume attempts to return to its normal operating state and the data is accessible. No = not selecting any option within the time-out period = the volume remains disabled. The volume stays in the failed state and the data is not be accessible. During this time, the user can attempt to fix the issue.
Message Type Description Action 10 FIRMWARE FLASH FAILED Error ROM flash failed. Download new firmware from the HP website and initiate the firmware update again. If the issue persists, contact HP support. 11 CLONING REFUSED SYSTEM HALTED Error A firmware mismatch was detected, message 07 CLONE FIRMWARE? '<'=NO, '>'=YES was displayed, and the user selected < for “No”. Firmware on the controllers must be updated to be the same.
Message Type Description Action 24 BAD SCSI BUS MODE NON-LVD DEVICE FOUND Error The MSA does not support SCSI Single Ended (SE) devices; it only supports SCSI Low Voltage Differential (LVD) devices. The MSA should be powered off and then all SCSI devices attached to it should be examined. Any SE devices found should be removed and replaced with LVD devices.
Message Type Description Action 32 CHASSIS NVRAM CONTENTS CORRUPTED Error The MSA has non-volatile memory on it that contains required information needed to operate. This non-volatile memory appears to be corrupted and the information is not valid. The MSA cannot continue to operate and halts. 40 INITIALIZING REDUNDANCY SUPPORT Informational Two array controllers are installed, and the they are attempting to enter redundant mode.
Message Type Description Action 44 REDUNDANCY FAILED MISMATCH HARDWARE Error Both array controllers must contain the same hardware for them to successfully enter redundant mode. The current array controllers do not contain the same hardware, possibly because one has an attached Fibre Channel daughter card and the other does not.
Message Type Description Action 48 REDUNDANCY HALTED FIRMWARE CLONED Informational Both array controllers must run the same version of firmware for them to successfully enter redundant mode. A process called firmware cloning has been successfully completed in order to make them both the same firmware level. The standby array controller is restarted automatically so they can attempt to achieve redundancy again.
Message Type Description Action 52 REDUNDANCY FAILED PCI BUS ERROR Error While either attempting to enter redundant mode or already operating in redundant mode, one of the array controllers encountered a PCI bus error on the communication channel used between the two array controllers. Redundancy is disabled at this time. If the MSA is currently involved in host I/O, remove the standby array controller, wait 10 seconds, and then reinsert it, ensuring that it is fully seated in the chassis.
Message Type Description Action 60 NO CACHE MODULE FOUND Error The array controller requires at least one cache module in order to operate. Either one is not present or it has failed. Remove the failed array controller, either add a cache module or replace the failed one, wait 10 seconds, and then reinsert it, ensuring that it is fully seated in the chassis. Should the issue persist, contact HP support.
Message Type Description 67 CACHE HARDWARE TEMPORARILY DISABLED Informational The cache memory hardware has temporarily been disabled, typically because the battery is not charged or a capacity expansion operation is occurring. The cache automatically enabled once the condition has been corrected. 68 OBSOLETE CACHE DATA DELETED Informational Old data that no longer belongs to any current configured volumes was found in the cache memory at power up. This data has been deleted.
Message Type Description Action 72 CACHE ERROR IGNORE? <=NO, >=YES User input During power up, data was found in the cache that could not be flushed to the hard drives. The reason is either because the data does not belong to this array controller (the cache module was moved from a different array controller) or the cache data is partial (the rest of the data is in another cache module that was removed from the array controller). This error could occur if cache modules are moved improperly.
Message Type Description Action 81 SMART DRIVE ALERT BOX #, BAY Informational A SCSI hard drive may be close to failing. This was determined either by the hard drive firmware itself using SMART technology or by the array controller using monitor and performance testing. 82 DRIVE HOT ADDED BOX #, BAY Informational A SCSI hard drive has been added to the MSA chassis or attached external storage enclosure.
Message Type Description Action 89 INVALID CONFIG BOX #, BAY Error For the MSA1500, before using an attached MSA20, the MSA20 is checked to see if any hard drives contain configuration data from a source other than the MSA1500. If so, the enclosure is failed and this message is displayed. An MSA20 attached to an MSA1500 must be configured by the 1500.
Message Type Description Action 106 VOLUME # STATE WAITING TO REBUILD Informational The configured volume is waiting to start rebuilding data on a SCSI hard drive that replaces a previously failed drive. The rebuild may not have started yet because the array controller is already performing a rebuild on another configured volume. 107 VOLUME # STATE WAITING TO EXPAND Informational The configured volume is waiting to start a volume expansion operation.
Message Type Description Action 112 VOLUME # REBUILD FAILURE Error The rebuild operation on the configured volume has failed. If the volume is still operating in regenerative mode, remove the new SCSI hard drive that was added as a replacement for the original failed hard drive and replace it with a different new drive. 113 VOLUME # EXPANSION FAILURE Error The volume expansion operation on the configured volume has failed.
Message Type Description Action 123 TOO MANY VOLUMES DETECTED Error The array controller only supports a maximum of 32 configured volumes. More volumes than that were detected at power up. This typically occurs when migrating a set of volumes from one array controller to a different array controller that already has configured volumes on it. The migrated volumes have not been added. Remove the migrated volumes and run the Array Configuration Utility (ACU).
Message Type Description Action 151 BEACON OFF Informational The LEDs on the MSA and all connected hard drives are no longer blinking because the host-based application is no longer identifying or configuring this MSA. 201 ARRAY CONTROLLER TEMPERATURE OK Informational The temperature sensor on the array controller indicates that the temperature that was previously exceeding the normal operating range is now back within the range.
Message Type Description 301 RECOVERY ROM AUTOFLASH DONE Informational The array controller has successfully completed the process of copying the current active firmware image into the backup recovery ROM. 302 RECOVERY ROM AUTOFLASH FAILED Error The array controller failed to copy the current active firmware image into the backup recovery ROM. Recovery ROM support is disabled.
Message Type Description Action 306 FIRMWARE FLASH STARTED Informational The array controller in the MSA has started the firmware flash process. Do not turn off power to the system until it has completed. This could take several minutes. 307 FIRMWARE FLASH DONE Informational The array controller in the MSA has completed the firmware flash process. It is now safe to restart the MSA and begin using the new firmware.
Message Type Description Action 402 STORAGE BOX # FAN DEGRADED Error A fan in the specified storage enclosure is not operating at full efficiency. The fan may eventually fail. 403 STORAGE BOX # FAN HOT INSERTED Informational An fan in the specified storage enclosure has been hot-added. 404 STORAGE BOX # FAN HOT REMOVED Informational A fan in the specified storage enclosure has been removed.
Message Type Description Action 412 STORAGE BOX # EMU NOT RESPONDING Error An EMU in the specified storage enclosure is not responding to commands. 413 STORAGE BOX # EMU VERSION Informational The version of firmware running on the EMU. This is only displayed for the internal EMU of the MSA chassis. It is not displayed for externally connected storage boxes. 414 STORAGE BOX # HOT REMOVED Informational An external storage enclosure has been hot-removed from the back of the MSA.
Message Type Description Action 511 FIBRE CHANNEL SUBSYSTEM HARDWARE FAILURE Error The array controller's Fibre Channel subsystem has encountered a critical error during the power up sequence. Remove the failed array controller and its associated Fibre Channel module, reseat the Fibre Channel daughter card on the array controller, wait 10 seconds, then reinsert the Fibre Channel module and the array controller, ensuring that they are fully seated in their respective bays in the chassis.
Message Type Description Action 518 PERSISTENT MEM ENABLED Informational Persistent memory data has been successfully initialized. 519 PERSISTENT MEM DISABLED Informational Persistent memory data has not been successfully initialized. Some configuration data may not be available. 520 UNKNOWN I/O MODULE DETECTED Error The MSA cannot determine the type of I/O module installed in the bay. Do the following: 1. Remove power from any external storage enclosures. 2. Remove power from the MSA. 3.
LCD panel and message descriptions
6 Customer replaceable components This chapter includes the following information regarding the removal and replacement of MSA1000 components: • Procedural overview • Customer self repair • Recommended tools • Warnings and precautions • Determining whether a component is hot-pluggable • Connecting the Power • Applying power to the MSA • Removing power from the MSA • Removing or installing an interconnect blank • Removing or installing a hard drive blank • Replacing a hard drive • Replacing the controller, c
Procedural overview To help you successfully replace hardware components on your storage system, the following information is included in all component replacement procedures: • Before you begin — This section may be included in the procedures to include component-specific precautions or notices. Be sure to read and observe these statements.
For more information about HP's customer Self Repair program, contact your local service provider. For the North American program, go to: http://www.hp.com/go/selfrepair. Parts-only warranty service Your HP Limited Warranty may include a parts-only warranty service. Under the terms of part-only warranty service, HP provides replacement parts free of charge. For parts-only warranty service, CSR part replacement is mandatory.
Warnings and precautions • • • • • • Electrostatic Discharge Information Grounding methods Equipment Symbols Weight Warning Rack warnings and precautions Device warnings and precautions Electrostatic discharge information To prevent damage to the system, be aware of the precautions you need to follow when setting up the system or handling parts. A discharge of static electricity from a finger or other conductor may damage system boards or other static-sensitive devices.
Equipment symbols The following symbols may be located on equipment in areas where hazardous conditions may exist. WARNING! Any enclosed surface or area of the equipment marked with these symbols indicates the presence of electrical shock hazards. Enclosed area contains no operator serviceable parts. To reduce the risk of injury from electrical shock hazards, do not open this enclosure. WARNING! Any RJ-45 receptacle marked with these symbols indicates a network interface connection.
Weight warning WARNING! The device can be very heavy. To reduce the risk of personal injury or damage to equipment: • Remove all hot-pluggable power supplies and modules to reduce the overall weight of the device before lifting. • Observe local health and safety requirements and guidelines for manual material handling. • Get help to lift and stabilize the device during installation or removal, especially when the device is not fastened to the rails. When a device weighs more than 22.
WARNING! Verify that the AC power supply branch circuit that provides power to the rack is not overloaded. Overloading AC power to the rack power supply circuit increases the risk of personal injury, fire, or damage to the equipment. The total rack load should not exceed 80 percent of the branch circuit rating. Consult the electrical authority having jurisdiction over your facility wiring and installation requirements.
CAUTION: To properly ventilate the system, you must provide at least 7.6 centimeters (3.0 inches) of clearance at the front and back of the device. CAUTION: When removing or replacing a failed component, allow 30 seconds to pass before inserting a replacement component. This time is needed to allow the device to clear out the system registries. CAUTION: Schedule physical configuration changes during periods of low or no activity.
Determining whether a component is hot-pluggable Before removing any serviceable part, determine whether the part is hot-pluggable or non-hot-pluggable. • If the component is hot-pluggable, a power shutdown of the device is not required for replacement of the part. • If the component is not hot-pluggable, the device must be powered down. CAUTION: Be sure the replacement is available before removing a component or blanking panel from the chassis.
Connecting power to the MSA HP strongly recommends using the power cord shipped with the MSA. If using a different power cord, your power cord should be approved for use in your country. The power cord must be rated for the product and for the voltage and current marked on the electrical ratings label of the product. The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product. In addition, the diameter of the wire must be a minimum of 1.
Applying power to the MSA Before applying power to the MSA, all components of the storage system must be installed and connected to the supported interconnect options. Hard drives should be installed in the enclosures so that they can be identified and configured at power up. The MSA and the other network components must be powered on in the following sequence: 1.
Removing power from the MSA CAUTION: System power to the MSA does not shut off completely with the power switch. The two positions of the front panel power switch should be considered as On and Standby, rather than On and Off. The standby position removes power from most of the electronics and the hard drives, but portions of the power supply and some internal circuitry remain active. • To remove all power from the system, you must disconnect the power cord from the storage system.
Removing or installing a Fibre Channel interconnect blank Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes.
Removing or installing a hard drive blank Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
Replacing a hard drive Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
• If you replace a hard drive while the system is off, it may be necessary to rebuild the replaced hard drive. 1. Review all warnings, cautions, and preparation procedures as detailed in Warnings and precautions. 2. Be sure that the Online and Drive Access LED LEDs are both Off and that the Fault LED is On. 3. Press the ejector button (1) and pull the release lever to the full open position (2). 4. Wait approximately 10 seconds for the internal disk to stop spinning before continuing to the next step.
1 2 15593 3. Close the ejector lever against the front of the hard drive (2). 4. Verify that the hard drive is firmly engaged and that the ejector lever is latched. CAUTION: Data can be lost if the hard drive is not firmly seated. 5. As the hard drive begins to spin, the drive LEDs will illuminate, indicating that the system has recognized the new drive. In fault-tolerant configurations, allow the replacement hard drive to be reconstructed automatically from parity data stored on the other hard drives.
Replacing the controller, controller cache, or controller battery The following sections detail procedures for replacing an MSA array controller, controller cache module, or controller cache battery pack: • Before you begin • Verifying component failure • Removing the controller • Removing the controller cache module • Removing the controller cache battery pack • Installing the controller cache battery pack • Installing the controller cache module • Installing the controller • Verifying the replacement Perf
Verifying component failure Before replacing the controller, cache module, or battery pack: • Check the controller LCD display panel for error messages. • Record any failure LEDs or LCD panel messages for reference purposes. • Check the controller heartbeat LED (1) . If the LED is not blinking, it might indicate a failure. • Check the controller fault LED (2) . If the LED is on, it might indicate a failure. 1 2 15512 Removing the controller 1.
Removing the controller cache module If your system is equipped with a single controller, and you must replace the controller cache: • You must schedule a maintenance window and remove power from the system to perform the service. If your system is equipped with two controllers: • and you are replacing a cache module with another module of the same size, you can remove one of the controllers and replace the module on that controller while the system is running.
WARNING! There is a risk of explosion, fire, or personal injury if the battery pack is incorrectly replaced or mistreated. To reduce the risk: • Do not attempt to recharge the battery outside of the controller. • Do not expose to water or temperatures higher than 60˚C. • Do not abuse, disassemble, crush, puncture, short external contacts, or dispose of in fire or water. • Replace only with the HP spare designated for this product.
15517 5. Lift the pack upward to unhook the top of the battery pack. 15518 6. Wait approximately 15 seconds after removing the old battery to allow the battery charge monitor to reset. NOTE: If you do not wait 15 seconds after removing the old battery, full charge of the new cache battery is delayed. 7. Repeat step 3 through step 6 to remove the second battery pack on this cache module. Installing the controller cache battery pack 1. Remove the array controller as instructed in Removing the controller.
5. After the battery pack is hooked in position, swing the pack downward, ensuring that the bottom clip and two pegs line up with the holes in the cache module. 6. Verify that the top hook (1) and bottom clip (2) on the battery pack are securely attached to the cache module. 1 2 15520 7. Repeatstep 4 through step 6 to install the second battery pack on the module. 8. Install the serviced cache module in the controller as instructed in Installing the controller cache module.
1 2 15595 3. If necessary, power on the MSA as instructed in Connecting the Power and Applying power to the MSA. NOTE: Each time a system with two controllers is powered on or restarted, or if a second controller is hot-plugged into a single-controller configuration, the firmware versions on the controllers are compared.
Replacing a Fibre Channel I/O module Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
CAUTION: Use appropriate precautions when handling Fibre Channel cables: • Touching the end of a Fibre Channel cable will either damage the cable or cause performance problems, including intermittent difficulties accessing the storage. • Whenever a Fibre Channel cable is not connected, replace the protective covers on the ends of the cable. • Make certain that the Fibre Channel cables are installed and supported so that no excess weight is placed on the connectors.
Replacing a 2-Gb small form factor pluggable (SFP) transceiver The transceiver is hot-pluggable, so it is not necessary to power down the system to replace it. Before you begin WARNING! To reduce the risk of injury from laser radiation or damage to the equipment, observe the following precautions: • Do not open any panels, operate controls, make adjustments, or perform procedures to a laser device other than those specified herein. • Do not stare into laser beam when panels are open.
Verifying proper operation After replacing the failed SFP, verify that: • The 1-Gb and 2-Gb link-status LEDs on the Fibre Channel I/O module cycle through blinking amber and green and then Off to indicate that the circuitry has recognized a newly installed SFP. • The status LED on the Fibre Channel I/O module is solid green. • No new error messages are displayed on the array controller LCD panel.
3. While lifting up the power supply module latch (1), grasp the fan element and pull the defective power supply assembly out of the chassis (2). 1 2 15557 Installing the component 1. Lift up on the power supply module latch (1) and slide the assembly into the chassis until it is fully seated (2). 1 2 15558 2. Connect the AC power cord to the power supply. 3. Engage the cord lock. Verifying proper operation After replacing the power supply, verify that: • The power supply LED is solid green.
Replacing a fan module The variable-speed fan module is hot-pluggable, so it is not necessary to power down the system to replace it. Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes.
2 1 1 15555 CAUTION: Pressing the center section of the fan can damage the blades. Press only the outer edge of the fan. Installing the component 1. Align the guidepost on the new fan with the power supply connector (1). 1 3 2 3 15556 2. Slide the replacement fan into the fan base (2) until the tabs snap into place (3). Verifying proper operation After replacing the fan module, verify that: • The fan starts operating immediately. • The fan LED is On.
Replacing the SCSI I/O module CAUTION: The SCSI I/O module with an integrated EMU is not hot-pluggable. Before replacing the module, all I/O from the servers must be stopped and the MSA must be powered down. Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device.
Installing the component 1. Slide the replacement SCSI I/O module into the bay until it clicks into place. 15560 2. Reconnect all SCSI cables. NOTE: Ensure that all of the SCSI cable connectors are fastened tightly. Verifying proper operation After replacing the failed SCSI I/O module, verify that: • The module status LED is solid green. • No new error messages are displayed on the array controller LCD panel. Replacing the power button module CAUTION: The power button module is not hot-pluggable.
Verifying component failure Before replacing the power button module: • Verify that the module status LED is not illuminated or is flashing amber. • Check the array controller LCD display panel for error messages and take appropriate action. Removing the component 1. Review all warnings, cautions, and preparation procedures as detailed in Warnings and precautions. 2. Stop all access to and remove power from the MSA. For instructions, see Removing power from the MSA.
Installing the component 1. Slide the replacement power button module into the bay until it clicks into place. 2. Reinstall all hard drives in slots 10–14. CAUTION: To prevent data loss, each hard drive must be installed in the same slot from which is was removed. 3. Reinstall the MSA in the rack. 4. Reconnect he SCSI, Fibre Channel, and power cables. 5. Apply power to the MSA as instructed in Applying power to the MSA.
Customer replaceable components
7 Enclosure and hard drive migrations This section describes the following: • Adding a new storage enclosure • Migrating existing enclosures • Moving hard drives or arrays NOTE: For the most up-to-date information on supported enclosures, see the MSA website. Adding a new storage enclosure New storage enclosures with unconfigured hard drives are hot-pluggable. Perform the following steps to attach and install a new enclosure to an existing MSA: 1. Install the new storage enclosure in the rack.
4. Attach SCSI cables to the MSA expansion SCSI connectors. 5. Attach the SCSI cables to the expansion storage enclosures. 6. Attach the power cords to the power supplies at the rear of the enclosures. 7. Power on each enclosure and wait for it to complete its startup routine. 8. Power on the MSA array controller system using the power button on the front of the system. 9. Wait for the MSA to complete its startup routine. 10. Power on the host servers and allow them to boot. 11.
8 Capacity expansion and extension Array expansion is the addition of physical hard drives to an array that has already been configured. The capacity of these added physical hard drives may then be added to an existing LUN on the array or configured into a new LUN. LUN extension is the enlargement of an existing LUN after the corresponding array has undergone capacity expansion. Expansions and extensions are performed through the ACU or MSA-CLI.
Capacity expansion and extension
9 Hard drive failures and faulted LUNs The purpose of fault-tolerant array configurations is to protect against data loss due to hard drive failure. Each RAID configuration has inherent limitations on the number of hard drive failures that it can tolerate. If the fault-tolerance level of a particular LUN or array configuration is exceeded, the array will be locked from any further I/O.
• RAID 6 configurations can tolerate simultaneous failure of two hard drives in the array. Compromised fault tolerance Each RAID configuration has inherent limitations on the number of physical hard drive failures that it can tolerate. If more hard drives fail than the fault-tolerance method allows, fault tolerance is compromised. When the MSA determines that the fault tolerance of a LUN is compromised, the LUN is taken offline and subsequent I/O requests are rejected.
b. Make copies of important data, if possible. c. Replace any failed hard drives and allow the Automatic Data Recovery (ADR) process to rebuild the data onto replacement drives. d. After failed hard drives have been replaced, fault tolerance may again be compromised. If so, power cycle the system, and if the #02 or #04 ENABLE VOLUME LCD message is displayed, press the right push button to re-enable the LUNs. 7. If using the ACU: a. Open the ACU and highlight the faulted LUN (shown with a red drive symbol.
• When RAID 6 (ADG) is used, two drives can fail simultaneously (and be replaced simultaneously) without data loss. • If the offline drive is a spare, the degraded drive can be replaced. • Do not remove a failed second hard drive from an array until the first failed or missing hard drive has been replaced and the rebuild process is complete. (When the rebuild is complete, the online LED on the front of the hard drive stops blinking.
Failure of another drive during rebuild If a non-correctable read error occurs on another physical drive in the array during the rebuild process, the Online LED of the replacement drive stops blinking and the rebuild abnormally terminates. If this situation occurs, restart the server and the storage system. The system may temporarily become operational long enough to allow recovery of unsaved data. In any case, locate the faulty drive, replace it, and restore data from backup.
Hard drive failures and faulted LUNs
10 Array controller firmware Each array controller contains Read-Only Memory (ROM), which holds the firmware that operates the controller. In dual-controller configurations, the two controllers must execute the same firmware version.
If the firmware version on the two controllers is not the same, the following prompt is displayed in the controller LCD panel: CLONE FIRMWARE ? ‘<‘ = NO, ‘>' = YES If > is pressed or a response is not made to the prompt within 60 seconds, the following messages are displayed on the LCD panel while firmware is copied from the active controller to the standby controller and the standby controller is automatically restarted: ROM CLONING STARTED ARRAY CONTROLLER RESTARTING STARTUP COMPLETE The two controllers s
The following instructions and examples use HyperTerminal. For more information, see user documentation for your emulator program. a. On the menu bar at the top of the HyperTerminal window, select Transfer > Send File. b. Click Browse and navigate to the location of the firmware image obtained in step 1. c. Expand the Protocol drop-down box and select 1K XModem. d. Click Send. A status window is displayed. 4. Wait for the transfer process to complete. NOTE: This process may take up to 15 minutes.
6. In dual-controller configurations only, if the newly installed firmware on the recovered controller differs from the firmware on the active controller, the CLONE FIRMWARE LCD message is displayed. See Cloning controller firmware.
11 SCSI hard drive firmware SCSI hard drive firmware updates are performed from the bootable ProLiant Firmware Maintenance CD and are required only when the following message is displayed on the MSA controller LCD panel: 85 BAD DRIVE FRMWARE BOX BAY CAUTION: This update procedure must be performed during a scheduled maintenance window. 1. Obtain the ISO image for the ProLiant Firmware Maintenance CD from the following website: http://www.hp.com/support/proliantstorage.
SCSI hard drive firmware
A Regulatory compliance and safety Regulatory compliance Federal Communications Commission notice Part 15 of the Federal Communications Commission (FCC) Rules and Regulations has established Radio Frequency (RF) emission limits to provide an interference-free radio frequency spectrum. Many electronic devices, including computers, generate RF energy incidental to their intended function and are, therefore, covered by these rules.
Declaration of conformity for products marked with the FCC logo, United States only This device complies with Part 15 of the FCC Rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. For questions regarding your product, visit http://www.hp.
European Union notice Products bearing the CE marking comply with the EMC Directive (89/336/EEC) and the Low Voltage Directive (73/23/EEC) issued by the Commission of the European Community and if this product has telecommunication functionality, the R&TTE Directive (1999/5/EC).
Korean notices Safety notices Battery replacement notice Your computer is equipped with a lithium manganese dioxide, a vanadium pentoxide, or an alkaline internal battery or battery pack. There is a danger of explosion and risk of personal injury if the battery is incorrectly replaced or mistreated. Replacement is to be done by an HP authorized service provider using the HP spare part designated for this product.
Taiwan battery recycling notice The Taiwan EPA requires dry battery manufacturing or importing firms in accordance with Article 15 of the Waste Disposal Act to indicate the recovery marks on the batteries used in sales, giveaway or promotion. Contact a qualified Taiwanese recycler for proper battery disposal. Power cords The power cord set must meet the requirements for use in the country where the product was purchased.
• Use a wrist strap connected by a ground cord to a grounded workstation or computer chassis. Wrist straps are flexible straps with a minimum of 1 megohm (±10 percent) resistance in the ground cords. To provide proper ground, wear the strap snug against the skin. • Use heel straps, toe straps, or boot straps at standing workstations. Wear the straps on both feet when standing on conductive floors or dissipating floor mats. • Use conductive field service tools.
Index A AC circuit overload, warning, 65 accumulators, 114 ADU, described, 30 alkaline battery warning, 114 audience, 9 Avis Canadien, regulatory compliance notice, 112 B backplane, replacement, 93 batteries recycling or disposal, 114 replacement notice, 114 warning, 114 Taiwan EPA recycling and disposal, 115 warnings, 78 boot straps, using, 116 BSMI, regulatory compliance notice, 113 C cables distances supported, 13 FCC compliance statement, 112 multi-mode, 13 option kits, 13 part numbers, 13 shielded
fibre subsystem link failure, 56 fibre subsystem link OK, 56 fibre switch hardware failure, 56 firmware flash done, 53 firmware flash done on box #, 53 firmware flash failed, 37 firmware flash failed on box #, 53 firmware flash started, 53 firmware flash started on box #, 53 firmware version, 35 FW version is not supported, 38 hard drive failure box, 46 hard drive hot added box bay, 46 hard drive hot removed box bay, 46 hard drive position change detected, 46 hard drive position charge invalid, 46
device preparation removing power, 70 warnings and precautions, 65 device, unloading from pallet, 64 diagnostic tools, 29 disposal, battery, 114, 115 dissipating floor mats, 116 document conventions, 10 prerequisites, 9 related documentation, 9 documentation HP website, 9 providing feedback, 11 E electrostatic damage prevention, 115 electrostatic discharge See ESD, 62 electrostatic discharge See ESD, 115 electrostatic sensitive parts handling, 62 packaging, 62 storing, 62 transporting, 62 enclosure expans
Federal Communications Commission.
storage box fan hot inserted, 54 storage box fan hot removed, 54 storage box fan OK, 53 storage box hot added, 55 storage box hot removed, 55 storage box power supply added, 54 storage box power supply ok, 54 storage box power supply removed, 54 storage box temperature ok, 54 system name, 36 valid cache data found at power up, 43 volume expansion disabled, 48 volume initializing parity, 48 volume media exchanged, 49 volume state deleted, 49 volume state expansion active, 47 volume state interim recovery, 47
regulatory compliance information number, 112 notices BSMI, 113 Canada, 112 Class A, 111 Class B, 111 European Union, 113 HP series number, 112 IEC EMC statement, worldwide, 112 Japan, 113 Korean, 114 modifications, 112 shielded cables, 112 related documentation, 9 replacing a power cord, 115 RFI/EMI connector hoods, 112 ROM cloning, 105 S SCSI I/O module LEDs, 27 SCSI Module with Integrated Environmental Monitoring Unit See I/O EMU, 90 series number, regulatory compliance, 112 spare parts list, 13 specif