HP StorageWorks 1500 Modular Smart Array maintenance and service guide This guide provides procedures and diagnostics needed for the maintenance, service, and troubleshooting of the HP StorageWorks 1500 Modular Smart Array (MSA1500 or MSA).
Legal and notice information © Copyright 2004–2006-NaN Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor’s standard commercial license. The information contained herein is subject to change without notice.
Contents About this guide . . . . . . . . . . . . . . . . . . . . . . . . . . Intended audience . . . . . . . . Prerequisites . . . . . . . . . . Related documentation . . . . . . Document conventions and symbols HP technical support . . . . . . . Customer self repair . . . . . . . Subscription service . . . . . . . HP websites . . . . . . . . . . . Documentation feedback . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
LCD message types . . . Error messages . . . Informational messages User input messages . LCD message descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Customer replaceable components . . . .
Verifying proper operation . Replacing a fan module . . . . Before you begin . . . . . Verifying component failure . Removing the component . . Installing the component . . Verifying proper operation . Replacing a SCSI I/O module . . Before you begin . . . . . Verifying component failure . Removing the component . . Installing the component . . Verifying proper operation . Replacing a power supply module Before you begin . . . . . Verifying component failure . Removing the component . .
International notices and statements . . Canadian notice (avis Canadien) . Class A equipment . . . . . . Class B equipment . . . . . . European Union notice . . . . . . BSMI notice . . . . . . . . . . . Japanese notice . . . . . . . . . Korean notices . . . . . . . . . Safety notices . . . . . . . . . . . . Battery replacement notice . . . . Taiwan battery recycling notice . . Power cords . . . . . . . . . . . Japanese power cord notice . . . . Electrostatic discharge . . . . . .
Tables 1 ..Document conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 ..LCD panel messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 ..Component hot-pluggability . . . . . . . . . . . . . . . . . . . . . . . . . . .
About this guide This guide provides information about: • Maintaining the MSA • Servicing the MSA WARNING! To reduce the risk of personal injury from electric shock and hazardous energy levels, do not exceed the level of repairs specified in these procedures. Troubleshooting and repair procedures included in this document are detailed to allow only subassembly/module-level repair.
User documentation is available in the following locations: • MSA1000/1500 documentation CD • Technical documents page of the MSA1500 website: http://www.hp.com/go/msa1500cs. • Manuals page of the HP Business Support Center website: http://www.hp.com/support/manuals. In the Storage section, click Storage array systems and then, in the Modular Array Systems, click HP StorageWorks 1500 Modular Smart Array.
HP technical support For worldwide technical support information, see the HP support website: http://www.hp.com/support. Before contacting HP, collect the following information: • • • • • • Product model names and numbers Technical support registration number (if applicable) Product serial numbers Error messages Operating system type and revision level Detailed questions Customer self repair HP customer self repair (CSR) programs allow you to repair your StorageWorks product.
About this guide
1 Illustrated parts list 4 3 2 5 6 1 7 8 15491 Item Description Service replacement module part number Additional/optional module part number Customer Self Repair Standard shipping configuration items: 1 Power supply modules 349800–001 / 406442–001 n/a Mandatory 2 SCSI I/O module 361261–001 AA988A Optional 3 Fan modules 349798–001 n/a Mandatory 4 Fibre Channel I/O module 361260–001 AA987A Optional 5 Chassis with backplane 361263–001 n/a Optional 6 Bezel 361262–001 Man
7 MSA1000 controller with 256MB cache module 314718–001 / 411048–001 218231-B22 Optional 8 Blanking panel, MSA controller 229208–001 n/a Mandatory AC power cords Contact HP Support Contact your HP Authorized Reseller Mandatory VHDCI SCSI cables Contact HP Support Contact your HP Authorized Reseller Mandatory Custom serial cable for CLI connection 316131–001 n/a Mandatory SCSI hard drives Contact HP Support Contact your HP Authorized Reseller Optional SATA hard drives Contact HP Su
2 Specifications This chapter provides operating and performance specifications for the MSA1500. Included in this section: • Chassis specifications • Memory specifications • Power supply module specifications • SCSI I/O module specifications • Fan module specifications • Fibre Channel I/O module specifications NOTE: For information on the components and their part numbers, see the Illustrated parts list.
Chassis specifications Item Description Height 8.79 cm (3.46 in) Width 48.41 cm (19.06 in) Depth 61.77 cm (24.32 in) Weight 24.95 kg (55 lb) Input voltage requirements Rated input voltage 100 to 240 VAC Rated input frequency 50 Hz to 60 Hz Rated input current 6.4A Max, 1A typical (MSA1000 was 7.
Power supply module specifications Item Description Height 7.7 cm (3.0 in) Width 6.4 cm (2.5 in) Depth 32.0 cm (12.6 in) Weight 1.80 kg (3.96 lb) SCSI I/O module specifications Item Description Height 8.13 cm (3.2 in) Width 2.34 cm (0.92 in) Depth 37.14 cm (14.62 in) Weight 0.46 kg (1.01 lbs) Fan module specifications Item Description Height 6.0 cm (2.36 in) Width 6.0 cm (2.36 in) Depth 26.8 cm (10.6 in) Weight 0.50 kg (1.
Specifications
3 System components and LEDs This chapter includes figures and tables that identify system components and describe chassis and module LEDs patterns.
Rear view 1 10 2 9 8 3 7 6 5 4 15504 1 Fibre Channel I/O module for controller 1 2 Slot diagram 3 Blank for optional Fibre Channel I/O module for controller 2 4 Power supplies 5 SCSI I/O module (bus 0) 6 Blank for optional SCSI I/O module (bus 1) 7 Fan modules 8 Blank for optional SCSI I/O module (bus 2) 9 Blank for optional SCSI I/O module (bus 3) 10 Unused slot SCSI I/O ports 1 2 15509 1 SCSI Port B 2 SCSI Port A NOTE: Two SATA storage enclosures or one SCSI storage e
Slot diagram label 1 2 3 8 4 4 7 5 3 7 6 2 5 6 8 2 5 1 10 1 1 2 9 0 PS 2 PS 1 15523 1 Fibre Channel I/O module 2 SCSI I/O module (bus 3) 3 SCSI I/O module (bus 2) 4 Fan module 5 Fan module 6 Fibre Channel I/O module 7 SCSI I/O module (bus 1) 8 SCSI I/O module (bus 0) 9 Power supply 10 Power supply Chassis and component LEDs The MSA chassis and its modular components are equipped with LEDs.
Chassis LEDs The chassis LEDs provide fundamental status information about the MSA.
Controller LEDs During normal runtime, the array controller has 18 LEDs that indicate controller activity or malfunction. 1 2 3 4 5 6 7 8 9 18 17 16 15 14 13 12 11 10 Item 15588 Name Condition Meaning 1 Drive failure On A configured hard drive has failed. 2 Cache activity On Cache active. Off No cache activity. Blinking Cache transfer pending. 3 SCSI bus 1 activity On Activity on the bus. 4 SCSI bus 0 activity On Activity on the bus.
Hard drive LEDs When a hard drive is configured as a part of an array and attached to a powered-on controller, the status of the hard drive can be determined from the illumination pattern of its LEDs. IMPORTANT: For additional information on replacing a hard drive, see ??? and Hard drive failure and faulted LUNs. SCSI hard drive LEDs Each SCSI hard drive has three LEDs. The following figure and table describe their locations and meanings.
On or Off‘ Blinking Off The hard drive is being accessed in one of the following ways: • The hard drive is a replacement hard drive and is being rebuilt. • If the online LEDs of all of the other hard drives in the array are blinking, an expansion is occurring. Do not remove the hard drive. Removing a hard drive at this time can cause data loss. Off Off On The hard drive has failed and has been placed offline. It is safe to replace the hard drive.
SATA hard drive LEDs Each SATA hard drive has two LEDs. The following figure and table describe their locations and meanings. 1 2 15464 1 Online/ activity LED (green) 2 Fault/UID LED (amber/blue) Meaning On, off, or Blinking Alternating amber and blue The drive has failed, or a predictive failure alert has been received for this drive; it also has been selected by a management application.
Fibre Channel I/O module LEDs The Fibre Channel I/O module has three bi-color LEDs. the following figure and table describe their locations and meanings.
Fan module LEDs Each fan module has one bi-color LED. This LED can be illuminated with either a green or amber color. The following figure and table describes their locations and meanings. 1 2 15505 Item Name Condition Color Meaning 1 Status Solid Green Power on Off — Power off Blinking Amber Fault 2 Fault Fan modules cool the chassis by circulating air through the enclosure. The rate at which air moves (the airflow) determines the amount of cooling.
SCSI I/O module LEDs The SCSI I/O module has two LEDs. The following figure and table describes their locations and meanings.
Power supply module LEDs The power supply uses a single bi-color LED to indicate status. This LED can be illuminated with either a green or amber color. The following figure and table describes their locations and meanings.
4 Available diagnostic tools The following is a partial list of the tools that are supported for use with the MSA: • Array Configuration Utility (ACU) overview • Array Configuration Utility - Command Line Interface overview • MSA Command Line Interface (MSA-CLI) overivew • Array Diagnostics Utility (ADU) overview • HP Systems Insight Manager (HP-SIM) overview • HP Storage Management Initiative Specification (SMIS) overview • System Event Analyzer (SEA) overview Array Configuration Utility (ACU) overview Th
• ACU website: http://h18000.www1.hp.com/products/servers/priliantstorage/ softwaer-management/acumatrix/index.html. Array Configuration Utility Command Line Interface (ACU-CLI) overview The Array Configuration Utility Command Line Interface (ACU-CLI) is a command line user interface used to configure an array controller and its storage.
data for reporting and graphing. HP-SIM also provides base-level management of HP clients, printers, and selected storage. HP recommends using HP-SIM to monitor the performance of your Windows, Linux, and NetWare systems. IMPORTANT: You must install and use HP-SIM to benefit from the Pre-Failure Warranty for processors, SAS, SCSI, and SATA hard drives, and memory modules.
• Manual and automatic (real-time) analysis capabilities on local or remote machines • Filtering capabilities on error logs • Automatic notification (customer or HP Support Center) • Rule-based (allowing more frequent updates) • Proactive indictment with the operating system • Web and command line interfaces For more information, go to the WEBES website: http://h18023.www1.hp.com/support/svctools/ webes/?jumpid=reg_R1002_USEN.
5 LCD panel and message descriptions • LCD panel overview • LCD message types • LCD message descriptions NOTE: Some messages may specify a box number.
1 2 7 3 4 5 6 15576 1 Fault LED—indicates an internal MSA component failure or external storage enclosure failure 2 Two-line, twenty-column text message LCD display panel 3 Left navigation button 4 Up navigation button 5 Right navigation button 6 Down navigation button 7 Redundancy Link LED (dual-controller configurations only) — indicates that both array controllers and their corresponding Fibre Channel I/O modules are in a redundant mode of operation LCD message types The display modu
User input messages User input messages indicate that the system has encountered a situation that allows user input. The user can select from a number of choices. If the user does not select one of the choices in a set amount of time, the system selects the default setting. These user input messages only occur during system power on, and not during runtime.
Message Type Description Action Yes = the volume is enabled (regardless of the issue.) The volume attempts to return to its normal operating state and the data is accessible. No = not selecting any option within the time-out period = the volume remains disabled. The volume stays in the failed state and the data is not be accessible. During this time, the user can attempt to fix the issue.
Message Type Description Action 10 FIRMWARE FLASH FAILED Error ROM flash failed. Download new firmware from the HP website and initiate the firmware update again. If the issue persists, contact HP support. 11 CLONING REFUSED SYSTEM HALTED Error A firmware mismatch was detected, message 07 CLONE FIRMWARE? '<'=NO, '>'=YES was displayed, and the user selected < for “No”. Firmware on the controllers must be updated to be the same.
Message Type Description Action 24 BAD SCSI BUS MODE NON-LVD DEVICE FOUND Error The MSA does not support SCSI Single Ended (SE) devices; it only supports SCSI Low Voltage Differential (LVD) devices. The MSA should be powered off and then all SCSI devices attached to it should be examined. Any SE devices found should be removed and replaced with LVD devices.
Message Type Description Action 32 CHASSIS NVRAM CONTENTS CORRUPTED Error The MSA has non-volatile memory on it that contains required information needed to operate. This non-volatile memory appears to be corrupted and the information is not valid. The MSA cannot continue to operate and halts. 40 INITIALIZING REDUNDANCY SUPPORT Informational Two array controllers are installed, and the they are attempting to enter redundant mode.
Message Type Description Action 44 REDUNDANCY FAILED MISMATCH HARDWARE Error Both array controllers must contain the same hardware for them to successfully enter redundant mode. The current array controllers do not contain the same hardware, possibly because one has an attached Fibre Channel daughter card and the other does not.
Message Type Description Action 48 REDUNDANCY HALTED FIRMWARE CLONED Informational Both array controllers must run the same version of firmware for them to successfully enter redundant mode. A process called firmware cloning has been successfully completed in order to make them both the same firmware level. The standby array controller is restarted automatically so they can attempt to achieve redundancy again.
Message Type Description Action 52 REDUNDANCY FAILED PCI BUS ERROR Error While either attempting to enter redundant mode or already operating in redundant mode, one of the array controllers encountered a PCI bus error on the communication channel used between the two array controllers. Redundancy is disabled at this time. If the MSA is currently involved in host I/O, remove the standby array controller, wait 10 seconds, and then reinsert it, ensuring that it is fully seated in the chassis.
Message Type Description Action 60 NO CACHE MODULE FOUND Error The array controller requires at least one cache module in order to operate. Either one is not present or it has failed. Remove the failed array controller, either add a cache module or replace the failed one, wait 10 seconds, and then reinsert it, ensuring that it is fully seated in the chassis. Should the issue persist, contact HP support.
Message Type Description 67 CACHE HARDWARE TEMPORARILY DISABLED Informational The cache memory hardware has temporarily been disabled, typically because the battery is not charged or a capacity expansion operation is occurring. The cache automatically enabled once the condition has been corrected. 68 OBSOLETE CACHE DATA DELETED Informational Old data that no longer belongs to any current configured volumes was found in the cache memory at power up. This data has been deleted.
Message Type Description Action 72 CACHE ERROR IGNORE? <=NO, >=YES User input During power up, data was found in the cache that could not be flushed to the hard drives. The reason is either because the data does not belong to this array controller (the cache module was moved from a different array controller) or the cache data is partial (the rest of the data is in another cache module that was removed from the array controller). This error could occur if cache modules are moved improperly.
Message Type Description Action 81 SMART DRIVE ALERT BOX #, BAY Informational A SCSI hard drive may be close to failing. This was determined either by the hard drive firmware itself using SMART technology or by the array controller using monitor and performance testing. 82 DRIVE HOT ADDED BOX #, BAY Informational A SCSI hard drive has been added to the MSA chassis or attached external storage enclosure.
Message Type Description Action 89 INVALID CONFIG BOX #, BAY Error For the MSA1500, before using an attached MSA20, the MSA20 is checked to see if any hard drives contain configuration data from a source other than the MSA1500. If so, the enclosure is failed and this message is displayed. An MSA20 attached to an MSA1500 must be configured by the 1500.
Message Type Description Action 106 VOLUME # STATE WAITING TO REBUILD Informational The configured volume is waiting to start rebuilding data on a SCSI hard drive that replaces a previously failed drive. The rebuild may not have started yet because the array controller is already performing a rebuild on another configured volume. 107 VOLUME # STATE WAITING TO EXPAND Informational The configured volume is waiting to start a volume expansion operation.
Message Type Description Action 112 VOLUME # REBUILD FAILURE Error The rebuild operation on the configured volume has failed. If the volume is still operating in regenerative mode, remove the new SCSI hard drive that was added as a replacement for the original failed hard drive and replace it with a different new drive. 113 VOLUME # EXPANSION FAILURE Error The volume expansion operation on the configured volume has failed.
Message Type Description Action 123 TOO MANY VOLUMES DETECTED Error The array controller only supports a maximum of 32 configured volumes. More volumes than that were detected at power up. This typically occurs when migrating a set of volumes from one array controller to a different array controller that already has configured volumes on it. The migrated volumes have not been added. Remove the migrated volumes and run the Array Configuration Utility (ACU).
Message Type Description Action 151 BEACON OFF Informational The LEDs on the MSA and all connected hard drives are no longer blinking because the host-based application is no longer identifying or configuring this MSA. 201 ARRAY CONTROLLER TEMPERATURE OK Informational The temperature sensor on the array controller indicates that the temperature that was previously exceeding the normal operating range is now back within the range.
Message Type Description 301 RECOVERY ROM AUTOFLASH DONE Informational The array controller has successfully completed the process of copying the current active firmware image into the backup recovery ROM. 302 RECOVERY ROM AUTOFLASH FAILED Error The array controller failed to copy the current active firmware image into the backup recovery ROM. Recovery ROM support is disabled.
Message Type Description Action 306 FIRMWARE FLASH STARTED Informational The array controller in the MSA has started the firmware flash process. Do not turn off power to the system until it has completed. This could take several minutes. 307 FIRMWARE FLASH DONE Informational The array controller in the MSA has completed the firmware flash process. It is now safe to restart the MSA and begin using the new firmware.
Message Type Description Action 402 STORAGE BOX # FAN DEGRADED Error A fan in the specified storage enclosure is not operating at full efficiency. The fan may eventually fail. 403 STORAGE BOX # FAN HOT INSERTED Informational An fan in the specified storage enclosure has been hot-added. 404 STORAGE BOX # FAN HOT REMOVED Informational A fan in the specified storage enclosure has been removed.
Message Type Description Action 412 STORAGE BOX # EMU NOT RESPONDING Error An EMU in the specified storage enclosure is not responding to commands. 413 STORAGE BOX # EMU VERSION Informational The version of firmware running on the EMU. This is only displayed for the internal EMU of the MSA chassis. It is not displayed for externally connected storage boxes. 414 STORAGE BOX # HOT REMOVED Informational An external storage enclosure has been hot-removed from the back of the MSA.
Message Type Description Action 511 FIBRE CHANNEL SUBSYSTEM HARDWARE FAILURE Error The array controller's Fibre Channel subsystem has encountered a critical error during the power up sequence. Remove the failed array controller and its associated Fibre Channel module, reseat the Fibre Channel daughter card on the array controller, wait 10 seconds, then reinsert the Fibre Channel module and the array controller, ensuring that they are fully seated in their respective bays in the chassis.
Message Type Description Action 518 PERSISTENT MEM ENABLED Informational Persistent memory data has been successfully initialized. 519 PERSISTENT MEM DISABLED Informational Persistent memory data has not been successfully initialized. Some configuration data may not be available. 520 UNKNOWN I/O MODULE DETECTED Error The MSA cannot determine the type of I/O module installed in the bay. Do the following: 1. Remove power from any external storage enclosures. 2. Remove power from the MSA. 3.
LCD panel and message descriptions
6 Customer replaceable components This chapter includes the following information regarding the removal and replacement of MSA1500 components: • Procedural overview • Customer self repair • Recommended tools • Warnings and precautions • Determining whether the component is hot-pluggable • Connecting the Power • Applying power to the MSA • Removing power from the MSA • Removing or installing an interconnect blank • Removing or installing a hard drive blank • Replacing a hard drive • Replacing the controller,
Procedural overview To help you successfully replace hardware components on your storage system, the following information is included in all component replacement procedures: • Before you begin — This section may be included in the procedures to include component-specific precautions or notices. Be sure to read and observe these statements.
For more information about HP's customer Self Repair program, contact your local service provider. For the North American program, go to: http://www.hp.com/go/selfrepair. Parts-only warranty service Your HP Limited Warranty may include a parts-only warranty service. Under the terms of part-only warranty service, HP provides replacement parts free of charge. For parts-only warranty service, CSR part replacement is mandatory.
Warnings and precautions • • • • • • Electrostatic Discharge Information Grounding methods Equipment Symbols Weight Warning Rack warnings and precautions Device warnings and precautions Electrostatic discharge information To prevent damage to the system, be aware of the precautions you need to follow when setting up the system or handling parts. A discharge of static electricity from a finger or other conductor may damage system boards or other static-sensitive devices.
Equipment symbols The following symbols may be located on equipment in areas where hazardous conditions may exist. WARNING! Any enclosed surface or area of the equipment marked with these symbols indicates the presence of electrical shock hazards. Enclosed area contains no operator serviceable parts. To reduce the risk of injury from electrical shock hazards, do not open this enclosure. WARNING! Any RJ-45 receptacle marked with these symbols indicates a network interface connection.
Weight warning WARNING! The device can be very heavy. To reduce the risk of personal injury or damage to equipment: • Remove all hot-pluggable power supplies and modules to reduce the overall weight of the device before lifting. • Observe local health and safety requirements and guidelines for manual material handling. • Get help to lift and stabilize the device during installation or removal, especially when the device is not fastened to the rails. When a device weighs more than 22.
WARNING! Verify that the AC power supply branch circuit that provides power to the rack is not overloaded. Overloading AC power to the rack power supply circuit increases the risk of personal injury, fire, or damage to the equipment. The total rack load should not exceed 80 percent of the branch circuit rating. Consult the electrical authority having jurisdiction over your facility wiring and installation requirements.
CAUTION: To properly ventilate the system, you must provide at least 7.6 centimeters (3.0 inches) of clearance at the front and back of the device. CAUTION: When removing or replacing a failed component, allow 30 seconds to pass before inserting a replacement component. This time is needed to allow the device to clear out the system registries. CAUTION: Schedule physical configuration changes during periods of low or no activity.
Determining whether a component is hot-pluggable Before removing any serviceable part, determine whether the part is hot-pluggable or non-hot-pluggable. • If the component is hot-pluggable, a power shutdown of the device is not required for replacement of the part. • If the component is not hot-pluggable, the device must be powered down. CAUTION: Be sure the replacement is available before removing a component or blanking panel from the chassis.
Connecting power to the MSA HP strongly recommends using the power cord shipped with the MSA. If using a different power cord, your power cord should be approved for use in your country. The power cord must be rated for the product and for the voltage and current marked on the electrical ratings label of the product. The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product. In addition, the diameter of the wire must be a minimum of 1.
Applying power to the MSA Before applying power to the MSA, all components of the storage system must be installed and connected to the supported interconnect options. Hard drives should be installed in the enclosures so that they can be identified and configured at power up. The MSA and the other network components must be powered on in the following sequence: 1.
Removing power from the MSA CAUTION: System power to the MSA does not shut off completely with the power switch. The two positions of the front panel power switch should be considered as On and Standby, rather than On and Off. The standby position removes power from most of the electronics and the hard drives, but portions of the power supply and some internal circuitry remain active. • To remove all power from the system, you must disconnect the power cord from the storage system.
Removing or installing a Fibre Channel interconnect blank Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes.
Removing or installing a hard drive blank Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
Replacing a hard drive Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
• If you replace a hard drive while the system is off, it may be necessary to rebuild the replaced hard drive. 1. Review all warnings, cautions, and preparation procedures as detailed in Warnings and precautions. 2. Be sure that the Online and Drive Access LED LEDs are both Off and that the Fault LED is On. 3. Press the ejector button (1) and pull the release lever to the full open position (2).
1 2 15621 3. Close the ejector lever against the front of the hard drive (2). 4. Verify that the hard drive is firmly engaged and that the ejector lever is latched. CAUTION: Data can be lost if the hard drive is not firmly seated. 5. As the hard drive begins to spin, the drive LEDs will illuminate, indicating that the system has recognized the new drive. In fault-tolerant configurations, allow the replacement hard drive to be reconstructed automatically from parity data stored on the other hard drives.
Replacing the controller, controller cache, or controller battery The following sections detail procedures for replacing an MSA array controller, controller cache module, or controller cache battery pack: • Before you begin • Verifying component failure • Removing the controller • Removing the controller cache module • Removing the controller cache battery pack • Installing the controller cache battery pack • Installing the controller cache module • Installing the controller • Verifying the replacement Perf
Verifying component failure Before replacing the controller, cache module, or battery pack: • Check the controller LCD display panel for error messages. • Record any failure LEDs or LCD panel messages for reference purposes. • Check the controller heartbeat LED (1) . If the LED is not blinking, it might indicate a failure. • Check the controller fault LED (2) . If the LED is on, it might indicate a failure. 1 2 15512 Removing the controller 1.
Removing the controller cache module If your system is equipped with a single controller, and you must replace the controller cache: • You must schedule a maintenance window and remove power from the system to perform the service. If your system is equipped with two controllers: • and you are replacing a cache module with another module of the same size, you can remove one of the controllers and replace the module on that controller while the system is running.
WARNING! There is a risk of explosion, fire, or personal injury if the battery pack is incorrectly replaced or mistreated. To reduce the risk: • Do not attempt to recharge the battery outside of the controller. • Do not expose to water or temperatures higher than 60˚C. • Do not abuse, disassemble, crush, puncture, short external contacts, or dispose of in fire or water. • Replace only with the HP spare designated for this product.
15517 5. Lift the pack upward to unhook the top of the battery pack. 15518 6. Wait approximately 15 seconds after removing the old battery to allow the battery charge monitor to reset. NOTE: If you do not wait 15 seconds after removing the old battery, full charge of the new cache battery is delayed. 7. Repeat step 3 through step 6 to remove the second battery pack on this cache module. Installing the controller cache battery pack 1. Remove the array controller as instructed in Removing the controller.
5. After the battery pack is hooked in position, swing the pack downward, ensuring that the bottom clip and two pegs line up with the holes in the cache module. 6. Verify that the top hook (1) and bottom clip (2) on the battery pack are securely attached to the cache module. 1 2 15520 7. Repeatstep 4 through step 6 to install the second battery pack on the module. 8. Install the serviced cache module in the controller as instructed in Installing the controller cache module.
Installing the controller 1. Insert the new or serviced array controller into the MSA chassis controller bay (1). 2. Push the controller in as far as it will go, making sure that the controller is seated, and then press the latch handle inward until it is flush against the front panel (2). 1 2 15493 3. If necessary, power on the MSA as instructed in Connecting the Power and Applying power to the MSA.
Replacing a Fibre Channel I/O module Before you begin CAUTION: • Before removing a component or blanking panel from an operational device, make sure that you have the replacement part available. Removing a component or blank impacts the airflow and cooling ability of the device. To avoid possible overheating, insert the new or replacement component within one or two minutes. If the internal temperature exceeds acceptable limits, the device may overheat and automatically shut down or restart.
CAUTION: Use appropriate precautions when handling Fibre Channel cables: • Touching the end of a Fibre Channel cable will either damage the cable or cause performance problems, including intermittent difficulties accessing the storage. • Whenever a Fibre Channel cable is not connected, replace the protective covers on the ends of the cable. • Make certain that the Fibre Channel cables are installed and supported so that no excess weight is placed on the connectors.
Verifying proper operation After replacing the failed Fibre Channel I/O module verify that: • The Fibre Channel I/O module status LED is solid green. • Verify that the 1-Gb link status is solid green. • Verify that the 2-Gb link status is solid green. • No new error messages are displayed on the array controller LCD panel. Replacing a 2-Gb small form factor pluggable (SFP) transceiver The transceiver is hot-pluggable, so it is not necessary to power down the system to replace it.
CAUTION: To reduce the risk of damage to the equipment, do not use excessive force when inserting the SFP. 2. Remove the protective cover on the cable and remove the dust-plug cover from the replacement SFP. 3. Re-connect the Fibre Channel I/O cable to the SFP.
2 1 15521 Installing the component Slide the replacement fan module into the MSA chassis bay until the module clicks into place. 15522 Verifying proper operation After replacing the fan module, verify that: • The fan starts operating immediately. • The fan Status LED is On. • No new error messages are displayed on the array controller LCD panel.
Verifying component failure Before replacing the SCSI I/O module: • Verify that the module Status LED is off or is blinking amber. • Check the array controller LCD display panel for error messages. Removing the component CAUTION: The SCSI I/O module is not hot-pluggable. Before replacing the module, all I/O from the servers must be stopped and the MSA must be powered down. 1. Review all warnings, cautions, and preparation procedures as detailed in Warnings and precautions.
15497 2. Reconnect all SCSI cables. NOTE: Ensure that all of the SCSI cable connectors are fastened tightly. Verifying proper operation After replacing the failed SCSI I/O module, verify that: • The SCSI I/O module Status LED is solid green. • No new error messages are displayed on the array controller LCD panel. Replacing a power supply module The power supply is hot-pluggable, so it is not necessary to power down the system to replace it.
3. While pressing the power supply module latch (1) to the left, grasp the module handle and pull the defective power supply out of the chassis (2). 1 2 15494 Installing the component 1. Slide the replacement power supply module into the MSA chassis bay until the module clicks into place. 15495 2. Connect the AC power cord to the power supply. Verifying proper operation After replacing the power supply, verify that: • The power supply Status LED is solid green.
7 Enclosure and hard drive migrations This section describes the following: • Adding a new storage enclosure • Migrating existing enclosures • Moving hard drives or arrays NOTE: For the most up-to-date information on supported enclosures, see the MSA website. Adding a new storage enclosure New storage enclosures with unconfigured hard drives are hot-pluggable. Perform the following steps to attach and install a new enclosure to an existing MSA: 1. Install the new storage enclosure in the rack.
4. Attach SCSI cables to the MSA expansion SCSI connectors. 5. Attach the SCSI cables to the expansion storage enclosures. 6. Attach the power cords to the power supplies at the rear of the enclosures. 7. Power on each enclosure and wait for it to complete its startup routine. 8. Power on the MSA array controller system using the power button on the front of the system. 9. Wait for the MSA to complete its startup routine. 10. Power on the host servers and allow them to boot. 11.
8 Capacity expansion and extension Array expansion is the addition of physical hard drives to an array that has already been configured. The capacity of these added physical hard drives may then be added to an existing LUN on the array or configured into a new LUN. LUN extension is the enlargement of an existing LUN after the corresponding array has undergone capacity expansion. Expansions and extensions are performed through the ACU or MSA-CLI.
Capacity expansion and extension
9 Hard drive failures and faulted LUNs The purpose of fault-tolerant array configurations is to protect against data loss due to hard drive failure. Each RAID configuration has inherent limitations on the number of hard drive failures that it can tolerate. If the fault-tolerance level of a particular LUN or array configuration is exceeded, the array will be locked from any further I/O.
• RAID 6 configurations can tolerate simultaneous failure of two hard drives in the array. Compromised fault tolerance Each RAID configuration has inherent limitations on the number of physical hard drive failures that it can tolerate. If more hard drives fail than the fault-tolerance method allows, fault tolerance is compromised. When the MSA determines that the fault tolerance of a LUN is compromised, the LUN is taken offline and subsequent I/O requests are rejected.
b. Make copies of important data, if possible. c. Replace any failed hard drives and allow the Automatic Data Recovery (ADR) process to rebuild the data onto replacement drives. d. After failed hard drives have been replaced, fault tolerance may again be compromised. If so, power cycle the system, and if the #02 or #04 ENABLE VOLUME LCD message is displayed, press the right push button to re-enable the LUNs. 7. If using the ACU: a. Open the ACU and highlight the faulted LUN (shown with a red drive symbol.
• When RAID 6 (ADG) is used, two drives can fail simultaneously (and be replaced simultaneously) without data loss. • If the offline drive is a spare, the degraded drive can be replaced. • Do not remove a failed second hard drive from an array until the first failed or missing hard drive has been replaced and the rebuild process is complete. (When the rebuild is complete, the online LED on the front of the hard drive stops blinking.
Failure of another drive during rebuild If a non-correctable read error occurs on another physical drive in the array during the rebuild process, the Online LED of the replacement drive stops blinking and the rebuild abnormally terminates. If this situation occurs, restart the server and the storage system. The system may temporarily become operational long enough to allow recovery of unsaved data. In any case, locate the faulty drive, replace it, and restore data from backup.
Hard drive failures and faulted LUNs
10 Array controller firmware Each array controller contains Read-Only Memory (ROM), which holds the firmware that operates the controller. In dual-controller configurations, the two controllers must execute the same firmware version.
If the firmware version on the two controllers is not the same, the following prompt is displayed in the controller LCD panel: CLONE FIRMWARE ? ‘<‘ = NO, ‘>' = YES If > is pressed or a response is not made to the prompt within 60 seconds, the following messages are displayed on the LCD panel while firmware is copied from the active controller to the standby controller and the standby controller is automatically restarted: ROM CLONING STARTED ARRAY CONTROLLER RESTARTING STARTUP COMPLETE The two controllers s
The following instructions and examples use HyperTerminal. For more information, see user documentation for your emulator program. a. On the menu bar at the top of the HyperTerminal window, select Transfer > Send File. b. Click Browse and navigate to the location of the firmware image obtained in step 1. c. Expand the Protocol drop-down box and select 1K XModem. d. Click Send. A status window is displayed. 4. Wait for the transfer process to complete. NOTE: This process may take up to 15 minutes.
6. In dual-controller configurations only, if the newly installed firmware on the recovered controller differs from the firmware on the active controller, the CLONE FIRMWARE LCD message is displayed. See Cloning controller firmware.
11 SCSI hard drive firmware SCSI hard drive firmware updates are performed from the bootable ProLiant Firmware Maintenance CD and are required only when the following message is displayed on the MSA controller LCD panel: 85 BAD DRIVE FRMWARE BOX BAY CAUTION: This update procedure must be performed during a scheduled maintenance window. 1. Obtain the ISO image for the ProLiant Firmware Maintenance CD from the following website: http://www.hp.com/support/proliantstorage.
SCSI hard drive firmware
A Regulatory compliance and safety Regulatory compliance Federal Communications Commission notice Part 15 of the Federal Communications Commission (FCC) Rules and Regulations has established Radio Frequency (RF) emission limits to provide an interference-free radio frequency spectrum. Many electronic devices, including computers, generate RF energy incidental to their intended function and are, therefore, covered by these rules.
Declaration of conformity for products marked with the FCC logo, United States only This device complies with Part 15 of the FCC Rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. For questions regarding your product, visit http://www.hp.
European Union notice Products bearing the CE marking comply with the EMC Directive (89/336/EEC) and the Low Voltage Directive (73/23/EEC) issued by the Commission of the European Community and if this product has telecommunication functionality, the R&TTE Directive (1999/5/EC).
Korean notices Safety notices Battery replacement notice Your computer is equipped with a lithium manganese dioxide, a vanadium pentoxide, or an alkaline internal battery or battery pack. There is a danger of explosion and risk of personal injury if the battery is incorrectly replaced or mistreated. Replacement is to be done by an HP authorized service provider using the HP spare part designated for this product.
Taiwan battery recycling notice The Taiwan EPA requires dry battery manufacturing or importing firms in accordance with Article 15 of the Waste Disposal Act to indicate the recovery marks on the batteries used in sales, giveaway or promotion. Contact a qualified Taiwanese recycler for proper battery disposal. Power cords The power cord set must meet the requirements for use in the country where the product was purchased.
• Use a wrist strap connected by a ground cord to a grounded workstation or computer chassis. Wrist straps are flexible straps with a minimum of 1 megohm (±10 percent) resistance in the ground cords. To provide proper ground, wear the strap snug against the skin. • Use heel straps, toe straps, or boot straps at standing workstations. Wear the straps on both feet when standing on conductive floors or dissipating floor mats. • Use conductive field service tools.
Index A AC circuit overload, warning, 67 accumulators, 112 ADU, described, 32 alkaline battery warning, 112 audience, 9 Avis Canadien, regulatory compliance notice, 110 B backplane, replacement, 92 batteries recycling or disposal, 112 replacement notice, 112 warning, 112 Taiwan EPA recycling and disposal, 113 warnings, 80 boot straps, using, 114 BSMI, regulatory compliance notice, 111 C cables distances supported, 13 FCC compliance statement, 110 multi-mode, 13 option kits, 13 part numbers, 13 shielded
fibre subsystem link failure, 58 fibre subsystem link OK, 58 fibre switch hardware failure, 58 firmware flash done, 55 firmware flash done on box #, 55 firmware flash failed, 39 firmware flash failed on box #, 55 firmware flash started, 55 firmware flash started on box #, 55 firmware version, 37 FW version is not supported, 40 hard drive failure box, 48 hard drive hot added box bay, 48 hard drive hot removed box bay, 48 hard drive position change detected, 48 hard drive position charge invalid, 48
device preparation removing power, 72 warnings and precautions, 67 device, unloading from pallet, 66 diagnostic tools, 31 disposal, battery, 112, 113 dissipating floor mats, 114 document conventions, 10 prerequisites, 9 related documentation, 9 documentation HP website, 9 providing feedback, 11 E electrostatic damage prevention, 113 electrostatic discharge See ESD, 64 electrostatic discharge See ESD, 113 electrostatic sensitive parts handling, 64 packaging, 64 storing, 64 transporting, 64 enclosure expans
FCC (Federal Communications Commission) Class A Equipment, compliance notice, 109 Class B Equipment, compliance notice, 109 declaration of conformity, 110 modifications, 110 notice, 109 Federal Communications Commission.
redundancy active standby controller, 41 redundancy halted expand active, 42 redundancy halted firmware cloned, 43 replacement hard drive found box, 47 restarting system, 38 ROM cloning done, 54 ROM cloning started, 54 scanning SCSI subsystem, 39 smart hard drive alert box, 48 spares cleared, 52 startup complete, 37 storage box EMU version, 57 storage box fan hot inserted, 56 storage box fan hot removed, 56 storage box fan OK, 55 storage box hot added, 57 storage box hot removed, 57 storage box power supply
R rack stability, warning, 66 recycling, battery, 112 recycling, Taiwan EPA battery, 113 regulatory compliance information number, 110 notices BSMI, 111 Canada, 110 Class A, 109 Class B, 109 European Union, 111 HP series number, 110 IEC EMC statement, worldwide, 110 Japan, 111 Korean, 112 modifications, 110 shielded cables, 110 related documentation, 9 replacing a power cord, 113 RFI/EMI connector hoods, 110 ROM cloning, 103 S SCSI I/O module LEDs, 29 removing, 89 series number, regulatory compliance, 11