HP A7143A RAID160 SA Controller Support Guide HP-UX 11i v3 Manufacturing Part Number: J6369-90040 February 2007 Printed in the US © Copyright 2007 Hewlett-Packard Development Company L.P.
Legal Notices The information in this document is subject to change without notice. Hewlett-Packard makes no warranty of any kind with regard to this manual, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. Hewlett-Packard shall not be held liable for errors contained herein or direct, indirect, special, incidental or consequential damages in connection with the furnishing, performance, or use of this material.
Contents 1. RAID Technology Overview What is RAID? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The RAID Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Logical Drives and Arrays . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents Determining the RAID160 SA Firmware Version. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Updating the RAID160 SA Firmware . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Physical Disk Firmware . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Determining the RAID160 SA Device File . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents Automatic Data Recovery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . General Aspects of Disk Drive Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Drive Failure During Rebuild . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Minimizing Fatal System Errors During Rebuild. . . . . . . . . . . . . . . . . . . .
Contents 6
Preface: About This Document This document describes how to install, configure, and troubleshoot the HP A7143A PCI 4-Channel RAID160 SA SCSI controller on HP-UX 11iv3 64-bit platforms. The latest version of this document can be found on line at docs.hp.com. Intended Audience This document is intended for system and network administrators responsible for installing, configuring, and managing fault tolerant data storage.
Chapter 6 Troubleshooting This chapter provides information of the different troubleshooting tools available for the RAID160 SA controller including using the sautil command for troubleshooting. Appendix B Controller Specifications Appendix C Hard Drive Installation and Replacement Appendix A Probability of Logical Drive Failure Appendix D Electrostatic Discharge Appendix E Questions and Answers Typographical Conventions This document uses the following conventions.
Other documents in this collection include: HP A7143A RAID160 SA Controller Release Notes HP A7143A RAID160 SA Controller Installation Guide HP Smart Array Support Matrix HP Encourages Your Comments HP encourages your comments concerning this document. We are committed to providing documentation that meets your needs. Please send comments to: netinfo_feedback@cup.hp.
1 RAID Technology Overview This chapter provides an overview of RAID technology and descriptions of the different RAID levels that are supported by the HP Smart Array 6400 Series Controllers.
RAID Technology Overview What is RAID? “Summary of RAID Methods” on page 20 “Choosing a RAID Method” on page 21 NOTE If you are ready to install the Smart Array Controller and you are familiar with RAID concepts and the RAID levels supported by the Smart Array Controllers, proceed to Chapter 4, “Installing the RAID160 SA Controller,” on page 33.
RAID Technology Overview What is RAID? The RAID Concept The RAID study proposed a multilevel concept for improved data input/output performance (by combining multiple physical drives) and improved data availability (by avoiding the impact of disk drive failures). Five original RAID configurations, or “levels” (RAID 1 through RAID 5), were defined to meet the needs of various computing environments. As the five original RAID configurations progress from RAID 1 through RAID 5, data redundancy increases.
RAID Technology Overview Logical Drives and Arrays Logical Drives and Arrays The group of physical disks containing the logical drive is called an array (or drive array). Since all the physical disks in an array are commonly configured into a single logical drive, the term array is also used as a synonym for logical drive. Without an array controller, connecting extra disk drives to a system increases the total storage capacity.
RAID Technology Overview Logical Drives and Arrays Figure 1-3 Data Striping (S1-S4) of Data Blocks B1-B12 S1 B1 B2 B3 S2 B4 B5 B6 S3 B7 B8 B9 S4 B10 B11 B12 For data in the logical drive to be readable, the data block sequence must be the same in every stripe. This sequencing process is performed by the Smart Array Controller, which sends the data blocks to the disk drive, writing the heads in the correct order.
RAID Technology Overview Smart Array Controller Supported RAID Configurations Smart Array Controller Supported RAID Configurations RAID 0—No Fault Tolerance The RAID 0 configuration enhances performance with data striping, but there is no data redundancy to protect against data loss when a physical disk fails. RAID 0 is useful for rapid storage of large amounts of non-critical data (for printing or image editing, for example), or when cost is the most important consideration (see Figure 1-4 on page 16).
RAID Technology Overview Smart Array Controller Supported RAID Configurations Figure 1-5 Disk Drive Mirroring of P1 onto P2 (RAID 1) B1 B1 B2 B2 B3 B3 B4 B4 P1 P2 The advantages of RAID 1 are as follows: • No data loss or interruption of service if a disk fails. • Fast read performance — data is available from either disk.
RAID Technology Overview Smart Array Controller Supported RAID Configurations This fault-tolerance method is useful when high performance and data protection are more important than the cost of physical disks. The advantages of RAID 1+0 are as follows: • Highest read and write performance of any fault-tolerant configuration. • No loss of data as long as no of failed disks are mirrored to any other failed disk (up to half of the physical disks in the array can fail).
RAID Technology Overview Smart Array Controller Supported RAID Configurations RAID ADG RAID Advanced Data Guarding (ADG) is similar to RAID 5 in that parity data is generated and stored to protect against data loss caused by physical disk failure. However, with RAID ADG two different sets of parity data are generated for each data block on a stripe. The two parity data blocks are stored on different physical disks, allowing data to be preserved even if two physical disks fail simultaneously.
RAID Technology Overview Summary of RAID Methods Summary of RAID Methods Table 1-1 summarizes the important features of the different RAID configurations that are supported by the Smart Array Controllers. The decision chart in Table 1-2 on page 21 can help you determine which option is best for your computing environment.
RAID Technology Overview Choosing a RAID Method Choosing a RAID Method Table 1-2 summarizes the advantages of each RAID method. Use this table to select the best RAID method for your needs.
RAID Technology Overview Choosing a RAID Method 22 Chapter 1
2 RAID160 SA Controller Overview This chapter contains the following sections providing an overview of the RAID160 SA controller features and functionality: Chapter 2 23
RAID160 SA Controller Overview Board Components and Features Board Components and Features The RAID160 SA controller has four Wide Ultra160 SCSI channels and 256 MB of cache. Figure 2-1 HP A7143A RAID160 SA controller 1 3 2 Table 2-1 Item Hardware Overview Description 1 Two internal 68-pin Wide SCSI connectors (channel 1 nearer the bracket, channel 2 nearer the board center). These internal connectors are not supported on the HP A7143A RAID160 SA controller.
RAID160 SA Controller Overview Board Components and Features Figure 2-2 Array Accelerator Cache with Batteries For detailed controller board specifications, refer to Appendix B, “Controller Specifications”.
RAID160 SA Controller Overview Board Components and Features The RAID160 SA controller also uses cache to complete drive write operations more quickly. This use of the cache has further performance benefits: • If the system requires data that still resides in the write cache, the controller delivers this data from the cache. This process is quicker than delivering the data from a drive. • If the system writes new data to the same location, the controller overwrites the cache contents.
RAID160 SA Controller Overview Board Components and Features IMPORTANT The batteries on a new RAID160 SA controller may have a low charge when the controller board is first installed. No action is required on your part, since the internal circuitry automatically recharges the batteries and enables the cache. The recharge process takes less than four hours. The controller will function properly during this time, but without the performance advantage of the array accelerator.
RAID160 SA Controller Overview Board Components and Features NOTE An online spare does not become active and start rebuilding when the imminent failure alert is sent, because the degraded disk has not actually failed yet and is still online. The online spare is activated only after a disk in the array has failed. • Drive failure alert features cause an alert message to be sent to Event Monitoring Services (EMS) when physical disk or logical drive failure occurs.
3 Installation Overview This chapter provides information to help you plan the installation and configuration of the RAID160 SA controller.
Installation Overview Overview of Installation Steps NOTE Complete Smart Array documentation (including the Smart Array Support matrix) is available on http://www.docs.hp.com under the I/O Cards and Networking Software section in the Smart Array (RAID) category. NOTE To view a detailed list of support accessories visit the system configuration site at: http://h30099.www3.hp.com/configurator/index.asp Overview of Installation Steps Installation steps for the RAID160 SA controller are as follows: Step 1.
Installation Overview Installation Prerequisites for the RAID160 SA Controller Installation Prerequisites for the RAID160 SA Controller Before installing the RAID160 SA controller, check to make sure the following hardware and software prerequisites have been met: ✓ Read the RAID160 SA controller Release Note, so you can check for any known problems, required patches, or other information you need for installation. ✓ Plan your disk configurations (see “Planning Disk Configurations” on page 30).
Installation Overview Installation Prerequisites for the RAID160 SA Controller 32 Chapter 3
4 Installing the RAID160 SA Controller This chapter contains the following sections detailing hardware, software, and firmware installation for the RAID160 SA controller: Chapter 4 33
Installing the RAID160 SA Controller Locating Software for the RAID160 SA Controller Locating Software for the RAID160 SA Controller The drivers, utilities and manpages for the RAID160 SA controller are located at the HP Software Depot. • Go to http://www.software.hp.
Installing the RAID160 SA Controller Installing Software for the RAID160 SA Controller Installing Software for the RAID160 SA Controller The drivers, utilities and manpages for the RAID160 SA controller are contained in the RAID-01 bundle located in the “pci 4-channel raid160 sa scsi controller” depot, see “Locating Software for the RAID160 SA Controller”. They can be installed using the Software Distributor (SD). The SD is a tool for installing software on HP-UX host systems.
Installing the RAID160 SA Controller Preparing for RAID160 SA Controller Installation Preparing for RAID160 SA Controller Installation Before installing a RAID160 SA controller in a server, backup all data. This step is mandatory if the disks were not previously configured on a RAID160 SA controller, because the data format will not be recognized and the existing data will not be preserved.
Installing the RAID160 SA Controller Preparing for RAID160 SA Controller Installation • Handle the controller only by the edges. Do not touch electronic components or electrical traces • If you must lay the controller down, place it on a non-conductive mat or surface Before beginning installation, and without removing the RAID160 SA controller from its antistatic bag, inspect the controller for any signs of obvious damage, such as chipped or loose components. Contact HP if the controller is damaged.
Installing the RAID160 SA Controller Installing the RAID160 SA Controller Hardware (Offline) Installing the RAID160 SA Controller Hardware (Offline) Follow these steps to perform an offline installation of the RAID160 SA Controller: Step 1. Disconnect any peripheral devices from the server. Step 2. Remove or open the access panel on the server. Step 3. Select an available 66-MHz PCI slot. Slots that use a 64-bit interface may provide higher performance. Step 4.
Installing the RAID160 SA Controller Online Addition and Replacement Online Addition and Replacement Online Addition and Replacement (OL*) enables the Smart Array Controller to be added online to HP-UX systems that support OL*; it is not necessary to power off and then reboot the system to add or replace a Smart Array Controller.
Installing the RAID160 SA Controller Online Installation of the RAID160 SA Controller Online Installation of the RAID160 SA Controller If your system has one or more slots that support OLAR—and you want to use OLAR to install the controller in one of those slots—follow these steps: Step 1. Be sure version B.11.31.01 or later of the ciss driver (RAID160 SA software) is installed on your system. See “Locating Software for the RAID160 SA Controller” on page 34. Step 2.
Installing the RAID160 SA Controller RAID160 SA Controller OL* Error Recovery RAID160 SA Controller OL* Error Recovery This section explains how to recover: • If you are attempting an online addition of a RAID160 SA controller and the SCSI cables are connected before the PCI slot is powered on • If you are attempting an online replacement of a RAID160 SA Controller and the SCSI cable(s) are not disconnected, before the PCI slot is powered off • If you are attempting an online replacement of a RAID160
Installing the RAID160 SA Controller RAID160 SA Controller OL* Error Recovery Figure 4-3 SCSI Cable Reconnect Note At this point you reset the RAID160 SA controller and the online replacement is complete. If the error message in Figure 4-4 appears, before the online replacement procedure is completed, proceed to the next section “OL* Error Recovery”.
Installing the RAID160 SA Controller RAID160 SA Controller OL* Error Recovery Figure 4-6 SCSI Cable Reconnect Note 4. Now connect the SCSI cable(s) to the controller, click “OK”, and run sautil /dev/cissxx reset_adapter. After the reset operation is completed, the controller will be operational.
Installing the RAID160 SA Controller Connecting the Cables and the StorageWorks‘ Disk Enclosures Connecting the Cables and the StorageWorks™ Disk Enclosures • Each channel on the RAID160 SA controller supports up to 14 drives • Ports 1 and 2 each have two SCSI connectors, one for external storage units and one for internal hard drives in the server • The internal and external connectors on channel 1 cannot be used simultaneously • The internal and external connectors on channel 2 cannot be used simu
Installing the RAID160 SA Controller Connecting the Cables and the StorageWorks‘ Disk Enclosures Figure 4-7 Identifying SCSI Cable Connectors 2 3 1 4 Table 4-1 Cable Connector Descriptions (see Figure 3-2 above) Item Description 1 External 68-pin Wide 2 External offset VHDCI 3 Internal 50-pin narrow (Internal connections are not supported on the RAID160 SA controller) 4 Internal 68-pin Wide (Internal connections are not supported on the RAID160 SA controller) Table 4-2 Supported External
Installing the RAID160 SA Controller Connecting the Cables and the StorageWorks‘ Disk Enclosures NOTE IMPORTANT: Offset VHDCI cables must be used with the RAID160 SA controller. Early versions of the VHDCI cables do not accommodate side-by-side connection to the RAID160 SA controller. If your storage enclosure did not include the Offset VHDCI cables, you may need to order them. See Table 4-2 above, for part numbers.
Installing the RAID160 SA Controller Verifying the Installation Verifying the Installation After the system reboots, verify that the installation was successful by following these steps: Step 1. Issue the swlist command: swlist If the RAID160 SA controller is installed correctly, the generated output will look similar to this: # swlist # Initializing... # Contacting target "hpfcs681"... # # Target: hpfcs681:/ # # # Bundle(s): # B3899BA (S700) B.11.11.06 HP C/ANSI C Developer’s Bundle for HP-UX 11.
Installing the RAID160 SA Controller Verifying the Installation HWEnable11i 2003 B.11.11.0306.4 Hardware Enablement Patches for HP-UX 11i, June OnlineDiag B.11.11.12.01 HPUX 11.11 Support Tools Bundle, Jun 2003 RAID-00 B.11.11.01 PCI RAID; Supptd HW=A5856A RAID-01 B.11.11.%08 RAID160; Supptd HW=A7143A perl B.5.6.1.C Perl Programming Language # # Product(s) not contained in a Bundle: # BullseyeCover11 6.1.6 HP-UX 11 LIF-LOAD B.11.11.07.
Installing the RAID160 SA Controller RAID160 SA Controller Firmware RAID160 SA Controller Firmware The sautil command is used to confirm and update the firmware version that is currently installed on the RAID160 SA controller. HP recommends that the latest firmware version be installed. This section of the RAID160 Support Guide focuses on the sautil command options that are used to confirm, or change, RAID160 SA controller firmware.
Installing the RAID160 SA Controller RAID160 SA Controller Firmware where "" is the adapter’s device file, e.g., /dev/ciss5 "-s" (optional) displays a shorter output "" is the firmware image file to download ":" is the physical disk to update, e.g., 4:12 Notes: - This utility requires superuser privilege. - See the sautil man page for tool details.
Installing the RAID160 SA Controller RAID160 SA Controller Firmware An example of the lengthy sautil command output is provided in “The sautil Command” on page 80 in the Troubleshooting chapter. Issuing the sautil -s command provides an abbreviated listing of information for the RAID160 SA controller and all connected devices, including the firmware version currently in ROM.
Installing the RAID160 SA Controller RAID160 SA Controller Firmware Array Accelerator Board Present?.... yes Cache Configuration Status.......... cache enabled Cache Ratio......................... 50% Read / 50% Write Total Cache Size (MB)............... 256 Battery Pack Count.................. 2 Battery Status (pack #1)............ ok Battery Status (pack #2)............
Installing the RAID160 SA Controller RAID160 SA Controller Firmware Device Firmware Version.......................... B016 SCSI Transfer Rate............................... Ultra 160 ---- SCSI DEVICE 1:1 --------------------------------------------------------- Channel Number................................... 1 SCSI ID.......................................... 1 Device Type...................................... DISK Disk Capacity.................................... 18.2 GB Device Status.................
Installing the RAID160 SA Controller RAID160 SA Controller Firmware Device Product ID................................ BD01864544 Device Serial Number............................. 342230710222 Device Firmware Version.......................... B20B SCSI Transfer Rate............................... Ultra 160 ---- SCSI DEVICE 1:7 --------------------------------------------------------- Channel Number................................... 1 SCSI ID.......................................... 7 Device Type........
Installing the RAID160 SA Controller RAID160 SA Controller Firmware | * * | * * * | * | | | | * | * * (Ch 3)---> * * | * * | * * | * * | * * <--- (Ch 4) * * | * | | | | | ~ ~ ~ ~ ~ ****************************************************************************** **** End of SAUTIL Output **** ****************************************************************************** The firmware version that is in the ROM on the RAID160 SA controller (/dev/ciss6), is version 2.92.
Installing the RAID160 SA Controller RAID160 SA Controller Firmware **** version A.01.01 **** **** **** **** (c) Copyright 2003 Hewlett-Packard Development Company, L.P. **** ****************************************************************************** ---- DRIVER INFORMATION ------------------------------------------------------ Driver State........................ READY ---- ADAPTER INFORMATION ----------------------------------------------------- Adapter.............................
Installing the RAID160 SA Controller RAID160 SA Controller Firmware Sending the new firmware to the adapter......................[Done] Activating the new firmware..................................[Done] Resetting the adapter (this could take up to 2 minutes)......[Done] Retrieving the firmware revision string from ROM.............[Done] Current Revision (in ROM)....... 3.
Installing the RAID160 SA Controller Physical Disk Firmware Physical Disk Firmware The sautil command is used to confirm and update the firmware version on each of the physical disks in the SmartArray™ disk enclosure that the RAID160 SA controller is attached to. This section of the RAID160 Support Guide focuses on the sautil command options that are used to confirm, or change, physical disk firmware.
Installing the RAID160 SA Controller Physical Disk Firmware "" is the firmware image file to download ":" is the physical disk to update, e.g., 4:12 Notes: - This utility requires superuser privilege. - See the sautil man page for tool details.
Installing the RAID160 SA Controller Physical Disk Firmware An example of the lengthy sautil command output is provided in “The sautil Command” on page 80 in the Troubleshooting chapter. Issuing the sautil -s command provides an abbreviated listing of information for the RAID160 SA controller and all connected devices including a SCSI Device Summary that lists the SCSI channel and the SCSI ID for each of the physical disks attached to the RAID160 SA controller.
Installing the RAID160 SA Controller Physical Disk Firmware ---- ARRAY ACCELERATOR (CACHE) INFORMATION ----------------------------------- Array Accelerator Board Present?.... yes Cache Configuration Status.......... cache enabled Cache Ratio......................... 50% Read / 50% Write Total Cache Size (MB)............... 256 Battery Pack Count.................. 2 Battery Status (pack #1)............ ok Battery Status (pack #2)............
Installing the RAID160 SA Controller Physical Disk Firmware Device Product ID................................ BD0186459A Device Serial Number............................. UFL0P14000K90117 Device Firmware Version.......................... HPB3 SCSI Transfer Rate............................... Ultra 160 ---- SCSI DEVICE 1:1 --------------------------------------------------------- Channel Number................................... 1 SCSI ID.......................................... 1 Device Type............
Installing the RAID160 SA Controller Physical Disk Firmware Device Status.................................... SPARE Device Vendor ID................................. COMPAQ Device Product ID................................ BD01864544 Device Serial Number............................. 342230710222 Device Firmware Version.......................... HPB3 SCSI Transfer Rate............................... Ultra 160 ---- SCSI DEVICE 1:7 --------------------------------------------------------- Channel Number....
Installing the RAID160 SA Controller Physical Disk Firmware ****************************************************************************** **** End of SAUTIL Output **** ****************************************************************************** In the example above, the SCSI Device Summary lists the SCSI channel and the SCSI ID for each of the physical disks attached to the RAID160 SA controller.
Installing the RAID160 SA Controller Physical Disk Firmware **** **** **** version A.01.01 **** **** **** **** (c) Copyright 2003 Hewlett-Packard Development Company, L.P. **** ****************************************************************************** ---- DRIVER INFORMATION ------------------------------------------------------ Driver State........................ READY ---- ADAPTER INFORMATION ----------------------------------------------------- Adapter.............................
Installing the RAID160 SA Controller Physical Disk Firmware Resetting the adapter (this could take up to 2 minutes).... [Done] Retrieving the firmware revision string from ROM........... [Done] Current Revision (in ROM).......
Installing the RAID160 SA Controller Setting up a RAID160 SA Controller As a Boot Device Setting up a RAID160 SA Controller As a Boot Device The RAID160 SA controller can be set up as a boot device. To do this, you actually set up a logical drive on the controller as a boot device. On each RAID160 SA controller, any number of its logical drives can be set up as boot devices.
Installing the RAID160 SA Controller Setting up a RAID160 SA Controller As a Boot Device Welcome to Ignite-UX! Use the key to navigate between fields, and the arrow keys within fields. Use the key to select an item. Use the or to pop-up a choices list. If the menus are not clear, select the "Help" item for more information.
Installing the RAID160 SA Controller Setting up a RAID160 SA Controller As a Boot Device * Loading in a shell... NOTE: Pushing a shell, you will have to use "reboot" to reboot the system when done. Use the "loadfile" command to bring in more commands as you need them. # Step 3. At the recovery shell prompt, enter: loadfile -l INSTCMDS/opt/raidsa/bin/ciss_insf # loadfile -l INSTCMDS /opt/raidsa/bin/ciss_insf # Step 4.
Installing the RAID160 SA Controller Setting up a RAID160 SA Controller As a Boot Device 70 Chapter 4
5 Configuring the RAID160 SA Controller This chapter contains the following sections detailing the use of the saconfig command to configure the RAID160 SA controller: Chapter 5 71
Configuring the RAID160 SA Controller Planning the RAID Configuration Planning the RAID Configuration If you don’t already know exactly how you want to configure RAID on your system, consider the following steps. Step 1. Determine the RAID level you want to use for each logical drive (see Chapter 1 of the HP A7143A RAID160 Support Guide. Step 2. Draw the physical disk configuration. You should consider these things: • Consider having one or more of your logical drives spread across controller channels.
Configuring the RAID160 SA Controller saconfig Configuration CLI saconfig Configuration CLI The saconfig command is used to: • Configure Logical Drives • Set the RAID level • Set the stripe size on a RAID • Set up on-line spare disk drives • Delete logical drives • Clear the configuration You will need a superuser ID to run the saconfig command.
Configuring the RAID160 SA Controller saconfig Configuration CLI To delete logical drive saconfig /dev/cissX -D To clear configuration saconfig /dev/cissX -D all To add spare drive to existing logical drive saconfig /dev/cissX -A -s To delete a spare drive of a logical drive saconfig /dev/cissX -D -s To delete all spare drives of a logical drive saconfig /dev/cissX -D -s all To change the rebuild prio
Configuring the RAID160 SA Controller saconfig Configuration CLI The device file for the RAID160 SA controller in the example above is /dev/ciss6. Displaying the RAID160 SA Configuration To display the physical disks connected to a RAID160 SA controller, as well as any logical drives and spare drives that have been configured, issue the saconfig command, where represents the device file for the RAID160 SA controller.
Configuring the RAID160 SA Controller saconfig Configuration CLI For example, if the rebuild priority needs to be set to high for the logical drives under controller ciss7 use the command # saconfig /dev/ciss7 -r high Specify Percentage of Cache Used for Read Caching The saconfig /dev/cissX -C command is used to specify a percentage of cache to be used exclusively for read caching. The can be: 0, 25, 50, 75, or 100 percent of the cache.
6 Troubleshooting This chapter contains the following sections providing troubleshooting options available for the RAID160 SA controller: Chapter 6 77
Troubleshooting HP Support Tools Manager (STM) HP Support Tools Manager (STM) STM is a software application that can be run from the console to obtain status and descriptive information about the RAID160 SA controller. STM can also be used to diagnose problems. STM version A.42.00, or later, must be used with the RAID160 SA controller. For detailed information on STM, go to http://www.docs.hp.com , click on “Browse by Topic”, then click on “Diagnostics”.
Troubleshooting ODE ODE The RAID160 SA controller supports HP’s Offline Diagnostics Environment (ODE). ODE is an offline support tools platform for troubleshooting systems that are running without an operating system or systems that cannot be tested using online tools. The offline environment is also useful for testing that needs to be done before a system is booted. ODE provides a user-friendly interface for diagnostics and utilities that have been developed to run in this environment.
Troubleshooting The sautil Command The sautil Command Troubleshooting with sautil NOTE Before troubleshooting with the sautil command, run STM. The diagnostic information STM provides will augment the information provided by the sautil command. See “HP Support Tools Manager (STM)” on page 78, for details.
Troubleshooting The sautil Command Notes: - You must be a privileged user to execute this command. See privileges(5) for more information on privileged users. - See sautil(1M) for tool details. This section of the RAID160 Support Guide focuses on sautil command options that are used for online troubleshooting of the RAID160 SA controller.
Troubleshooting The sautil Command sautil The sautil command output provides detailed information on the RAID 160 SA controller, any logical drives that have been configured, and all SCSI devices that are attached to the controller. There is a -s option for this command that provides a shorter and less detailed output. An example of the sautil -s command output can be found in “RAID160 SA Controller Firmware” on page 49.
Troubleshooting The sautil Command Battery Status (pack #2)............ ok ---- LOGICAL DRIVE SUMMARY --------------------------------------------------# RAID Size Status 0 1+0 17365 MB OK ---- SCSI DEVICE SUMMARY ----------------------------------------------------Ch ID 1 1 1 1 1 0 1 2 3 7 Type DISK DISK DISK DISK PROCESSOR Capacity 18.2 18.2 18.2 18.
Troubleshooting The sautil Command Synchronous (Fast/Ultra) transfer enabled...... Narrow disk tray detected...................... Wide transfer failed, reverted to narrow....... Ultra SCSI transfer enabled.................... Ultra-2 SCSI transfer enabled.................. yes no no yes yes S.M.A.R.T. supported........................... S.M.A.R.T. errors (in factory M&P data)........ S.M.A.R.T. enabled............................. S.M.A.R.T. errors (in powerup M&P data)........
Troubleshooting The sautil Command Running in DT clocking mode.................... Supports DT clocking mode only................. Supports DT and ST clocking modes.............. Running in U320 mode........................... Packetized SCSI supported...................... Supports redundant adapter operation........... Disk write cache enabled in current page....... Disk write cache disabled in default page...... yes no yes no no yes no yes QAS capable (U320 only)........................
Troubleshooting The sautil Command ---- SCSI DEVICE 1:3 --------------------------------------------------------Channel Number................................... SCSI ID.......................................... Device Type...................................... Disk Capacity.................................... Device Status.................................... Device Vendor ID................................. Device Product ID................................ Device Serial Number.............................
Troubleshooting The sautil Command | | | | | | | | (Ch 1)---> | | | | | (Ch 3)---> | | | | ~ | <--- LOOK FOR THIS LOGO | | | | | | <--- (Ch 2) | | | | | <--- (Ch 4) | | | | ULTRA-3 / \ <\ / * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * ~ ~ ~ ~ ****************************************************************************** **** End of SAUTIL Output **** ****************************************************************************** # Logical Drive State Definitions The status colu
Troubleshooting The sautil Command data loss may result if a second disk should fail. For RAID ADG, data loss may result if two additional disks should fail. READY FOR RECOVERY OPERATION A replacement disk is present, but rebuild hasn’t started yet (another logical drive may be currently rebuilding). The logical drive will also return to this state if the rebuild had been aborted due to unrecoverable read errors from another disk.
Troubleshooting The sautil Command QUEUED FOR EXPANSION The logical drive is waiting to undergo data reorganization (see EXPANDING above). Possible causes for the delay are a rebuild or expansion operation may already be in progress.
Troubleshooting The sautil Command Physical Disk State Definitions The status column of the SCSI Device Summary in the sautil command output example also provides useful troubleshooting information. Refer to the following physical disk state definitions to interpret the status of physical disks (the physical disk state definitions are also included in the sautil manpage). Physical Disk State Definitions: OK The physical disk is configured in one or more logical drives and is operational.
Troubleshooting The sautil Command sautil accept_media_xchg The sautil accept_media_xchg command instructs the RAID160 SA controller to: 1. Set the state of the specified failed logical drive, , to "OK" 2. Set the states of all failed physical disks that have been replaced via hot-plug exchanges to "OK" WARNING This option preserves the RAID configuration, logical drive configurations, and controller settings.
Troubleshooting The sautil Command Logical drive specified: 0 WARNING: The status of logical drive 0 will be set to "OK". The status of all physical disks that have been replaced on this logical drive will also be set to "OK". While this option preserves the RAID configuration (logical drive configurations, adapter settings, etc.), data on the failed logical drive may have already been compromised.
A Probability of Logical Drive Failure Appendix A 93
Probability of Logical Drive Failure RAID Level and Probability of Drive Failure RAID Level and Probability of Drive Failure The probability that a logical drive will fail depends on the RAID level setting. • A RAID 0 logical drive fails if only one physical disk fails. • For a RAID 1+0 logical drive, the failure situation is complex. — The maximum number of physical disks that can fail without causing failure of the logical drive is n/2, where n is the number of physical disks in the array.
Probability of Logical Drive Failure RAID Level and Probability of Drive Failure Figure A-1 Appendix A Relative Probability of Logical Drive Failure 95
Probability of Logical Drive Failure RAID Level and Probability of Drive Failure 96 Appendix A
B Controller Specifications Appendix B 97
Controller Specifications RAID160 SA Controller Specifications RAID160 SA Controller Specifications Table B-1 RAID160 SA Specifications Dimensions 31.5 cm x 10.8 cm x 1.5 cm (12.4 in x 4.3 in x 0.6 in) Power Required 24.9 W (19.0 W at 3.3 V, 5.9 W at 5 V) Battery 2 NiMH battery packs, rechargeable. PCI Specification The RAID160 SA controller complies with PCI revision 2.
C Hard Drive Installation and Replacement Appendix C 99
Hard Drive Installation and Replacement Background Background Each SCSI channel on the RAID160 SA controller supports up to 14 physical disks. Disks can be of the Ultra320 or the Ultra160 type. Each physical disk on a SCSI bus must have a unique ID value in the range of 0 to 15 (except ID 7, which is typically reserved for controller use). This value is set automatically on hot-pluggable disk drives in the storage systems that are supported by the RAID160 SA controller. • Do not terminate the disk drives.
Hard Drive Installation and Replacement General Information About Hard Drive Failure General Information About Hard Drive Failure When a physical disk fails, the logical drive it belongs to will be affected. Each logical drive connected to a RAID160 SA controller can be configured with be a different RAID level. For this reason, each logical drive may be affected differently by a physical disk failure, depending on the fault-tolerance method (RAID level) that has been configured on it.
Hard Drive Installation and Replacement General Information About Hard Drive Failure Table C-1 (1) Activity Disk Drive Status from LED Illumination Pattern (2) Online (3) Fault Meaning On, Off, or Flashing On or Off Flashing A predictive failure alert has been received for this disk drive. Replace the drive as soon as possible. On, Off, or Flashing On Off OK to replace the disk drive online if the array is configured for fault tolerance and all other drives in the array are online.
Hard Drive Installation and Replacement General Information About Hard Drive Failure • The amber LED lights up on the front of supported StorageWorks™ disk enclosures, if failed drives are inside. (Other problems such as fan failure, redundant power supply failure, or over-temperature conditions, will also cause this LED to light up.) • Event Monitoring Services (EMS) will cause an alert message to be sent when physical or logical drive failure occurs.
Hard Drive Installation and Replacement Compromised Fault Tolerance Compromised Fault Tolerance Compromised fault tolerance commonly occurs when more physical disks have failed than the fault-tolerance method can endure. In this case, the logical volume is failed and unrecoverable disk error messages are returned to the host. Data loss is likely to occur.
Hard Drive Installation and Replacement Automatic Data Recovery Automatic Data Recovery Automatic data recovery is an automatic background process that rebuilds data onto a spare or replacement disk drive when another disk drive in the array fails. This process is also called a rebuild. If a disk drive in a fault-tolerant configuration is replaced while the system power is off, a message is displayed during the next system startup.
Hard Drive Installation and Replacement General Aspects of Disk Drive Replacement General Aspects of Disk Drive Replacement CAUTION Sometimes, a disk drive that has previously been failed by the controller may seem to be operational after the system is power-cycled, or (for a hot-pluggable drive) after the drive has been removed and reinserted. However, continued use of such marginal drives may eventually result in data loss. Replace the marginal drive as soon as possible.
Hard Drive Installation and Replacement Drive Failure During Rebuild Drive Failure During Rebuild If another drive in the array fails while fault tolerance is unavailable during rebuild, a fatal system error may occur. If this happens, all data on the array is lost. In exceptional cases, however, failure of another drive need not lead to a fatal system error.
Hard Drive Installation and Replacement Drive Failure During Rebuild 108 Appendix C
D Electrostatic Discharge Appendix D 109
Electrostatic Discharge Preventing System Damage Preventing System Damage Handling Parts To prevent damaging the system, be aware of the precautions you need to follow when setting up the system or handling parts. A discharge of static electricity from a finger or other conductor may damage system boards or other static-sensitive devices. This type of damage may reduce the life expectancy of the device.
E Questions and Answers Appendix E 111
Questions and Answers RAID160 SA Controller Q & A RAID160 SA Controller Q & A Q:How many RAID160 SA controllers can I install in my system? A:The maximum number of controllers that you can install in your system depends on your server and several other factors specific to your configuration. Generally, the maximum number is restricted to the number of PCI slots not used for other peripherals. Another limiting factor is the power rating of your system. Each RAID160 SA controller requires 24.9 W.
Glossary ACU (Array Configuration Utility) A configuration utility useful both for novices and for more experienced RAID users. ADU (Array Diagnostic Utility) A diagnostic tool that collects comprehensive information about the array controllers in a system and lists any problems detected. ARM (Auto-Reliability Monitoring) Also known as surface analysis. A fault management feature whereby hard drives are scanned for bad sectors, and data in the faulty sectors is remapped onto good sectors.
Glossary LVD (low voltage differential) LVD (low voltage differential) A type of SCSI signaling that allows a maximum transfer rate of either 80 MB/s or 160 MB/s, conforming to either the Wide Ultra2 or Wide Ultra160 SCSI standards respectively. online spare Also known as a hot spare, this is a drive in a fault-tolerant system that normally contains no data. When any other drive in the array fails, the controller automatically rebuilds the missing data that was on the failed drive onto the online spare.