HP StorageWorks 1510i Modular Smart Array installation and user guide (383070-002, July 2008)

Recognizing and
recovering from hard drive failures and
faulted LUNs
The purpose of fault-tolerant array congurations is to protect against data loss due to hard drive failure.
Each RAID conguration has inherent limitations on the number of hard drive failures that it can tolerate.
If the fault-tolerance level of a particular LUN or array conguration is exceeded, the array will be locked
from any further I/O. This protection is designed to preserve the integrity of the local drive, but does
require manual intervention to recover or re-enable the LUN.
Although con
troller rmware is designed to protect against normal hard drive failure, it is imperative that
you perform t
he correct actions to recover from a hard drive failure without inadvertently introducing any
additional h
ard drive failures.
Included sections:
Recognizing hard drive failure
Compromise
dfaulttolerance
Recovering from compromised fault tolerance (enabling failed LUNs)
Automatic data recovery (rebuild)
•Replacing
aharddrive
Recognizing hard drive failure
LEDsonthefrontofeachharddrivearevisiblefromthefrontoftheexternalstorageunit.Whenahard
drive is congured as a part of an array and attached to a powered-on controller, the status of the hard
drive can be determined from the illumination pattern of these LEDs.
For detailed descriptions of the various LED combinations, see Hard drive LEDs.
Other ways to determine that a hard drive has failed include the following:
LEDs on the storage system chassis illuminate amber if failed hard drives are inside. (However, this
LED also illuminates when other problems occur, such as when a fan or a redundant power supply
fails, or when the system overheats.)
LEDs on the hard drives illuminate amber if a hard drive has failed or is a member of a faulted LUN.
Front-panel LCD display messages list faulted LUNs and failed hard drives whenever the system is
restarted, as long as the controller detects one or more good hard drives.
The ACU and SMU represent faulted LUNs and failed drives with distinctive icons.
HP-SIM can detect failed hard drives.
ADU lists all failed hard drives.
For more information on troubleshooting hard drive problems, see the HP ProLiant ser vers troubleshooting
guide.
Effects of hard drive failure
When a hard drive fails, all logical drives that are in the same array are affected. Each logical drive in an
array may be using a different fault-tolerance method, so each logical drive can be affected differently.
RAID 0
congurations cannot tolerate hard drive failure. If any physical hard drive in the array
fail
s, all non-fault-tolerant (RAID 0) LUNs in the same array also are failed.
RAID 1 and RAID 1+0 congurations can tolerate multiple hard drive failures, as long as none of
the failed hard drives are mirrored to one another.
RAID 5 congurations can tolerate one hard drive failure.
RAI
D6congurations can tolerate simultaneous failure of two hard drives in the array.
1510i Modular Smart Array installation and user guide
93