HP Smart Array Controllers on HP ProLiant Servers User Guide

Drive procedures 82
RAID 6 configurations can tolerate two failed drives at a given time.
RAID 60 configurations can tolerate two failed drives in each parity group.
RAID 1 (ADM) and RAID 10 (ADM) configurations can tolerate multiple drive failures if no more than
two drives, mirrored to one another, fail.
Compromised fault tolerance
CAUTION: When fault tolerance is compromised, data loss can occur. However, it may be
possible to recover the data. For more information, see "Recovering from compromised fault
tolerance (on page 82)."
If more drives fail than the fault-tolerance method can manage, fault tolerance is compromised, and the
logical drive fails. If this failure occurs, the operating system rejects all requests and indicates unrecoverable
errors.
For example, fault tolerance might occur when a drive in an array fails while another drive in the array is
being rebuilt.
Compromised fault tolerance can also be caused by problems unrelated to drives. In such cases, replacing
the physical drives is not required.
Recovering from compromised fault tolerance
If fault tolerance is compromised, inserting replacement drives does not improve the condition of the logical
volume. Instead, if the screen displays unrecoverable error messages, perform the following procedure to
recover data:
1. Power down the entire system, and then power it back up. In some cases, a marginal drive will work
again for long enough to enable you to make copies of important files.
If a 1779 POST message is displayed, press the F2 key to re-enable the logical volumes. Remember that
data loss has probably occurred and any data on the logical volume is suspect.
2. Make copies of important data, if possible.
3. Replace any failed drives.
4. After you have replaced the failed drives, fault tolerance may again be compromised. If so, cycle the
power again. If the 1779 POST message is displayed:
a. Press the F2 key to re-enable the logical drives.
b. Recreate the partitions.
c. Restore all data from backup.
To minimize the risk of data loss that is caused by compromised fault tolerance, make frequent backups of all
logical volumes.
Replacing drives
The most common reason for replacing a drive is that it has failed. However, another reason is to gradually
increase the storage capacity of the entire system ("Upgrading drive capacity" on page 86).