PCI Error Recovery Product Note HP-UX 11i v3 Third Edition Manufacturing Part Number: 5992-4013 March 2008 © Copyright 2001-2008 Hewlett-Packard Development Company L.P.
Legal Notices Copyright 2008 Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license. The information contained herein is subject to change without notice.
Publishing History New editions of this manual will incorporate information that is new or has changed since the previous edition was published (minor typographical or formatting corrections do not result in the publication of a new edition). The edition, HP Manufacturing Part Number, and publication date all change each time a new edition is published, providing a unique identification for each edition.
Contents PCI Error Recovery Product Note What is PCI Error Recovery? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Confirm PCI Error Recovery is Supported . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Using ioscan to identify PCI Error Recovery Capability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Tunable Kernel Parameters . . . . . . . . . . . . . .
Contents 2
PCI Error Recovery Product Note What is PCI Error Recovery? The PCI Error Recovery feature provides the ability to detect, isolate, and automatically recover from a PCI error, avoiding a system crash. PCI Error Recovery is included with the HP-UX 11i v3 operating system, and it is enabled by default. NOTE PCI Error Recovery is not supported on all platforms. To determine if PCI Error Recovery is supported on your system, see the PCI Error Recovery Support Matrix, available at http://docs.hp.com/en/ha.
PCI Error Recovery Product Note Confirm PCI Error Recovery is Supported Confirm PCI Error Recovery is Supported Step 1. To confirm PCI Error Recovery (ER) is supported with your configuration and system firmware version, see PCI Error Recovery Support Matrix, HP-UX 11i v3 at: http://docs.hp.com/en/ha.html NOTE PCI-express ER functionality can be enabled only if the patch set: PHKL_37099, PHKL_37329, PHKL_37330, PHKL_37331, PHKL_37648, PHKL_37405, and PHKL_37510 is installed on HP-UX 11i v3 OS.
PCI Error Recovery Product Note Confirm PCI Error Recovery is Supported ED | 3.13 | | | | CLU | 15.2 | 15.2 | 15.2 | 15.2 | PM | 15.0 | 15.0 | 15.0 | 15.0 | CIO (bay 0, chassis 1) | 15.0 | 15.0 | 15.0 | 15.0 | CIO (bay 0, chassis 3) | 15.0 | 15.0 | 15.0 | 15.0 | CIO (bay 1, chassis 1) | 15.0 | 15.0 | 15.0 | 15.0 | CIO (bay 1, chassis 3) | 15.0 | | 15.0 | 15.
PCI Error Recovery Product Note Confirm PCI Error Recovery is Supported Cell 2 : 1.002 1.010 Cell 3 : 1.002 1.010 FIRMWARE: Core IO Master : Event Dict. : Slave : Event Dict. : A.007.008 0.009 A.007.008 0.009 Cell 0 PDHC : A.003.027 Pri SFW : 23.001 (PA) Sec SFW : 23.001 (PA) Cell 1 PDHC : A.003.027 Pri SFW : 23.001 (PA) Sec SFW : 23.001 (PA) Cell 2 PDHC : A.003.027 Pri SFW : 23.001 (PA) Sec SFW : 23.001 (PA) Cell 3 8 PDHC : A.003.027 Pri SFW : 23.
PCI Error Recovery Product Note Confirm PCI Error Recovery is Supported NOTE The sysrev command output on some systems includes extra zeros in the system firmware version number. These zeros can be ignored. For example, 3.88 and 3.088 on Integrity systems are the same firmware version, also 23.1 and 23.001 on HP 9000 systems represent the same firmware version. Step 3. The system firmware is the main component of the firmware recipe required to support PCI Error Recovery.
PCI Error Recovery Product Note Tunable Kernel Parameters # ioscan -P error_recovery -d lba Table 1 Error Recovery Attributes Class I H/W Path Error_Recovery ba 0 0/0/0 Supported ba 1 0/0/1 Supported ba 10 0/0/8 Supported ba 11 0/0/9 Supported ba 13 0/0/10 Supported ba 14 0/0/12 Supported This implies that PCI error recovery is supported for I/O adapters located under LBAs like 0/0/0, 0/0/1.
PCI Error Recovery Product Note Error Messages for PCI Error Recovery Error Messages for PCI Error Recovery All drivers that support PCI Error Recovery generate error messages for specific PCI Error Recovery events. Mass storage drivers post error messages to syslog and to diaglog. If a mass storage driver generates verbose error messages, they can be accessed in the Support Tools Manager (STM) diagnostic logs. Networking drivers post error messages to the console and to syslog.
PCI Error Recovery Product Note Automatic Recovery from a PCI Error 0-0-1-0 0/0/0/1 0 133 133 On Yes No Yes Yes PCI-X PCI-X 0-0-1-1 0/0/1/1 256 133 66 On Yes No Yes Yes PCI-X PCI 0-0-1-8 0/0/12/1 2304 133 66 On Yes No Yes Yes PCI-X PCI 0-0-1-9 0/0/10/1 2048 133 133 Off No N/A N/A N/A PCI-X PCI-X 0-0-1-10 0/0/9/1 1792 133 33 On Yes No Yes Yes PCI-X PCI 0-0-1-11 0/0/8/1 1536 133 133 Off No N/A N/A N/A PCI-X PCI-X PCI-Express Slots Information ---
PCI Error Recovery Product Note Manual Recovery from a PCI Error For more information on manual recovery from a PCI error, see “Manual Recovery from a PCI Error” on page 13. Manual Recovery from a PCI Error After a successful automatic PCI error recovery, if another PCI Error is detected within the time interval specified by the pci_error_tolerance_time tunable, the card in the I/O slot will be suspended. A manual PCI Error Recovery operation is required to restore the card.
PCI Error Recovery Product Note Manual Recovery from a PCI Error Driver(s) Capable Slot Path Link Max Max Link Spd Link Link Width Spd Width Pwr Occu Susp OLAR OLD Mode 0-0-1-2 0/0/2/0/0/0 2.5 2.5 x8 x4 On Yes No Yes Yes PCIe 0-0-1-3 0/0/4/0/0/0 2.5 2.5 x8 x4 On Yes No Yes Yes PCIe 0-0-1-4 0/0/5/0/0/0 2.5 2.5 x8 x8 Off No N/A N/A N/A PCIe 0-0-1-5 0/0/6/0/0/0 2.5 2.5 x8 x4 On Yes No Yes No PCIe 0-0-1-6 0/0/14/0/0/0 2.5 2.
PCI Error Recovery Product Note Manual Recovery from a PCI Error Activity : Target slot powered off, drivers suspended, OK to replace the card Target slot : 0-0-1-0 4. Execute the olrad -q command to confirm the power is off.
PCI Error Recovery Product Note Manual Recovery from a PCI Error Target slot : 0-0-1-0 6. Execute the olrad -q command to confirm that the card has been resumed.
PCI Error Recovery Product Note Manual Recovery from a PCI Error • You can perform an OL* online replacement operation to replace the I/O card with an I/O card that has the same HP Manufacturing Part Number and the same (or later) release version number. • You can perform an OL* online deletion operation to delete the card and the driver instance associated with that card. After a successful online deletion, the slot is available to be used with another I/O card.
PCI Error Recovery Product Note PCI Error Recovery Documentation PCI Error Recovery Documentation The documentation that supports this release of the PCI Error Recovery feature consists of: • PCI Error Recovery Product Note — available at http://docs.hp.com/en/ha.html in the PCI Error Recovery section. • PCI Error Recovery Support Matrix — available at http://docs.hp.com/en/ha.html in the PCI Error Recovery section. • Interface Card OL* Support Guide — available at http://docs.hp.com/en/ha.