PCI Error Recovery Product Note
HP-UX 11i v3
Second Edition
Manufacturing Part Number: 5992-1722
June 2007
© Copyright 2001-2007 Hewlett-Packard Development Company L.P.
Legal Notices

Copyright 2007 Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license. The information contained herein is subject to change without notice.

Publishing History

New editions of this manual will incorporate information that is new or has changed since the previous edition was published (minor typographical or formatting corrections do not result in the publication of a new edition). The edition, HP Manufacturing Part Number, and publication date all change each time a new edition is published, providing a unique identification for each edition.
Contents

What is PCI Error Recovery?
Confirm PCI Error Recovery is Supported
Using ioscan to identify PCI Error Recovery Capability
Tunable Kernel Parameters
What is PCI Error Recovery?

The PCI Error Recovery feature provides the ability to detect, isolate, and automatically recover from a PCI error, avoiding a system crash. PCI Error Recovery is included with the HP-UX 11i v3 operating system, and it is enabled by default.

NOTE PCI Error Recovery is not supported on all platforms. To determine if PCI Error Recovery is supported on your system, see the PCI Error Recovery Support Matrix, available at http://docs.hp.com/en/ha.
Confirm PCI Error Recovery is Supported

Step 1. To confirm PCI Error Recovery is supported with your configuration and system firmware version, see PCI Error Recovery Support Matrix, HP-UX 11i v3 at: http://docs.hp.com/en/ha.html

Step 2.
CIO (bay 1, chassis 3) | 15.0 | | 15.0 | 15.0 |

On the mid-range systems that support PCI Error Recovery, the system firmware version will be listed with the Pri SFW heading as illustrated in this example:

MP:CM> sysrev
Cabinet firmware revision report
PROGRAMMABLE HARDWARE :
System Backplane : PCI-X Backplane : Core IO :
GPM      FM       OSP
-------  -------  -------
1.002    1.002    1.002
LPM      HS
-------  -------
2.000    1.
Event Dict. : Slave : Event Dict. :
0.009 A.007.008 0.009
Cell 0  PDHC : A.003.027  Pri SFW : 23.001 (PA)  Sec SFW : 23.001 (PA)
Cell 1  PDHC : A.003.027  Pri SFW : 23.001 (PA)  Sec SFW : 23.001 (PA)
Cell 2  PDHC : A.003.027  Pri SFW : 23.001 (PA)  Sec SFW : 23.001 (PA)
Cell 3  PDHC :            Pri SFW : 23.001 (PA)  Sec SFW : 23.001

NOTE A.003.
Step 3. The system firmware is the main component of the firmware recipe required to support PCI Error Recovery. If you do not have the minimum system firmware version (or a later version) listed in the PCI Error Recovery Support Matrix (http://docs.hp.com/en/ha.html), you do not have a firmware recipe installed on your system that supports PCI Error Recovery. Go to the Business Support Center Web site at http://www.hp.
ba   8  0/0/12  Supported
ba   9  0/0/14  Supported
ba  10  1/0/0   Unsupported
ba  11  1/0/1   Supported
ba  12  1/0/2   Supported
ba  13  1/0/4   Supported
ba  14  1/0/6   Supported
ba  16  1/0/8   Supported
ba  17  1/0/10  Supported
ba  18  1/0/12  Supported
ba  19  1/0/14  Supported

This implies that PCI error recovery is supported for I/O adapters located under LBAs like 0/0/1 or 0/0/2 but not for those under 0/0/0 or 1/0/0.
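Checking a long listing of this kind by eye is error-prone, so a small script can pick out the unsupported paths. The following is a minimal sketch in Python, assuming each row has the four whitespace-separated fields seen in the listing above (class, instance, hardware path, status). The function names parse_support_rows and unsupported_paths are hypothetical, and the sketch works on saved text only; it does not run ioscan itself.

```python
def parse_support_rows(text):
    """Map hardware path -> status for rows shaped like 'ba 8 0/0/12 Supported'.

    Assumption: four whitespace-separated fields per row, as in the listing.
    Lines that do not match this shape are ignored.
    """
    result = {}
    for line in text.splitlines():
        fields = line.split()
        if (len(fields) == 4 and fields[0] == "ba"
                and fields[3] in ("Supported", "Unsupported")):
            result[fields[2]] = fields[3]
    return result


def unsupported_paths(text):
    """Return the hardware paths whose status is 'Unsupported', in input order."""
    rows = parse_support_rows(text)
    return [path for path, status in rows.items() if status == "Unsupported"]


# Three rows copied from the listing above.
SAMPLE = """\
ba 9 0/0/14 Supported
ba 10 1/0/0 Unsupported
ba 11 1/0/1 Supported
"""

print(unsupported_paths(SAMPLE))  # prints ['1/0/0']
```

I/O adapters under any path reported by unsupported_paths would not be covered by PCI Error Recovery.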
Error Messages for PCI Error Recovery

All drivers that support PCI Error Recovery generate error messages for specific PCI Error Recovery events. Mass storage drivers post error messages to syslog and to diaglog. If a mass storage driver generates verbose error messages, they can be accessed in the Support Tools Manager (STM) diagnostic logs. Networking drivers post error messages to the console and to syslog.
Automatic Recovery from a PCI Error

3. The olrad -q command output will be normal after a PCI Error recovery.
Manual Recovery from a PCI Error

After a successful automatic PCI error recovery, if another PCI Error is detected within the time interval specified by the pci_error_tolerance_time tunable, the card in the I/O slot will be suspended. A manual PCI Error Recovery operation is required to restore the card.
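The suspension rule described above can be modeled in a few lines. The following Python sketch is an illustration only, not the kernel's implementation. The class name SlotErrorTracker, its methods, the unit-free timestamps, the treatment of the interval boundary, and the behavior after a manual resume are all assumptions made for the example.

```python
class SlotErrorTracker:
    """Toy model of one I/O slot: a second PCI error inside the tolerance
    interval after an automatic recovery suspends the card."""

    def __init__(self, tolerance):
        # Stands in for pci_error_tolerance_time; units are arbitrary here.
        self.tolerance = tolerance
        self.last_recovery = None   # time of the last automatic recovery
        self.suspended = False

    def on_pci_error(self, now):
        """Return 'recovered' or 'suspended' for an error seen at time `now`."""
        if self.suspended:
            return "suspended"
        within_window = (self.last_recovery is not None
                         and now - self.last_recovery < self.tolerance)
        if within_window:
            # Boundary handling (< versus <=) is an assumption.
            self.suspended = True
            return "suspended"
        # Assume the automatic recovery itself succeeds.
        self.last_recovery = now
        return "recovered"

    def manual_resume(self):
        """Model a manual recovery; clearing the history is an assumption."""
        self.suspended = False
        self.last_recovery = None


tracker = SlotErrorTracker(tolerance=100)
print(tracker.on_pci_error(0))    # prints recovered
print(tracker.on_pci_error(50))   # prints suspended
tracker.manual_resume()
print(tracker.on_pci_error(60))   # prints recovered
```

In this model, an error that arrives after the tolerance interval has elapsed is handled by automatic recovery again rather than suspending the card.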
2.
8-0-1-1   7/0/1/1   1813  133  133  On   Yes  No   Yes
8-0-1-2   7/0/2/1   1834  133  133  On   Yes  No   Yes
8-0-1-3   7/0/3/1   1855  133  66   On   Yes  No
8-0-1-4   7/0/4/1   1876  133  133  On   Yes  No
8-0-1-5   7/0/6/1   1897  133  133  On   Yes  No
8-0-1-6   7/0/14/1  2026  133  66   On   Yes  No
8-0-1-7   7/0/12/1  2004  133  133  Off  Yes  Yes
8-0-1-8   7/0/11/1  1982  133  66   On   Yes  No
8-0-1-9   7/0/10/1  1960  133  33   On   Yes  No
8-0-1-10  7/0/9/1   1939  133  133  On   Yes  No
8-0-1-11  7/0/8/1   1918  133  66   On   Yes  No
7. After the card has been resumed, a recovery message will be displayed in the console, for example:

   Hardware path 7/0/12 Successfully recovered from PCI Error

8. If the olrad -R command does not succeed, you have a persistent PCI error condition. There is a high probability that the I/O card is defective.
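When reviewing saved console or syslog text, the recovery message from step 7 can be matched to find which hardware paths recovered. The following Python sketch assumes the message wording is exactly as in the example above; real log lines may carry timestamps or other prefixes, which the search tolerates. The function name recovered_paths is hypothetical.

```python
import re

# Pattern built from the example message in step 7; the wording is assumed
# to be identical in real output.
RECOVERY_RE = re.compile(
    r"Hardware path (\S+) Successfully recovered from PCI Error"
)


def recovered_paths(log_text):
    """Return every hardware path named in a recovery message, in order."""
    return RECOVERY_RE.findall(log_text)


SAMPLE_LOG = "Hardware path 7/0/12 Successfully recovered from PCI Error\n"

print(recovered_paths(SAMPLE_LOG))  # prints ['7/0/12']
```

An empty result after a manual recovery attempt would be consistent with the persistent error condition described in step 8, although it could also mean the message was logged elsewhere.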
PCI Error Recovery Documentation

The documentation that supports this release of the PCI Error Recovery feature consists of:

• PCI Error Recovery Product Note: available at http://docs.hp.com/en/ha.html in the PCI Error Recovery section.
• PCI Error Recovery Support Matrix: available at http://docs.hp.com/en/ha.html in the PCI Error Recovery section.
• Interface Card OL* Support Guide: available at http://docs.hp.com/en/ha.