Optimizing Serviceguard Failover Time, Version A.11.19 and later, April 2009

Introduction ...................................................................................................................................2
The HP Serviceguard failover process.................................................................................................2
What happens when failover is triggered by a node failure................................................................2
Node failure detection..............................................................................................................4
Cluster reformation time............................................................................................................4
Election of cluster membership ...................................................................................................5
Lock acquisition.......................................................................................................................5
Quiescence ............................................................................................................................5
Cluster component recovery.......................................................................................................6
Serviceguard implementation: resource recovery...........................................................................6
Serviceguard implementation: applications recovery......................................................................6
Serviceguard with Serviceguard Extension for RAC: group membership reconfiguration.......................6
Serviceguard with Serviceguard Extension for RAC: RAC reconfiguration..........................................6
What happens when failover is triggered by a package failure...........................................................7
Serviceguard implementation: resource failure detection.................................................................8
Serviceguard implementation: package determination....................................................................8
Serviceguard implementation: resource recovery...........................................................................8
Serviceguard implementation: application startup..........................................................................9
Servicegaurd with Serviceguard Extension for RAC: group membership reconfiguration.......................9
Serviceguard with Serviceguard Extension for RAC: RAC reconfiguration and database recovery..........9
How you can optimize failover time...................................................................................................9
Some help in estimating time for failover.......................................................................................10
MEMBER_TIMEOUT value...........................................................................................................10
Testing.................................................................................................................................11
Lock acquisition (cluster lock, also called tie-breaker or arbitrator)......................................................11
Quorum server considerations..................................................................................................12
Heartbeat subnet.......................................................................................................................12
Network failure detection ...........................................................................................................12
Number of nodes and number of packages...................................................................................12
EMS resources..........................................................................................................................12
Package configuration...............................................................................................................12
System restart options................................................................................................................13
Applications.............................................................................................................................13
Conclusion..................................................................................................................................13
For more information.....................................................................................................................14
Optimizing Serviceguard Failover Time
Version A.11.19 & later

Summary of content (14 pages)