Managing HP Serviceguard for Linux Ninth Edition, April 2009

Migrating a Legacy Package to a Modular Package....................................................262
Reconfiguring a Package on a Running Cluster .........................................................262
Reconfiguring a Package on a Halted Cluster ............................................................263
Adding a Package to a Running Cluster.....................................................................263
Deleting a Package from a Running Cluster ..............................................................263
Resetting the Service Restart Counter.........................................................................264
Allowable Package States During Reconfiguration ....................................................264
Changes that Will Trigger Warnings......................................................................268
Responding to Cluster Events ..........................................................................................268
Single-Node Operation ....................................................................................................269
Removing Serviceguard from a System...........................................................................269
8 Troubleshooting Your Cluster....................................................................................................271
Testing Cluster Operation ................................................................................................271
Testing the Package Manager .....................................................................................271
Testing the Cluster Manager .......................................................................................272
Monitoring Hardware ......................................................................................................272
Replacing Disks.................................................................................................................273
Replacing a Faulty Mechanism in a Disk Array..........................................................273
Replacing a Lock LUN.................................................................................................273
Revoking Persistent Reservations after a Catastrophic Failure........................................274
Examples......................................................................................................................275
Replacing LAN Cards.......................................................................................................275
Replacing a Failed Quorum Server System......................................................................276
Troubleshooting Approaches ...........................................................................................278
Reviewing Package IP Addresses ...............................................................................278
Reviewing the System Log File ..................................................................................279
Sample System Log Entries ...................................................................................279
Reviewing Object Manager Log Files .........................................................................280
Reviewing Configuration Files ...................................................................................280
Reviewing the Package Control Script .......................................................................280
Using the cmquerycl and cmcheckconf Commands.............................................281
Reviewing the LAN Configuration ............................................................................281
Solving Problems .............................................................................................................281
Name Resolution Problems.........................................................................................282
Networking and Security Configuration Errors....................................................282
Cluster Re-formations Caused by Temporary Conditions..........................................282
Cluster Re-formations Caused by MEMBER_TIMEOUT Being Set too Low.............282
System Administration Errors ....................................................................................283
Package Control Script Hangs or Failures ............................................................284
Package Movement Errors ..........................................................................................285
Node and Network Failures .......................................................................................286
Troubleshooting the Quorum Server...........................................................................286
12 Table of Contents