Managing HP Serviceguard for Linux, Sixth Edition, August 2006

Troubleshooting Your Cluster
Testing Cluster Operation
Chapter 8264
Testing Cluster Operation
Once you have configured your Serviceguard cluster, you should verify
that the various components of the cluster behave correctly in case of a
failure. In this section, the following procedures test that the cluster
responds properly in the event of a package failure, a node failure, or a
LAN failure.
CAUTION In testing the cluster in the following procedures, be aware that you are
causing various components of the cluster to fail, so that you can
determine that the cluster responds correctly to failure situations. As a
result, the availability of nodes and applications may be disrupted.
Testing the Package Manager
To test that the package manager is operating correctly, perform the
following procedure for each package on the cluster:
1. Obtain the PID number of a service in the package by entering
# ps -ef | grep <
service_cmd
>
where service_cmd is the executable specified in the package control
script with the parameter SERVICE_CMD. The service selected must
not have SERVICE_RESTART specified.
2. To kill the service_cmd PID, enter
# kill
PID
3. To view the package status, enter
# cmviewcl -v
The package should be running on the specified adoptive node.
4. Halt the package, then move it back to the primary node using the
cmhaltpkg, cmmodpkg, and cmrunpkg commands:
# cmhaltpkg <
PackageName
>
# cmmodpkg -e <
PrimaryNode
> <
PackageName
>
# cmrunpkg -v <
PackageName
>