Designing Disaster Tolerant High Availability Clusters, 10th Edition, March 2003 (B7660-90013)

Building a Continental Cluster
Restoring Disaster Tolerance
Chapter 5256
Restoring Disaster Tolerance
After a failover to a cluster occurs, restoring disaster tolerance has many
challenges, the most significant of which are:
Restoring the failed cluster
Depending on the nature of the disaster you may need to create a
new cluster, or you may be able to restore the cluster. Steps for each
scenario are discussed in the following sections.
Before starting up the new or the failed cluster, make sure the
AUTO_RUN flag for all of the ContinentalClusters application
packages is disabled. This is to prevent starting the packages
unexpectedly with the cluster.
Resynchronizing the data
To resynchronize the data, you either restore the data to the cluster
and continue with the same data replication procedure, or set up
data replication to function in the other direction.
The following sections briefly outline some scenarios for restoring
disaster tolerance.
Restore Clusters to their Original Roles
If the disaster did not destroy the cluster, you can return both clusters to
their original roles. To do this:
1. Make sure that both clusters are up and running, with the recovery
packages continuing to run on the surviving cluster.
2. On each cluster, stop the ContinentalClusters monitor package if it
is still running:
# cmhaltpkg ccmonpkg
3. Compare the clusters to make sure their configurations are
consistent. Correct any inconsistencies.
4. For each recovery group where the new cluster will run the primary
package:
a. Synchronize the data from the disks on the surviving cluster to
the disks on the new cluster. This may be time-consuming.