Designing Disaster Recovery Clusters using Metroclusters and Continentalclusters, Reprinted October 2011 (5900-1881)

ManualsBrandsHP ManualsSoftwareHP Serviceguard Metrocluster with EMC SRDF

511

512

513

514

515

516

517

518

519

520

regional disaster A disaster, such as an earthquake or hurricane, that affects a large region. Local, campus, and

proximate metropolitan clusters are less likely to protect from regional disasters.

rehearsal package The recovery cluster package used to validate the recovery environment and procedure as part

of a rehearsal operation.

remote failover Failover to a node at another data center or remote location.

resynchronization The process of making the data between two sites consistent and current once systems are restored

following a failure. Also called data resynchronization.

rolling disaster A second disaster that occurs before recovering from a previous disaster, for example, while data

is being synchronized between two data centers after a disaster, one of the data centers fails,

interrupting the data synchronization process. Rolling disasters may result in data corruption that

requires a reload from tape backups.

single point of

failure (SPOF)

A component of a cluster or node that, if it fails, affects access to applications or services. See

also multiple points of failure.

single system high

availability

Hardware design that results in a single system that has availability higher than normal. Hardware

design examples are:

• n+1 fans

• n+1 power supplies

• multiple power cords

• on-line addition or replacement of I/O cards, memory, etc.

special device file The device file name that the HP-UX operating system gives to a single connection to a node, in

the format /dev/devtype/filename.

split-brain

syndrome

When a cluster reforms with equal numbers of nodes at each site, and each half of the cluster

thinks it is the authority and starts up the same set of applications, and tries to modify the same

data, resulting in data corruption. Serviceguard architecture prevents split-brain syndrome in all

cases unless dual cluster locks are used.

SRDF (Symmetrix Remote Data Facility) A level 1-3 protocol used for physical data replication between

EMC Symmetrix disk arrays.

sub-clusters Sub-clusters are clusterwares that run above the Serviceguard cluster and comprise only the nodes

in a Metrocluster site. Sub-clusters have access only to the storage arrays within a site.

SVOL A secondary volume configured in an P9000 or XP series disk array that uses Continuous Access.

SVOLs are the secondary copies in physical data replication with Continuos Access on the P9000

or XP.

SymCLI The Symmetrix command line interface used to configure and manage EMC Symmetrix disk

arrays.

Symmetrix device

number

The unique device number that identifies an EMC logical volume.

synchronous data

replication

Each data replication I/O waits for the preceding I/O to complete before beginning another

replication. Minimizes the chance of inconsistent or corrupt data in the event of a rolling disaster.

transaction

processing monitor

(TPM)

Software that allows you to modify an application to store in-flight transactions in an external

location until that transaction has been committed to all possible copies of the database or

filesystem, thus ensuring completion of all copied transactions. A TPM protects against data loss

at the expense of the CPU overhead involved in applying the transaction in each database replica.

Software that provides a reliable mechanism to ensure that all transactions are successfully

committed. A TPM may also provide load balancing among nodes.

transparent

failover

A client application that automatically reconnects to a new server without the user taking any

action.

520 Glossary