RDF System Management Manual for J-series and H-series RVUs (RDF Update 13)

ManualsBrandsHP ManualsServerHP Integrity NonStop H-Series

331

332

333

334

335

336

337

338

339

340

1. Determine which disks (the local disk on the primary system or the remote mirror on the standby

system) for all audit trails in the RDF configuration received the most audit records. The example

that follows shows how to do so for the MAT. If your RDF configuration includes one or more

auxiliary audit trails, you must do the same for each auxiliary audit trail.

NOTE: if you had CommitHoldMode configured ON at the time the primary system failed,

then you can omit this step because the mirrors on the remote system will either have more

data or the same data as the local mirrors connected to the primary system. This first step is

only useful if you had turned off CommitHoldMode before the primary system failed.

On the ZLT standby system, use SNOOP READAUDIT to read the final file in the MAT,

positioning at EOF and reading in reverse order for one record. This is sample output from

READAUDIT with the MAT position in bold:

* SEQNO = 8, RBA = 107628804, RBN = 26276 *

AC^RECORD^LENGTH=108, AC_VERSION=7, VERSION_FLAGS=000000 000000, PRIMARY^CPU=0

AUDITING^PROCESS=TMP , VSN=000000 000000 000004 077334

TRANSID=000000 000000 000000 000000, ACTTX=0, TYPE=1033 (DATAVOL STATE)

CREATING^SYSTEM=190, VOLNAME=$DATA13, STATE=8, STATE^TEXT=STARTED

On the former primary system, the last file in the MAT might have been left in the crashopen

state. You can determine that by issuing this command:

$system system 3> fileinfo $*.ztmfat.*

$audit.ztmfat

CODE EOF LAST MODIFIED OWNER RWEP PExt SExt

aa000001 134 125825024 01feb2005 10:15 255,255 gggg 3840 3840

aa000002 134 125829120 01feb2005 10:20 255,255 gggg 3840 3840

aa000003 134 125829120 01feb2005 10:24 255,255 gggg 3840 3840

aa000004 134 125808640 01feb2005 10:31 255,255 gggg 3840 3840

aa000005 134 125829120 01feb2005 10:38 255,255 gggg 3840 3840

aa000006 134 125829120 01feb2005 10:45 255,255 gggg 3840 3840

aa000007 134 125829120 01feb2005 10:54 255,255 gggg 3840 3840

aa000008 134 125829120 01feb2005 11:04 255,255 gggg 3840 3840

aa000009 134 125829120 01feb2005 11:14 255,255 gggg 3840 3840

aa000008 ? 134 107630592 01feb2005 11:04 255,255 gggg 3840 3840

The file marked with the question mark must be fixed. Use the SNOOP FIXUPEOF command

to reset the crashopen flag. Then use SNOOP READAUDIT to read the final record. You cannot

use the MERGE option when specifying the name of the audit trail file. Because the TMF

product is not started, attempting to use the MERGE option results in an error. Using the

example of the MAT, specify the MAT volume and subvolume when SNOOP issues this prompt:

Audit trail name or 'MERGE' (MERGE): $AUDIT.ZTMFAT.AA

Compare the MAT position of the two records to determine which disk has the most audit

records.

2. Recover the database on your former primary system. How you do this depends upon whether

local disks or remote mirrors received the most audit records (which you determined in the

preceding step).

CommitHoldMode ON

If all of the remote mirrors (MAT and all auxiliary audit trails) have more or the same number

of audit records as the local disks (this typically happens if CommitHoldMode was configured

and enabled on the primary system when the outage occurred):

a. Issue SCF STOP $audit-vol on the former primary system (this stops the local disk).

b. Issue SCF STOP $audit-vol on the ZLT standby system (this stops the remote mirror on

the ZLT standby system).

c. Issue SCF START $audit-vol -M (this starts only the remote mirror).

d. Once the remote mirror is started, issue SCF START $audit-vol (which causes the revive

from -M to -P)

Recovering the Primary System After an RDF ZLT Takeover 337