RDF/IMP and IMPX System Management Manual (RDF 1.3+)

ManualsBrandsHP ManualsServerHP NonStop G-Series

121

122

123

124

125

126

127

128

129

130

Managing RDF

Compaq NonStop™ RDF/IMP and IMPX System Management Manual—522204-001

5-8

TMF Audited Volume Crash

Updater Failure

If an updater process fails, RDF aborts. A subsequent START RDF command restarts

this process without requiring database resynchronization. To support restartability,

however, the updaters use a different mechanism than the extractor or receiver do: the

updaters rely entirely on context saving rather than checkpointing. For this reason, if the

backup member of an updater process pair takes over because the CPU of the primary

member failed, the backup updater might have to start at an earlier point in the image

trail and require several minutes to reach the point where the primary process was

positioned when the CPU failed.

When a primary CPU on which an updater process is running goes down and then

comes back up, RDF does not switch the updater to run on the reactivated primary CPU.

Instead, once the backup updater takes over, it becomes (and remains) the new primary

process. If you stop and then restart updating, however, the original CPU configuration

for this updater process is restored.

RDF State Transition Failure

Periods during which the RDF updating facility, or RDF itself, is either starting or

stopping are known as RDF state transitions. In rare instances, when a primary CPU

fails for an RDF process during execution of a STOP RDF or STOP UPDATE

command, not all RDF processes complete the state transition properly.

To minimize the chance of encountering this kind of failure, avoid CPU reloads during

RDF state transitions. Furthermore, if a CPU failure does occur during a state transition,

carefully review the EMS event log for signs of incorrect behavior. If the failure

occurred while RDF or the updating facility was stopping, check the Process Pair

Directory (PPD) to ensure that the appropriate RDF processes all have stopped; if they

have not, you must stop them manually.

If a state transition failure occurs during execution of a STOP RDF command and the

operation appears to be stalled, manually stop all of the RDF processes by issuing the

following command on both the primary and backup system:

STATUS *, PROG $SYSTEM.RDF.*, STOP

If a state transition failure occurs during execution of a STOP UPDATE command and

the operation appears to be stalled, manually stop all of the RDF updaters by issuing the

following command on the backup system:

STATUS *, PROG $SYSTEM.RDF.RDFUPDO, STOP

TMF Audited Volume Crash

RDF can recover from a crash of a TMF audited volume on the primary system. If the

volume is successfully recovered by volume recovery, then you do not have to perform

any special RDF procedures.

Caution. Issuing this command in this situation is only safe, however, if this is the backup

system for a single RDF environment.