RDF System Management Manual
Table Of Contents
- RDF System Management Manual
- What’s New in This Manual
- About This Manual
- 1 Introducing RDF
- RDF Subsystem Overview
- RDF Processes
- RDF Operations
- Reciprocal and Chain Replication
- Available Types of Replication to Multiple Backup Systems
- Triple Contingency
- Loopback Configuration (Single System)
- Online Product Initialization
- Online Database Synchronization
- Online Dumps
- Subvolume- and File-Level Replication
- Shared Access DDL Operations
- EMS Support
- SMF Support
- RTD Warning Thresholds
- Process-Lockstep Operation
- Support for Network Transactions
- RDF and NonStop SQL/MX
- Zero Lost Transactions (ZLT)
- Monitoring RDF Entities With ASAP
- 2 Preparing the RDF Environment
- 3 Installing and Configuring RDF
- 4 Operating and Monitoring RDF
- 5 Managing RDF
- Recovering From File System Errors
- Handling Disk Space Problems
- Responding to Operational Failures
- Stopping RDF
- Restarting RDF
- Carrying Out a Planned Switchover
- Takeover Operations
- Reading the Backup Database
- Access to Backup Databases in a Consistent State
- RDF and NonStop SQL/MP DDL Operations
- RDF and NonStop SQL/MX Operations
- Backing Up Image Trail Files
- Making Online Dumps With Updaters Running
- Doing FUP RELOAD Operations With Updaters Running
- Exception File Optimization
- Switching Disks on Updater UPDATEVOLUMES
- 6 Maintaining the Databases
- 7 Online Database Synchronization
- 8 Entering RDFCOM Commands
- 9 Entering RDFSCAN Commands
- 10 Triple Contingency
- 11 Subvolume- and File-Level Replication
- 12 Auxiliary Audit Trails
- 13 Network Transactions
- Configuration Changes
- RDF Network Control Files
- Normal RDF Processing Within a Network Environment
- RDF Takeovers Within a Network Environment
- Takeover Phase 1 – Local Undo
- Takeover Phase 2 – File Undo
- Takeover Phase 3 – Network Undo
- Takeover Phase 3 Performance
- Communication Failures During Phase 3 Takeover Processing
- Takeover Delays and Purger Restarts
- Takeover Restartability
- Takeover and File Recovery
- The Effects of Undoing Network Transactions
- Takeover and the RETAINCOUNT Value
- Network Configurations and Shared Access NonStop SQL/MP DDL Operations
- Network Validation and Considerations
- RDF Re-Initialization in a Network Environment
- RDF Networks and ABORT or STOP RDF Operations
- RDF Networks and Stop-Update-to-Time Operations
- Sample Configurations
- RDFCOM STATUS Display
- 14 Process-Lockstep Operation
- Starting a Lockstep Operation
- The DoLockstep Procedure
- The Lockstep Transaction
- RDF Lockstep File
- Multiple Concurrent Lockstep Operations
- The Lockstep Gateway Process
- Disabling Lockstep
- Reenabling Lockstep
- Lockstep Performance Ramifications
- Lockstep and Auxiliary Audit Trails
- Lockstep and Network Transactions
- Lockstep Operation Event Messages
- 15 NonStop SQL/MX and RDF
- Including and Excluding SQL/MX Objects
- Obtaining ANSI Object Names From Updater Event Messages
- Creating NonStop SQL/MX Primary and Backup Databases from Scratch
- Creating a NonStop SQL/MX Backup Database From an Existing Primary Database
- Online Database Synchronization With NonStop SQL/MX Objects
- Offline Synchronization for a Single Partition
- Online Synchronization for a Single Partition
- Correcting Incorrect NonStop SQL/MX Name Mapping
- Consideration for Creating Backup Tables
- Restoring to a Specific Location
- Comparing NonStop SQL/MX Tables
- 16 Zero Lost Transactions (ZLT)
- A RDF Command Summary
- B Additional Reference Information
- C Messages
- D Operational Limits
- E Using ASAP
- Index
Zero Lost Transactions (ZLT)
HP NonStop RDF System Management Manual—524388-003
16-9
ZLT Takeover Operations
ZLT Takeover Operations
If ZLT is configured and enabled when a TAKEOVER command is issued on the
backup system, the overall takeover operation executes in two phases.
Phase 1 (ZLT Processing)
RDFCOM stops all RDF processes on the backup system. If the standby and backup
systems are not the same system, RDFCOM copies the RDF configuration file on the
backup system to the standby system. RDFCOM then starts an RDF monitor process
on the backup system. That monitor then starts the extractor(s) on the standby system
and the receiver(s), purger, and updater(s) on the backup system.
Each extractor logs an event (901) reporting it is started for ZLT processing, starts a
special audit-fixup process to fix up the last file in the audit trail (see The Audit-Fixup
Process below), and sends all remaining audit records to its receiver. When an
extractor reaches the end of its audit trail, it sends a “ZLT finished” indication to its
receiver, and logs an event (900) reporting it has completed its ZLT task. When all
extractors are finished, they are terminated and deleted. Upon receiving the “ZLT
finished” indication, each receiver logs an event (903) reporting it has completed its
ZLT task, and tells its updater to commence normal takeover operations. When all
receivers have finished their ZLT processing, the overall takeover operation proceeds
to phase 2.
The Audit-Fixup Process
The audit-fixup process only ever runs on the remote standby system in an RDF/ZLT
environment and typically lasts only a few seconds. The audit-fixup process performs
file-fixup operations on audit-trail files on the remote mirror that have been left with the
CRASHOPEN flag set following a failure of the RDF primary node. The audit-fixup
process is started by an extractor whenever the extractor attempts to read an audit-trail
file that has the CRASHOPEN flag set. Unlike the other RDF processes, the audit-fixup
process does not persist for the duration of the RDF environment. The audit-fixup
process is started on demand by the extractor process, and terminates as soon as it
has performed the file-fixup processing on the audit-trail file.
This process does not run as process pair, but the extractor will start a new audit-fixup
process if the audit-fixup process is terminated due to a processor failure. No
configuration parameters are required for the audit-fixup process. The audit-fixup
process runs in the same CPU as the extractor primary process with a process priority
one less than the extractor priority.
Note. Before issuing the TAKEOVER command, you must have connected the remote mirrors
to the standby system. When the remote mirrors are connected to the standby system, the
audit records on the remote mirrors have no relationship to the audit trail on the standby
system. The remote mirrors are not part of the TMF configuration of the standby system.