RDF System Management Manual
Table Of Contents
- RDF System Management Manual
- What’s New in This Manual
- About This Manual
- 1 Introducing RDF
- RDF Subsystem Overview
- RDF Processes
- RDF Operations
- Reciprocal and Chain Replication
- Available Types of Replication to Multiple Backup Systems
- Triple Contingency
- Loopback Configuration (Single System)
- Online Product Initialization
- Online Database Synchronization
- Online Dumps
- Subvolume- and File-Level Replication
- Shared Access DDL Operations
- EMS Support
- SMF Support
- RTD Warning Thresholds
- Process-Lockstep Operation
- Support for Network Transactions
- RDF and NonStop SQL/MX
- Zero Lost Transactions (ZLT)
- Monitoring RDF Entities With ASAP
- 2 Preparing the RDF Environment
- 3 Installing and Configuring RDF
- 4 Operating and Monitoring RDF
- 5 Managing RDF
- Recovering From File System Errors
- Handling Disk Space Problems
- Responding to Operational Failures
- Stopping RDF
- Restarting RDF
- Carrying Out a Planned Switchover
- Takeover Operations
- Reading the Backup Database
- Access to Backup Databases in a Consistent State
- RDF and NonStop SQL/MP DDL Operations
- RDF and NonStop SQL/MX Operations
- Backing Up Image Trail Files
- Making Online Dumps With Updaters Running
- Doing FUP RELOAD Operations With Updaters Running
- Exception File Optimization
- Switching Disks on Updater UPDATEVOLUMES
- 6 Maintaining the Databases
- 7 Online Database Synchronization
- 8 Entering RDFCOM Commands
- 9 Entering RDFSCAN Commands
- 10 Triple Contingency
- 11 Subvolume- and File-Level Replication
- 12 Auxiliary Audit Trails
- 13 Network Transactions
- Configuration Changes
- RDF Network Control Files
- Normal RDF Processing Within a Network Environment
- RDF Takeovers Within a Network Environment
- Takeover Phase 1 – Local Undo
- Takeover Phase 2 – File Undo
- Takeover Phase 3 – Network Undo
- Takeover Phase 3 Performance
- Communication Failures During Phase 3 Takeover Processing
- Takeover Delays and Purger Restarts
- Takeover Restartability
- Takeover and File Recovery
- The Effects of Undoing Network Transactions
- Takeover and the RETAINCOUNT Value
- Network Configurations and Shared Access NonStop SQL/MP DDL Operations
- Network Validation and Considerations
- RDF Re-Initialization in a Network Environment
- RDF Networks and ABORT or STOP RDF Operations
- RDF Networks and Stop-Update-to-Time Operations
- Sample Configurations
- RDFCOM STATUS Display
- 14 Process-Lockstep Operation
- Starting a Lockstep Operation
- The DoLockstep Procedure
- The Lockstep Transaction
- RDF Lockstep File
- Multiple Concurrent Lockstep Operations
- The Lockstep Gateway Process
- Disabling Lockstep
- Reenabling Lockstep
- Lockstep Performance Ramifications
- Lockstep and Auxiliary Audit Trails
- Lockstep and Network Transactions
- Lockstep Operation Event Messages
- 15 NonStop SQL/MX and RDF
- Including and Excluding SQL/MX Objects
- Obtaining ANSI Object Names From Updater Event Messages
- Creating NonStop SQL/MX Primary and Backup Databases from Scratch
- Creating a NonStop SQL/MX Backup Database From an Existing Primary Database
- Online Database Synchronization With NonStop SQL/MX Objects
- Offline Synchronization for a Single Partition
- Online Synchronization for a Single Partition
- Correcting Incorrect NonStop SQL/MX Name Mapping
- Consideration for Creating Backup Tables
- Restoring to a Specific Location
- Comparing NonStop SQL/MX Tables
- 16 Zero Lost Transactions (ZLT)
- A RDF Command Summary
- B Additional Reference Information
- C Messages
- D Operational Limits
- E Using ASAP
- Index
Network Transactions
HP NonStop RDF System Management Manual—524388-003
13-7
Takeover Phase 3 – Network Undo
Takeover Phase 3 – Network Undo
Note that with RDF/ZLT no undo operations are performed during the network undo
phase because no committed transactions are undone.
Phase three determines if network transaction data is missing from any of the backup
systems in the RDF network, and marks those transactions to be undone on all of the
systems. For example, suppose you began a network transaction, updated tables on
ten different systems, and then committed the transaction. Now suppose that nine of
the ten systems were able to transmit their updates and commit records to their backup
systems, but the tenth primary system went down before its extractor was able to do
so. Phase three determines that the particular transaction involved all 10 databases,
that one of the backup databases is missing audit data for that transaction, and
identifies the transaction as one that must be undone on the other nine systems (it is
undone during phase 1 on the tenth system). All of the updaters then look for audit
data associated with the transaction, and undo it.
More specifically, each purger process has three phases of work to do:
1. produce the local undo list in the ZTXUNDO file
2. produce the file undo list, if required
3. produce the network undo list in the ZNETUNDO file
The purger of the network master determines what network transactions are
incomplete across the different backup systems, and it produces the master network
undo list. Each purger then uses this master list to ascertain the transaction data that
must be undone on its backup database. For example, if a network transaction
involved only four of the ten primary systems in an RDF network, then that transaction
only needs to be undone on the backup databases where that data was replicated.
Because the other systems were not involved, the transaction does not need to be
listed there. The list of network transactions that need to be undone on a specific
system resides in its ZNETUNDO file.
Takeover Phase 3 Performance
The speed with which a takeover completes for an entire RDF network varies based on
the number of systems in the network and how far any system had fallen behind when
the takeover was initiated.
For example, if you have three systems in your RDF network, and all extractors on all
three systems were keeping up with audit generation on their systems, and then one
system fails, the takeover operations may only take a modest number of additional
seconds to complete phase 3 takeover processing.
In contrast, if you have three systems in your RDF network, and one extractor had
fallen 60 minutes behind at the time its system went down, then phase 3 takeover
processing on the other two systems will take many more seconds to complete. The
reason for this is that phase 3 processing on the two systems that were not behind will