RDF System Management Manual
Table Of Contents
- RDF System Management Manual
- What’s New in This Manual
- About This Manual
- 1 Introducing RDF
- RDF Subsystem Overview
- RDF Processes
- RDF Operations
- Reciprocal and Chain Replication
- Available Types of Replication to Multiple Backup Systems
- Triple Contingency
- Loopback Configuration (Single System)
- Online Product Initialization
- Online Database Synchronization
- Online Dumps
- Subvolume- and File-Level Replication
- Shared Access DDL Operations
- EMS Support
- SMF Support
- RTD Warning Thresholds
- Process-Lockstep Operation
- Support for Network Transactions
- RDF and NonStop SQL/MX
- Zero Lost Transactions (ZLT)
- Monitoring RDF Entities With ASAP
- 2 Preparing the RDF Environment
- 3 Installing and Configuring RDF
- 4 Operating and Monitoring RDF
- 5 Managing RDF
- Recovering From File System Errors
- Handling Disk Space Problems
- Responding to Operational Failures
- Stopping RDF
- Restarting RDF
- Carrying Out a Planned Switchover
- Takeover Operations
- Reading the Backup Database
- Access to Backup Databases in a Consistent State
- RDF and NonStop SQL/MP DDL Operations
- RDF and NonStop SQL/MX Operations
- Backing Up Image Trail Files
- Making Online Dumps With Updaters Running
- Doing FUP RELOAD Operations With Updaters Running
- Exception File Optimization
- Switching Disks on Updater UPDATEVOLUMES
- 6 Maintaining the Databases
- 7 Online Database Synchronization
- 8 Entering RDFCOM Commands
- 9 Entering RDFSCAN Commands
- 10 Triple Contingency
- 11 Subvolume- and File-Level Replication
- 12 Auxiliary Audit Trails
- 13 Network Transactions
- Configuration Changes
- RDF Network Control Files
- Normal RDF Processing Within a Network Environment
- RDF Takeovers Within a Network Environment
- Takeover Phase 1 – Local Undo
- Takeover Phase 2 – File Undo
- Takeover Phase 3 – Network Undo
- Takeover Phase 3 Performance
- Communication Failures During Phase 3 Takeover Processing
- Takeover Delays and Purger Restarts
- Takeover Restartability
- Takeover and File Recovery
- The Effects of Undoing Network Transactions
- Takeover and the RETAINCOUNT Value
- Network Configurations and Shared Access NonStop SQL/MP DDL Operations
- Network Validation and Considerations
- RDF Re-Initialization in a Network Environment
- RDF Networks and ABORT or STOP RDF Operations
- RDF Networks and Stop-Update-to-Time Operations
- Sample Configurations
- RDFCOM STATUS Display
- 14 Process-Lockstep Operation
- Starting a Lockstep Operation
- The DoLockstep Procedure
- The Lockstep Transaction
- RDF Lockstep File
- Multiple Concurrent Lockstep Operations
- The Lockstep Gateway Process
- Disabling Lockstep
- Reenabling Lockstep
- Lockstep Performance Ramifications
- Lockstep and Auxiliary Audit Trails
- Lockstep and Network Transactions
- Lockstep Operation Event Messages
- 15 NonStop SQL/MX and RDF
- Including and Excluding SQL/MX Objects
- Obtaining ANSI Object Names From Updater Event Messages
- Creating NonStop SQL/MX Primary and Backup Databases from Scratch
- Creating a NonStop SQL/MX Backup Database From an Existing Primary Database
- Online Database Synchronization With NonStop SQL/MX Objects
- Offline Synchronization for a Single Partition
- Online Synchronization for a Single Partition
- Correcting Incorrect NonStop SQL/MX Name Mapping
- Consideration for Creating Backup Tables
- Restoring to a Specific Location
- Comparing NonStop SQL/MX Tables
- 16 Zero Lost Transactions (ZLT)
- A RDF Command Summary
- B Additional Reference Information
- C Messages
- D Operational Limits
- E Using ASAP
- Index
Managing RDF
HP NonStop RDF System Management Manual—524388-003
5-21
Takeover Operations
Takeover Operations
If the primary system fails and you want to switch application processing to the backup
system, you need to issue the TAKEOVER command from the backup system. The
TAKEOVER command causes RDF to shut down after bringing the backup database
to a consistent state.
The RDF Takeover Operation
When updating is enabled, updaters apply audit as soon as it is safe-stored in the
image trails on the backup system. In this respect, they apply audit without waiting to
determine if the associated transactions committed or aborted. At the moment when
you lose your primary system due to some unplanned outage, the updaters may have
applied audit for transactions whose outcomes were not resolved (committed or
aborted) on the primary system at the time the primary system failed. Alternatively, the
transactions may have been resolved on the primary system, but the extractor was
stopped before it could send the final outcomes to the backup system. The takeover
operation determines what audit needs to be backed out in order to bring the backup
database into a stable and consistent state. Audit is backed out of the backup
database during three possible undo passes, described below. For takeover
considerations in a ZLT environment, see Section 16, Zero Lost Transactions (ZLT).
Phase One Undo Pass
This is also known as Local Undo. Audit can be backed out of the backup database for
two possible reasons.
•
If an updater has applied audit for a transaction whose outcome is unknown, that
audit must be backed out.
•
If RDF is replicating audit from aux audit trails and if the final outcome is known,
but not all of the audit for the transaction from an aux trail reached the backup
system, that audit must be backed out.
Transactions that must be undone during this undo pass are stored in the ZTXUNDO
file in your Master Image Trail subvolume.
Phase Two Undo Pass
This is also known as File Undo. If one or more volumes failed on the primary system
and a transaction aborted, the TMF Backout process will backout the transaction on
the volumes that are still up, but it will be unable to backout the audit on the volumes
that are down. If the downed volumes come back online, the TMF Volume Recovery
process backs out the audit that the Backout process could not back out. If, however,
the primary system failed before Volume Recovery had enabled the downed volumes,
then, if you execute the RDF Takeover operation on the backup system, the updaters
execute an undo pass that will undo the audit the Volume Recovery would have
undone on the primary system if it had been able to.