Availability Guide for Application Design

Overview of Server and Network Fault Tolerance

Availability Guide for Application Design—525637-004

2-14

System Process Pairs

Checkpointing Data in the Disk Process

The need to checkpoint data depends on whether the operation is retryable. For a

retryable operation, such as a read or a full sector write, there is no need to checkpoint

any data. The file system can simply retry the operation on behalf of the application.

The extent to which partial sector writes are retryable depends on whether the file is

audited. TMF software provides audit protection for operations that take place in

transaction mode on audited files. Audit protection keeps a copy of the data record or

row before the file was updated.

In the case of audited files, if the primary disk process fails, the backup disk process

receives the resent request from the file system. The backup disk process can apply

the request to the before-image data before writing the record or row to disk. In the

case of audited files, the only information that need be checkpointed to the backup disk

process is a transaction identifier and record lock information. For more information on

audited files, refer to Section 4, Data Protection and Recovery.

For nonretryable operations, such as a partial sector write to a nonaudited file, the disk

process must checkpoint the entire updated segment before writing to the disk. Once

the checkpoint is acknowledged, the primary process can go ahead and write out the

data; the backup process has a full sector available for retry.

Figure 2-5. A Disk Process: An Example of an I/O Process Pair

Processor 0 Processor 1 Processor 2

ServerNet

Adapter

Dual ServerNet Fabrics

Mirrored Disks

Device

tables

Application

process

Primary disk

process

Backup disk

process

File system

VST205.vdd

ServerNet

Adapter