
advisable to take certain actions to minimize the amount of data that will be lost, as
explained in the following discussion.
Minimize the Use and Amount of Memory-Based Data
Any in-memory data (the in-memory context) will be lost when a failure occurs. The
application should be designed to minimize the amount of in-memory data it maintains,
unless that data can be easily recalculated. When the application restarts on the standby
node, it must recalculate or reread from disk any information it needs to have in memory.
One way to measure the speed of failover is to calculate how long it takes the application
to start up on a normal system after a reboot. Does the application start up immediately?
Or are there a number of steps the application must go through before an end-user can
connect to it? Ideally, the application can start up quickly without having to reinitialize
in-memory data structures or tables.
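For example, on startup the application might reread its persistent context from a file
on shared storage rather than depend on anything that existed only in memory on the
failed node. The following sketch is illustrative only; the state-file path and JSON
format are assumptions, not part of Serviceguard.

    import json
    import os

    STATE_FILE = "/shared_vg/app/state.json"    # assumed path on a shared (cluster) file system

    def load_state():
        """Reread persistent context from shared disk at startup.

        Anything that existed only in memory on the failed node is gone;
        whatever cannot be reread here must be recalculated.
        """
        if os.path.exists(STATE_FILE):
            with open(STATE_FILE) as f:
                return json.load(f)
        return {}                               # nothing on disk yet: start empty

    def main():
        state = load_state()
        # ... resume serving clients using the reloaded context ...

    if __name__ == "__main__":
        main()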
Performance concerns might dictate that data be kept in memory rather than written to
the disk. However, the risk associated with the loss of this data should be weighed against
the performance impact of posting the data to the disk.
Data that is read from a shared disk into memory and then used as read-only data can
be kept in memory without concern.
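One way to manage the memory-versus-disk trade-off described above is to post the
in-memory context to shared disk at a tunable interval: a shorter interval risks less
data at failover but costs more I/O. The sketch below assumes a simple timed checkpoint
loop; the interval, file location, and placeholder work routine are invented for the
example.

    import json
    import time

    CHECKPOINT_FILE = "/shared_vg/app/checkpoint.json"   # assumed location on shared storage
    CHECKPOINT_INTERVAL = 30                              # seconds; tune I/O cost vs. acceptable loss

    def checkpoint(state):
        """Post the current in-memory context to shared disk."""
        with open(CHECKPOINT_FILE, "w") as f:
            json.dump(state, f)

    def process_next_request(state):
        """Placeholder for the application's real per-request work."""
        state["requests"] = state.get("requests", 0) + 1
        time.sleep(0.1)

    def run(state):
        last = time.monotonic()
        while True:
            process_next_request(state)
            if time.monotonic() - last >= CHECKPOINT_INTERVAL:
                checkpoint(state)     # at most CHECKPOINT_INTERVAL seconds of work is at risk
                last = time.monotonic()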
Keep Logs Small
Some databases permit logs to be buffered in memory to increase online performance.
Of course, when a failure occurs, any in-flight transaction will be lost. However,
minimizing the size of this in-memory log will reduce the amount of completed transaction
data that would be lost in case of failure.
Keeping the size of the on-disk log small allows the log to be archived or replicated more
frequently, reducing the risk of data loss if a disaster were to occur. There is, of course,
a trade-off between online performance and the size of the log.
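As a rough application-level illustration of the same trade-off, a log might buffer
records in memory and flush them to the on-disk log once a small, fixed number have
accumulated. The buffer size and file name below are assumptions for the sketch, not
parameters of any particular database.

    import os

    LOG_FILE = "/shared_vg/app/app.log"    # assumed on-disk log on shared storage
    MAX_BUFFERED = 8                       # at most 8 completed records can be lost on failure

    _buffer = []

    def log_record(record):
        """Buffer a record in memory; flush once the buffer is full.

        A smaller MAX_BUFFERED costs more I/O but bounds how much
        completed work can be lost if the node fails before a flush.
        """
        _buffer.append(record)
        if len(_buffer) >= MAX_BUFFERED:
            flush()

    def flush():
        """Append buffered records to the on-disk log and force them to disk."""
        global _buffer
        if not _buffer:
            return
        with open(LOG_FILE, "a") as f:
            f.write("\n".join(_buffer) + "\n")
            f.flush()
            os.fsync(f.fileno())
        _buffer = []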
Eliminate Need for Local Data
When possible, eliminate the need for local data. In a three-tier, client/server environment,
the middle tier can often be dataless (i.e., there is no local data that is client-specific or
needs to be modified). This “application server” tier can then provide additional levels
of availability, load-balancing, and failover. However, this scenario requires that all data
be stored either on the client (tier 1) or on the database server (tier 3).
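A dataless handler in this sense uses only what arrives with the request and what is
stored on the tier-3 database server, so any instance of the middle tier can serve any
client and a failed instance loses nothing. The function below is a schematic sketch;
the table, column, and parameter names are invented, and the database connection is
assumed to follow the Python DB-API.

    def handle_request(db_conn, client_id, order_id):
        """Dataless application-server handler: no client-specific state is kept locally."""
        cur = db_conn.cursor()
        cur.execute(
            "SELECT status FROM orders WHERE client_id = %s AND order_id = %s",
            (client_id, order_id),
        )
        row = cur.fetchone()
        return {"order_id": order_id, "status": row[0] if row else "unknown"}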
Use Restartable Transactions
Transactions need to be restartable so that the client does not need to re-enter or back
out of the transaction when a server fails and the application is restarted on another
system. In other words, if a failure occurs in the middle of a transaction, there should be
no need to start over again from the beginning. This capability makes the application
more robust and reduces the visibility of a failover to the user.
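One common way to make a transaction restartable is to key it by a client-supplied
identifier and record its completion atomically with the work itself, so that a retry
after failover either finds the transaction already committed or repeats it from a known
starting point. The table and helper names below are assumptions for illustration.

    def submit_transaction(db_conn, txn_id, apply_changes):
        """Apply a transaction at most once, keyed by a client-supplied txn_id.

        If the server fails mid-transaction and the client resubmits on the
        adoptive node, the completed-transaction check keeps the work from
        being redone; an uncommitted attempt is simply rolled back.
        """
        cur = db_conn.cursor()
        cur.execute("SELECT 1 FROM completed_txns WHERE txn_id = %s", (txn_id,))
        if cur.fetchone():
            return "already-committed"      # retry after failover: nothing to redo
        try:
            apply_changes(cur)              # the transaction's actual work
            cur.execute("INSERT INTO completed_txns (txn_id) VALUES (%s)", (txn_id,))
            db_conn.commit()                # work and completion marker commit together
            return "committed"
        except Exception:
            db_conn.rollback()              # client may resubmit with the same txn_id
            raise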
A common example is a print job. Printer applications typically schedule jobs. When
a job completes, the scheduler goes on to the next one. If, however, the system dies
in the middle of a long job (say it is printing paychecks for 3 hours), what happens when