Managing HP Serviceguard for Linux, Seventh Edition, July 2007

Understanding Serviceguard Software Components

How the Cluster Manager Works

Chapter 340

as before. In such cases, packages do not halt or switch, though the

application may experience a slight performance impact during the

re-formation.

If heartbeat and data are sent over the same LAN subnet, data

congestion may cause Serviceguard to miss heartbeats during the period

of the heartbeat timeout and initiate a cluster re-formation that would

not be needed if the congestion had not occurred. To prevent this

situation, HP recommends that you dedicate a LAN for the heartbeat as

well as configuring heartbeat over the data network.

IMPORTANT When multiple heartbeats are configured, heartbeats are sent in

parallel; Serviceguard must receive at least one heartbeat to establish

the health of a node. HP recommends that you configure all subnets that

interconnect cluster nodes as heartbeat networks; this increases

protection against multiple faults at no additional cost.

Heartbeat IP addresses must be on the same subnet on each node. See

the entry for HEARTBEAT_IP, under “Cluster Configuration Parameters”

starting on page 106, for more information about heartbeat

requirements.

Each node sends its heartbeat message at a rate specified by the cluster

heartbeat interval. The cluster heartbeat interval is set in the cluster

configuration file, which you create as a part of cluster configuration,

described in Chapter 5, “Building an HA Cluster Configuration,” on

page 139.

Manual Startup of Entire Cluster

A manual startup forms a cluster out of all the nodes in the cluster

configuration. Manual startup is normally done the first time you bring

up the cluster, after cluster-wide maintenance or upgrade, or after

reconfiguration.

Before startup, the same binary cluster configuration file must exist on

all nodes in the cluster. The system administrator starts the cluster with

the cmruncl command issued from one node. The cmruncl command can

only be used when the cluster is not running, that is, when none of the

nodes is running the cmcld daemon.