HP XC System Software Installation Guide Version 4.0

Table 1-2 Role and Service Placement for Improved Availability (continued)
Special Considerations for Role Assignment
Service is Delivered in This
RoleService Name
By default, the management_server role is installed on the
head node. If you want improved availability for Nagios, the
management_server role must be assigned to two nodes,
the head node and one additional node.
In this case, the head node cannot have the management_hub
and console_network roles assigned to it, so you must move
those roles to the other node in the availability set.
The other node in the availability set acts as a Nagios monitor
unless the Nagios master fails over; at that time the other node
acts both as a Nagios master and a Nagios monitor.
HP recommends that the other node in the availability set also
has an external Ethernet connection so that you can run the
Nagios web interface on it.
Dependency on Other Services: To function properly, the
Nagios master service requires a highly available dbserver
service.
For more information about the management_server role,
see Section F.3.12 (page 220) .
management_server
Nagios master
To achieve improved availability of NAT, you must assign the
external role to both nodes in the availability set, and both
nodes must have a configured external Ethernet connection.
If you assign the external role to any other node that is not
part of an availability set, that node cannot act as a NAT server
because it cannot be managed by the availability tool.
During cluster_config processing, you are prompted to
supply the IP addresses of the NAT servers.
Dependency on Other Services: None.
For more information about the external role See
Section F.3.8 (page 219)
external
Network Address
Translation (NAT)
1.9.7.1 Configuring Failover Capabilities for SLURM and LSF with SLURM
Improved availability of SLURM and LSF with SLURM is not achieved through availability sets
or availability tools. Failover capabilities for SLURM and LSF with SLURM are achieved by
placing the resource_management role on two or more nodes. These nodes are not members
of any availability set, and the SLURM and LSF with SLURM software is not managed by any
availability tool.
When you assign two or more nodes with the resource_management role, SLURM availability
is automatically enabled. If you assign the resource_management to two or more nodes, you
must manually enable availability for LSF with SLURM; see Section 7.1.2 (page 124) for instructions.
Standard LSF also contains its own automatic failover mechanisms. See the Platform LSF
documentation for more information on node failure scenarios with standard LSF.
1.9.8 Using the Improved Availability Planning Worksheet
After you have completed the advance planning of your service availability strategy, use the
worksheet in Table 1-3 to record the following information:
The node names to associate into availability sets.
The availability tool that will manage the services in each availability set (if you installed
and configured more than one availability tool).
The roles (and thus, the services) to assign to both nodes in each availability set
The cluster_config utility prompts you for this information, so have the worksheet handy.
1.9 Planning a Service Availability Strategy 33