HP XC System Software Administration Guide Version 3.2

TIP: Run the badmin reconfig command after the spconfig command to update LSF HPC
with the information on each node's static resources (that is, core and memory), as reported by
SLURM.
15.2.1 Configuring SLURM System Interconnect Support
SLURM has system interconnect support for Quadrics ELAN, which assists MPI jobs with the
global exchange process during startup, when each process is establishing the communication
channels with the other processes in the job.
The SwitchType SLURM configuration setting is set during cluster_config and cannot be
adjusted by the installer (except manually). The cluster_config process queries the CMDB
for the HP XC system interconnect type, and if it is Quadrics Elan, the SwitchType is set to
switch/elan. Otherwise, it is set to switch/none. This setting enables or disables SLURM
support for Quadrics Elan.
If the SwitchType setting is adjusted manually, you will need to restart SLURM:
# cexec -a service slurm restart
15.2.2 Configuring SLURM Servers
The ControlMachine and BackupController settings are configured during the
cluster_config utility; these settings are the host name of the primary controller and backup
controllers, respectively. The installer chooses from among the nodes with the resource
management role the node to run the master slurmctld daemon and (if there is more than one
node with the resource management role) the node to run the backup slurmctld daemon.
Be sure to shut down SLURM on the HP XC system before adjusting these settings manually.
See the HP XC System Software Installation Guide for information about changing the choice of
primary and backup nodes for SLURM by using the cluster_config utility.
15.2.3 Configuring Nodes in SLURM
You can change the configuration of a set of nodes by editing the slurm.conf file. SLURM
enables you to describe various node characteristics on your system. SLURM uses this description
to enable your users to select an optimal set of nodes for their jobs.
Node Characteristics
The following characteristics are useful on an HP XC system:
Feature
Alphanumeric text with meaning in the local environment. This text is a
comma-separated list of features for the specified nodes.
RealMemory
The amount of real memory (in megabytes) on the node.
If SLURM finds that the node has less real memory than the value specified,
the node's state is set to DOWN to avoid disrupting jobs that require that
minimum.
Procs
The number of processors on the node.
If SLURM finds that the node has fewer processors than the value specified,
the node's state is set to DOWN to avoid disrupting jobs that require that
minimum.
State The initial state of the node (IDLE or DOWN).
TmpDisk
The minimum temporary disk space (in megabytes) available on node.
If SLURM finds that the node has less disk space than the value specified, the
node's state is set to DOWN to avoid disrupting jobs that require that minimum.
172 Managing SLURM