HP XC System Software Administration Guide Version 3.2

16.16 Enhancing LSF-HPC with SLURM
You can set environment variables to influence the operation of LSF-HPC with SLURM in the
HP XC system. These environment variables affect the operation directly and set thresholds for
the interplay between LSF-HPC with SLURM and SLURM.
16.16.1 LSF-HPC with SLURM Enhancement Settings
Table 16-3 describes the environment variables in the lsf.conf file that you can use to enhance
LSF-HPC with SLURM.
Table 16-3 Environment Variables for LSF-HPC with SLURM Enhancement (lsf.conf File)
Environment Variable
    Description

LSB_RLA_PORT=port_number
    This entry specifies the TCP port used for communication between
    the LSF-HPC with SLURM allocation adapter (RLA) and the SLURM
    scheduler plug-in.
    The default port number is 6883.

LSB_RLA_TIMEOUT=seconds
    This entry defines the communications timeout between RLA and its
    clients (for example, sbatchd and the SLURM scheduler plug-in).
    The default value is 10 seconds.

LSB_RLA_UPDATE=seconds
    This entry specifies how often the LSF-HPC with SLURM scheduler
    refreshes free node information from RLA.
    The default value is 600 seconds.

LSB_RLA_WORKDIR=directory
    This entry specifies the directory in which to store the RLA status
    file. The status file enables RLA to recover its original state when
    it restarts. When RLA first starts, it creates the directory defined
    by LSB_RLA_WORKDIR if it does not exist, then creates subdirectories
    for each host. Avoid using /tmp or any other directory that is
    automatically cleaned up by the system. Unless your installation has
    restrictions on the LSB_SHAREDIR directory, use the default for
    LSB_SHAREDIR.
    The default value is LSB_SHAREDIR/cluster_name/rla_workdir.

LSB_SLURM_BESTFIT=Y
    This setting selects one of two systemwide scheduling modes:

    first-fit   Under this mode, the scheduler allocates the first free
                nodes it encounters.
                By default, LSF-HPC with SLURM works in first-fit mode.

    best-fit    Under this mode, the scheduler applies a set of criteria
                to choose nodes with minimal capacities that satisfy
                the job request.
                Set LSB_SLURM_BESTFIT=Y to switch the scheduler to
                best-fit mode.
                In a heterogeneous HP XC system, a best-fit allocation
                may be preferable for clusters running a mix of serial
                and parallel jobs. In this context, best fit means "the
                nodes that minimally satisfy the requirements." Nodes
                with the maximum number of processors are chosen
                first. For parallel and serial jobs, the nodes with
                minimal memory, minimal tmp space, and minimal
                weight are chosen.

LSB_SLURM_NONLSF_USE=Y
    By default, LSF-HPC with SLURM periodically ends unrecognized jobs
    in the SLURM lsf partition.
    Setting this entry to Y stops LSF-HPC with SLURM from periodically
    terminating unrecognized jobs in the SLURM lsf partition.
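
To make the defaults and the two scheduling modes in Table 16-3 more concrete, the following Python sketch may help. It is an illustration only and is not part of LSF-HPC or SLURM. The parse_conf and choose_nodes helpers, the Node fields, the node names, and the sample values are all hypothetical, and the best-fit ordering is one possible reading of the description above (most processors first, then least memory, least tmp space, and lowest weight), not the scheduler's actual implementation.

```python
from dataclasses import dataclass

# Defaults as documented in Table 16-3. LSB_RLA_WORKDIR is left out because
# its default depends on LSB_SHAREDIR and the cluster name.
DEFAULTS = {
    "LSB_RLA_PORT": "6883",
    "LSB_RLA_TIMEOUT": "10",
    "LSB_RLA_UPDATE": "600",
}


def parse_conf(text):
    """Hypothetical helper: read NAME=value lines, skipping blanks and # comments."""
    settings = dict(DEFAULTS)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, value = line.split("=", 1)
        settings[name.strip()] = value.strip()
    return settings


@dataclass
class Node:
    name: str
    cpus: int
    mem_mb: int
    tmp_mb: int
    weight: int


def choose_nodes(free_nodes, count, min_cpus, min_mem_mb, best_fit):
    """Return `count` nodes that satisfy the request, or [] if too few qualify."""
    eligible = [n for n in free_nodes
                if n.cpus >= min_cpus and n.mem_mb >= min_mem_mb]
    if best_fit:
        # Assumed ordering: most processors first, then least memory,
        # least tmp space, and lowest weight as tie-breakers.
        eligible.sort(key=lambda n: (-n.cpus, n.mem_mb, n.tmp_mb, n.weight))
    # Without best_fit, the order in which free nodes were listed is kept.
    return eligible[:count] if len(eligible) >= count else []


# Sample lsf.conf-style entries (illustrative values only).
conf = parse_conf("""
LSB_RLA_UPDATE=300
LSB_SLURM_BESTFIT=Y
""")
best_fit = conf.get("LSB_SLURM_BESTFIT") == "Y"

nodes = [
    Node("n1", cpus=2, mem_mb=8192, tmp_mb=4096, weight=1),
    Node("n2", cpus=4, mem_mb=16384, tmp_mb=4096, weight=1),
    Node("n3", cpus=4, mem_mb=8192, tmp_mb=2048, weight=1),
]
first = choose_nodes(nodes, 1, min_cpus=2, min_mem_mb=4096, best_fit=False)
best = choose_nodes(nodes, 1, min_cpus=2, min_mem_mb=4096, best_fit=best_fit)
print([n.name for n in first], [n.name for n in best])
```

In this sketch, first-fit returns the first listed node that satisfies the request, while best-fit reorders the eligible nodes before choosing. Entries that are not set in the sample text, such as LSB_RLA_PORT, fall back to the documented defaults.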