SLURM Reference Manual for HP XC System Software

SRUN Control Options
These SRUN options control how a SLURM job manages its nodes and other resources, what its
working features (such as job name) are, and how it gives you help. Separate "constraint" options (page
36) (which behave like PSUB constraints) and I/O options (page 33) appear in other subsections on SRUN.
Most control options have a one-character, one-hyphen (UNIX) format and an alternative keyword,
two-hyphen (Linux) format, shown together here. System administrators see also SCONTROL (page 65).
Node Management
On All Machines:
-k (lowercase, --no-kill)
avoids automatic termination if any node fails that has been allocated to this job. The
job assumes responsibility for handling such node failures internally. (SLURM's
default is to terminate a job if any of its allocated nodes fail.)
-K (uppercase, --kill-on-bad-exit)
(default) terminates a job if any task has a nonzero exit code.
-m dist (lowercase, --distribution=dist)
tells SLURM how to distribute tasks among nodes for this job, where the choices for
dist are:
block assigns tasks in order to each CPU on one node before assigning
any to the next node. This is the default if the number of tasks
exceeds the number of nodes requested.
cyclic assigns tasks "round robin" across all allocated nodes (task1 goes
to the rst node, task2 goes to the second node, etc.). This is the
default if the number of nodes requested equals or exceeds the
number of tasks.
hostle assigns tasks to nodes in the order specied by the le named in the
environment variable SLURM_HOSTFILE.
-r n (lowercase, --relative=n)
offsets the rst job step to node n of this job's allocated node set (where the rst node
is 0). Option -r is incompatible with "constraint" options -w and -x, and it is ignored
when you run a job without a prior node allocation (default for n is 0). SRUN does
not support job steps on BlueGene/L.
-s (lowercase, --share)
allows this job to share nodes with other running jobs. Sharing nodes often starts the
job faster and boosts system utilization, but it can also lower application performance.
SLURM Reference Manual - 24