SLURM Reference Manual for HP XC System Software
Table Of Contents
- Preface
- Introduction
- SLURM Goals and Roles
- SLURM Features
- SLURM Operation
- SLURM Utilities
- SRUN (Submit Jobs)
- SQUEUE (List Jobs)
- SINFO (List Nodes)
- SMAP (Show Job Geometry)
- SCONTROL (Manage Configurations)
- Disclaimer
- Keyword Index
- Alphabetical List of Keywords
- Date and Revisions
SLURM and Operating Systems
SLURM was originally used as a resource manager for Linux (specifically for CHAOS) systems. But
starting in 2006, LC began gradually replacing IBM's native LoadLeveler with SLURM on its AIX systems
as well. The AIX-SLURM combination behaves (and has been configured by LC system administrators
to behave) slightly differently than the CHAOS-SLURM combination, however. This means that to answer
a job-control question increasingly requires knowing both the relevant resource manager and the current
operating system.
This table summarizes the known tool contrasts among the different resource-manager/operating-system
combinations now possible on LC machines:
AIX +
SLURM
AIX +
LoadLeveler
Linux (CHAOS) +
SLURM
POEPOESRUNStart a parallel job:
POE
-rmpool pname
(not number)
POE
-rmpool pnumber
(not name)
SRUN
-w or --nodelist
Specify a node pool:
SQUEUELLQSQUEUEGet job information:
SINFOLLSTATUSSINFOGet node information:
SCANCELLLCANCELSCANCELCancel a started job:
YESYESNOPSUB -g option applies:
SLURM Reference Manual - 9