HP XC System Software Administration Guide Version 3.2

HP BladeSystems information, 83
HP documentation
providing feedback for, 26
HP Graph, 97–101
HP Serviceguard, 48–51
HP XC
command set, 33
configuration file guidelines, 38
HP XC system
booting, 53
file system hierarchy, 29
log files, 32
shutdown, 56
startup, 53
hpasm, 89
/hptc_cluster directory, 31, 60, 144, 262, 263
guidelines, 31
troubleshooting mount failure, 246
I
I/O service, 28
image replication and distribution, 139
exclusion files, 149
image server services, 29
improved availability, 41, 47–52
availability tool, 47
imaging and starting nodes, 56
in a full imaging installation, 150
Nagios, 260
NAT administration, 131
NAT failover, 131
restarting Nagios, 63, 115
shutting down the system, 56
starting nodes, 55
stopping a service, 64
transfer_from_avail command, 35
transfer_to_avail command, 35
troubleshooting, 260–261
InfiniBand
administrative password, 164
root password, 164
troubleshooting, 255
installation
fresh, 33
updated RPMs, 135
upgrade, 33
IP port
open external ports, 153
open internal ports, 153
opening a port globally, 155
opening a temporary port, 155
iptables.proto file, 155
ITRC, 135
J
job accounting, 178
log file, 178
statistics, 179
turning off, 180
turning on, 180
jobacct.log file, 178, 181
K
kernel dependent module, 136
kernel dump
analyzing, 104
obtaining, 104
kernel module
rebuilding, 136
L
license management, 79–80
license manager
restarting, 80
starting, 80
stopping, 80
Linux Virtual Server (see LVS)
lmstat command, 79
Load Sharing Facility (see LSF)
local storage, 28
local user accounts, 159
adding, 159
deleting, 161
general administration, 159
modifying, 160
locatenode command, 34
log files, 32
logging
events, 92
log files, 31
Nagios log files, 248
login service, 28
LSF
switching type of LSF installed, 194
LSF daemon, 191
moving to backup node, 206
moving to primary node, 206
LSF documentation, 22
LSF execution host, 191, 197, 203
lsf partition, 193, 207, 264
LSF services, 28
LSF-HPC with SLURM
controlling LSF-HPC with SLURM service, 197
default user environment, 196
enhancing, 207
implementation, 191
inconclusive job termination, 264
installation details, 195
job accounting, 201
job starter script, 192, 197, 199, 200
job submission, 197
load indexes, 202
maintaining shell prompts in interactive shells, 199
monitoring, 203
resource information, 202
short RUN_WINDOW for queue, 264
shutting down, 197
starting up, 196