HP XC System Software Administration Guide Version 3.2.1

hostgroup command, 35
HowTo
web address, 20
HP BladeSystems information, 83
HP documentation
providing feedback for, 25
HP Graph, 97–101
HP Serviceguard, 48–51
HP XC
command set, 34
configuration file guidelines, 39
HP XC system
booting, 53
file system hierarchy, 30
log files, 33
shutdown, 56
startup, 53
hpasm, 89
/hptc_cluster directory, 31, 60, 148, 266, 267
guidelines, 31
troubleshooting mount failure, 250
I
I/O service, 29
image replication and distribution, 143
exclusion files, 153
image server services, 29
improved availability, 42, 47–52
availability tool, 47
imaging and starting nodes, 56
in a full imaging installation, 154
Nagios, 264
NAT administration, 133
NAT failover, 133
restarting Nagios, 63, 117
shutting down the system, 56
starting nodes, 55
stopping a services, 64
transfer_from_avail command, 36
transfer_to_avail command, 36
troubleshooting, 264–265
InfiniBand
administrative password, 168
root password, 168
troubleshooting, 259
installation
fresh, 34
updated RPMs, 139
upgrade, 34
IP port
open external ports, 157
open internal ports, 157
opening a port globally, 159
opening a temporary port, 159
iptables.proto file, 159
ITRC, 139
J
job accounting, 182
log file, 182
statistics, 183
turning off, 184
turning on, 184
jobacct.log file, 182, 185
K
kernel dependent module, 140
kernel dump
analyzing, 105
obtaining, 105
kernel module
rebuilding, 140
L
license management, 79–80
license manager
restarting, 80
starting, 80
stopping, 80
Linux Virtual Server (see LVS)
lmstat command, 79
Load Sharing Facility (see LSF)
local storage, 28
local user accounts, 163
adding, 163
deleting, 165
general administration, 163
modifying, 164
locatenode command, 35
log files, 33
logging
events, 92
logfiles, 31
Nagios log files, 252
login service, 28
LSF
switching type of LSF installed, 198
LSF daemon, 195
moving to backup node, 210
moving to primary node, 210
LSF documentation, 22
LSF execution host, 195, 201, 207
lsf partition, 197, 211, 268
LSF services, 29
LSF-HPC with SLURM
controlling LSF-HPC with SLURM service, 201
default user environment, 200
enhancing, 211
implementation, 195
inconclusive job termination, 268
installation details, 199
job accounting, 205
job starter script, 196, 201, 203, 204
job submission, 201
load indexes, 206
maintaining shell prompts in interactive shells, 203
monitoring, 207
resource information, 206
329