HP XC System Software Administration Guide Version 3.1

file system hierarchy, 27
log files, 30
shutdown, 52
startup, 49
hpasm, 85
/hptc_cluster directory, 28, 56, 134, 240–241
guidelines, 28
I
I/O service, 26
image replication and distribution, 129
exclusion files, 138
image server services, 26
improved availability, 38, 43–48
availability tool, 43
imaging and starting nodes, 51
in a full imaging installation, 139
NAT administration, 125
NAT failover, 125
restarting Nagios, 59, 110
shutting down the system, 52
starting nodes, 51
stopping a service, 60
transfer_from_avail command, 32
transfer_to_avail command, 32
InfiniBand
administrative password, 153
troubleshooting, 238
installation
fresh, 30
upgrade, 30
IP port
open external ports, 143
open internal ports, 143
opening a port globally, 145
opening a temporary port, 144
iptables.proto file, 145
J
job accounting, 165
log file, 165
statistics, 166
turning off, 167
turning on, 167
jobacct.log file, 165, 168
K
kernel dump
analyzing, 98
obtaining, 98
L
license management, 75–76
license manager
restarting, 76
starting, 76
stopping, 76
Linux Virtual Server (see LVS)
lmstat command, 75
Load Sharing Facility (see LSF)
local storage, 26
local user accounts, 149
adding, 149
deleting, 151
general administration, 149
modifying, 150
locatenode command, 31
log files, 30
logging
events, 87
logfiles, 28
Nagios log files, 230
login service, 26
LSF
switching type of LSF installed, 182
LSF daemon, 178
LSF documentation, 20
LSF execution host, 178, 184, 189
lsf partition, 181, 192, 241
LSF services, 26
LSF-HPC
default user environment, 183
job accounting, 186
load indexes, 187
resource information, 187
shutting down, 184
LSF-HPC with SLURM
controlling LSF-HPC with SLURM service, 184
enhancing, 191
implementation, 178
installation details, 182
job submission, 184
starter script, 179, 184
starting up, 183
troubleshooting, 241
LSF-HPC with SLURM failover, 189–190, 242
running jobs, 191
LSF-HPC with SLURM failover, 190
LSF-HPC with SLURM integration, 178
LSF-HPC with SLURM interplay, 196
LSF-HPC with SLURM jobs
controlling, 185
monitoring, 185
LSF-HPC with SLURM monitoring, 188
lsf.conf file, 191
lshosts command, 187
lsload command, 187
LVS, 38, 56
director service, 26
M
manage_enclosure command, 31
manage_mcs_status command, 273–274
managedb command, 31
archive, 80
backup, 79
dump, 81
purge, 80