HP XC System Software Administration Guide Version 3.2.1
short RUN_WINDOW for queue, 268
shutting down, 201
starting up, 200
troubleshooting, 267–269
LSF-HPC with SLURM failover, 207, 208, 268
    running jobs, 209
LSF-HPC with SLURM integration, 195
LSF-HPC with SLURM interplay, 215
LSF-HPC with SLURM jobs
    controlling, 202
    monitoring, 202
lsf.conf file, 211
lshosts command, 206
lsload command, 206
LVS, 43, 60
    director service, 28
M
manage_enclosure command, 35
manage_mcs_status command, 317, 318
managedb command, 35
    archive, 83
    backup, 83
    dump, 85
    purge, 84
    restore, 83, 84
management hub services, 29
management nodes, 28
managing licenses, 79
manpages, 25
MCS, 317–320
    log files, 319
    MCS cluster monitor, 319
    MCS traps monitor, 319
MCS device
    as Nagios host, 319
    monitored by Nagios, 317
    status, 317
mcs.ini file, 317
mcs_config command, 318
mcs_local.cfg file, 317
mcs_trends.log file, 319
mcs_trends.staticdb file, 319
mdadm command, 232, 278
    examining RAID array, 232
mdadm utility, 24
mirroring, 231
modifying a local user account, 164
Modular Cooling System (see MCS)
modulefile
    loading, 44, 217
    managing, 44, 217
    unloading, 44, 217
    viewing available, 44, 217
    viewing loaded, 44, 217
monitoring
    hierarchy, 88
    strategy, 88
monitoring SLURM, 186
monitoring tools, 87
mounting file systems, 219
MPICH, 315–316
MUNGE authentication package, 266
Myrinet system interconnect
    administrative password, 167
    diagnostic tools, 242
    troubleshooting, 256–257
MySQL, 29, 39, 81
    accessing, 81
    cannot connect to MySQL server, 249
N
Nagios, 61, 107–131, 186
    changing default user name, 120
    configuration files, 41
    configuring, 122
    customizing for MCS monitoring, 317
    default alert message format, 126
    default settings, 124
    determining status of nagios service, 251
    files, 109
    global settings, 120
    host, 108, 112
    log files, 252
    LSF monitoring, 207
    LSF-HPC with SLURM monitoring, 207
    main window, 109
    MCS monitoring, 317
    menu, 89
    messages reported by, 253–256
    optional configuration, 122
    restarting, 117
    Service Detail View, 112
    Service Problems View, 114
    stopping, 117
    Tactical Overview, 112
    troubleshooting, 251
    updating configuration, 117
    views, 111
    web interface, 109
Nagios alert messages, 126
    default format, 126
    forwarding, 118
Nagios plug-in, 122
    disabling, 122
    MCS, 319
    running manually, 252
Nagios report generator utility (see nrg utility)
nagios_monitor service, 61
nagios_vars.ini file, 119
    MCS monitoring, 317
    multiple %EXPR% expressions, 253
Nan, 128
nand daemon (see Nan)
NAT, 43
    administration, 133
    client, 133
    server, 133