HP XC System Software Administration Guide Version 3.1

15.6 Controlling the LSF-HPC with SLURM Service
15.7 Launching Jobs with LSF-HPC with SLURM
15.8 Monitoring and Controlling LSF-HPC with SLURM Jobs
15.9 Job Accounting
15.10 LSF Daemon Log Maintenance
15.11 Load Indexes and Resource Information
15.12 LSF-HPC with SLURM Monitoring
15.13 LSF-HPC with SLURM Failover
15.13.1 Overview of LSF-HPC with SLURM Monitoring and Failover Support
15.13.2 Interplay of LSF-HPC with SLURM
15.13.3 Assigning the Resource Management Nodes
15.13.4 LSF-HPC with SLURM Failover and Running Jobs
15.13.5 Manual LSF-HPC with SLURM Failover
15.14 Enhancing LSF-HPC with SLURM
15.14.1 LSF-HPC with SLURM Enhancement Settings
15.14.2 Thresholds in LSF-HPC with SLURM and SLURM Interplay
15.15 Configuring an External Virtual Host Name for LSF-HPC with SLURM on HP XC Systems
16 Managing Modulefiles
17 Mounting File Systems
17.1 Overview of the Network File System on the HP XC System
17.2 Understanding the Global fstab File
17.3 Mounting Internal File Systems Throughout the HP XC System
17.3.1 Understanding the csys Utility in the Mounting Instructions
17.3.2 Mounting Internal File Systems
17.4 Mounting Remote File Systems
17.4.1 Understanding the Mounting Instructions
17.4.2 Mounting a Remote File System
18 Managing Software RAID Arrays
18.1 Overview of Software RAID
18.1.1 Software RAID-0
18.1.2 Software RAID-1
18.2 Installing Software RAID on the Head Node
18.3 Installing Software RAID on Client Nodes
18.4 Examining a Software RAID Array
18.5 Error Reporting
18.6 Removing Software RAID from Client Nodes
19 Using Diagnostic Tools
19.1 Using the sys_check Utility
19.2 Using the ovp Utility for System Verification
19.3 Using the dgemm Utility to Analyze Performance
19.4 Using the System Interconnect Diagnostic Tools
19.4.1 HP XC Diagnostic Tools for the Myrinet System Interconnect
19.4.1.1 The gm_prodmode_mon Diagnostic Tool
19.4.1.2 The gm_drain_test Diagnostic Tool
19.4.2 Using Diagnostic Tools for the Quadrics System Interconnect
19.4.2.1 The swmlogger Daemon
19.4.2.2 The qselantestp Diagnostic Tool
19.4.2.3 The qsnet2_level_test Diagnostic Tool
19.4.2.4 The qsnet2_drain_test Diagnostic Tool
19.4.3 Using Diagnostic Tools for the InfiniBand Interconnect
19.4.4 Using Diagnostic Tools for the Gigabit Ethernet System Interconnect