HP XC System Software Administration Guide Version 3.0
Propagating Resource Limits............................................................................................................107
Restricting User Access to Nodes...........................................................................................................109
Job Accounting...................................................................................................................................109
Using the sacct Command..............................................................................................................110
Disabling Job Accounting...............................................................................................................110
Configuring Job Accounting............................................................................................................111
Monitoring SLURM..............................................................................................................................113
Draining Nodes..................................................................................................................................113
Configuring the SLURM Epilog Script.....................................................................................................115
SLURM Daemon Log Maintentance........................................................................................................116
13 Managing LSF
Administering Standard LSF..................................................................................................................117
Administering LSF-HPC.........................................................................................................................118
Integration of LSF-HPC with SLURM...................................................................................................118
Job Starter Scripts.....................................................................................................................119
SLURM External Scheduler.........................................................................................................120
SLURM lsf Partition....................................................................................................................121
LSF-HPC Failover.......................................................................................................................121
Installation of LSF-HPC on SLURM.....................................................................................................122
LSF-HPC Startup and Shutdown........................................................................................................123
Starting Up LSF-HPC..................................................................................................................123
Shutting Down LSF-HPC.............................................................................................................123
Controlling the LSF-HPC Service.......................................................................................................123
Load Indexes and Resource Information............................................................................................124
Launching Jobs with LSF-HPC...........................................................................................................125
Monitoring and Controlling LSF-HPC Jobs..........................................................................................126
Job Accounting..............................................................................................................................127
LSF-HPC Failover............................................................................................................................127
Overview of LSF-HPC Monitoring and Failover Support..................................................................127
Interplay of LSF-HPC and SLURM................................................................................................128
Assigning the Resource Management Nodes................................................................................128
LSF-HPC Failover and Running Jobs.............................................................................................129
LSF-HPC Monitoring.......................................................................................................................129
LSF Execution Host Failure..........................................................................................................129
Enhancing LSF-HPC........................................................................................................................129
LSF-HPC Enhancement Settings...................................................................................................130
Thresholds in LSF-HPC-SLURM Interplay........................................................................................135
Configuring an External Virtual Host Name for LSF-HPC on HP XC Systems............................................135
LSF Daemon Log Maintentance.............................................................................................................136
14 Managing Modulefiles
15 Mounting File Systems
Overview of the Network File System on the HP XC System.......................................................................139
Understanding the Global fstab File.......................................................................................................139
Mounting Internal File Systems Throughout the HP XC System....................................................................141
Understanding the csys Utility in the Mounting Instructions...................................................................142
Mounting Internal File Systems.........................................................................................................142
Mounting Remote File Systems..............................................................................................................144
Understanding the Mounting Instructions...........................................................................................145
Mounting a Remote File System........................................................................................................145
16 Using Diagnostic Tools
Using the sys_check Utility....................................................................................................................149
Using the ovp Utility for System Verification............................................................................................149
6 Table of Contents