HP XC System Software Administration Guide Version 3.1

About This Document
This document describes the procedures and tools that are required to maintain the HP XC system. It
provides an overview of the administrative environment and describes administration tasks, node
maintenance tasks, Load Sharing Facility (LSF®) administration tasks, and troubleshooting information.
An HP XC system is integrated with several open source software components. Some open source software
components are being used for underlying technology, and their deployment is transparent. Some open
source software components require user-level documentation specific to HP XC systems, and that kind
of information is included in this document, if required.
HP relies on the documentation provided by the open source developers to supply the information you
need to use their product. For links to open source software documentation for products that are integrated
with the HP XC system, see “Supplementary Software Products” (page 20).
Documentation for third-party hardware and software components that are supported on the HP XC
system is supplied by the third-party vendor. However, information about the operation of third-party
software is included in this document if the functionality of the third-party component differs from standard
behavior when used in the XC environment. In this case, HP XC documentation supersedes information
supplied by the third-party vendor. For links to related third-party Web sites, see “Supplementary Software
Products” (page 20).
Standard Linux® administrative tasks or the functions provided by standard Linux tools and commands
are documented in commercially available Linux reference manuals and on various Web sites. For more
information about obtaining documentation for standard Linux administrative tasks and associated topics,
see the list of Web sites and additional publications provided in “Related Software Products and Additional
Publications” (page 21).
1 Intended Audience
This document is intended for experienced Linux and LSF system administrators who already know how
to perform standard administrative tasks and who are also familiar with the HP XC system architecture,
components, and concepts.
You must be familiar with the following:
The Linux operating system
System administration techniques and procedures
The accompanying cluster platform hardware documents and the HP XC Hardware Preparation Guide.
These documents describe how to physically set up the racks, switches, and nodes before beginning
the software installation process.
HP XC System Software Installation Guide
2 New and Changed Information in This Edition
New chapter on the improved availability feature.
New section for configuration file guidelines
Updated information for customizing roles and services.
New section for obtaining information on blade enclosures for HP XC systems with HP BladeSystems.
New section on the smartd daemon for monitoring disks.
New section on modifying syslog-ng rules files.
New section on graphical monitoring through Nagios, HP Graphs
New section on the netdump and crash utilities.
New chapter on Nagios including sections on using the Web interface, configuring Nagios, and the
Nagios Report Generator utility.
New section on changing administrative passwords.
New section on enabling SLURM to recognize a new node.
New section on removing SLURM.
The chapter on LSF-HPC with SLURM is reorganized for ease of use.
1 Intended Audience 17