HP XC System Software Release Notes for Version 3.1

7 System Administration and Management..................................................................55
7.1 Multiple %EXPR% Expressions Are Not Accepted In the nagios_vars.ini File..............................55
7.2 The collectl Utility Does Not Correctly Handle the Change to Daylight Savings Time ................55
7.3 Perform a Dry Run Before Using the si_updateclient Utility to Update Nodes.............................55
7.4 NAT Server Aliases Continue to Run When the Number of NAT Servers Are Reduced...............55
7.5 Re-edit the /etc/dhcpd.conf File if the cluster_config or discover Utilities Are Invoked Again.....56
7.6 Cannot Connect to Database During Configuration.......................................................................56
7.7 Consider Disabling Attribute Caching on Large-Scale Systems.....................................................57
7.8 Possible Problem with ext3 File Systems on SAN Storage..............................................................57
7.9 Notes That Apply to Improved Availability...................................................................................57
7.9.1 Restart Serviceguard Quorum Server if Quorum Server Node is Re-imaged........................57
7.9.2 Benign Messages.....................................................................................................................57
7.9.3 Known Limitation if Nagios is Configured for Improved Availability..................................58
7.9.4 Network Restart Command Negatively Affects Serviceguard...............................................58
7.9.5 Problem Failing Over Database Package Under Serviceguard...............................................58
8 Load Sharing Facility and Job Management............................................................61
8.1 Load Sharing Facility.......................................................................................................................61
8.1.1 Maintaining Shell Prompts in LSF-HPC Interactive Shells.....................................................61
8.1.2 Node Reboot Might Result in Inconclusive Job Termination.................................................62
8.1.3 Short LSF Queue RUN_WINDOW Can Suspend Other Jobs.................................................62
8.2 Job Management..............................................................................................................................63
9 Programming and User Environment.........................................................................65
9.1 Required HP-MPI Option on Systems With a Mix of InfiniBand PCI-X and PCI Express.............65
10 Cluster Platform 3000................................................................................................67
11 Cluster Platform 4000................................................................................................69
12 Cluster Platform 6000................................................................................................71
12.1 Network Boot Operation and Imaging Failures on HP Integrity rx2600 Systems........................71
12.2 Management Processor..................................................................................................................71
12.2.1 Required Task: Change MP Settings on Console Switches...................................................71
12.2.2 MP Disables DHCP Automatically.......................................................................................71
12.2.3 Finding the IP Address of an MP..........................................................................................71
13 Integrated Lights Out Console Management Devices............................................73
13.1 iLO2 Devices Can Become Unresponsive.....................................................................................73
14 Interconnects...............................................................................................................75
14.1 InfiniBand Interconnect.................................................................................................................75
14.1.1 No InfiniBand Graphs with Firmware Older Than Version 3.4.2.........................................75
14.2 Myrinet Interconnect.....................................................................................................................76
14.2.1 Myrinet Monitoring Line Card Can Become Unresponsive.................................................76
14.2.2 The clear_counters Command Does Not Work on the 256 Port Switch................................76
14.3 QsNet
II
Interconnect......................................................................................................................76
14.3.1 Possible Conflict with Use of SIGUSR2.................................................................................76
14.3.2 The qsnet Database May Contain Entries to Nonexistent Switch Modules.........................76
Table of Contents 5