Serviceguard Version A.11.
Legal Notices

The information contained in this document is subject to change without notice. Hewlett-Packard makes no warranty of any kind with regard to this manual, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. Hewlett-Packard shall not be liable for errors contained herein or direct, indirect, special, incidental or consequential damages in connection with the furnishing, performance, or use of this material.
A copy of the specific warranty terms applicable to your Hewlett-Packard product and replacement parts can be obtained from your local Sales and Service Office.
Serviceguard Version A.11.16 Release Notes, Second Edition

Chapter 1 Serviceguard Version A.11.
Printing History

Table 1-1 Printing History for Serviceguard Version A.11.16 Release Notes (HP part number B3935-90075)

Printing Date    Revision
Sept 2004        Updated for HP-UX 11i v2, September 2004
June 2004        Initial release of version A.11.
Announcements

Serviceguard is a specialized software product that protects mission-critical applications from a wide variety of hardware and software failures, and ensures data integrity. The following Serviceguard versions are now available:

• For HP-UX 11i v2 (B.11.23):
  - Product T1905BA, A.11.16, software and license
  - Product T1906BA, A.11.16, documentation
• For HP-UX 11i v1 (B.11.11):
  - Product B3935DA, A.11.
availability consulting. In addition, you should work with your HP representative to ensure that you have the latest firmware revisions for disk drives, disk controllers, LAN controllers, and other hardware.
What’s in this Version

The A.11.16 version of Serviceguard is a platform release. This release contains new functionality, defect repairs, and support for new hardware configurations. This second edition is revised with content for the HP-UX 11i v2 September 2004 update (externally also known as HP-UX 11i v2 update 2).
• Clusters and packages can now be configured through Serviceguard’s graphical user interface, Serviceguard Manager. This interface replaces all the functionality of the SAM Cluster Tool, which has been discontinued with the A.11.16 version of Serviceguard.
• A new parameter, Network Failure Detection, gives users a choice about how the network monitor decides to declare a LAN card down.
Access Control Policies

Non-root access to Serviceguard is now defined in the cluster and package configuration files, in a parameter called Access Control Policy. You can have up to 200 policies in a cluster. Policies can be added, modified, or deleted from the configuration without halting the cluster or the package.
- Full Admin: Includes Monitor and Package Admin privileges. This user can issue commands to administer the cluster. It is defined in the cluster configuration file. On the command line, users can issue the cmhaltcl, cmruncl, cmhaltnode, and cmrunnode commands. In the graphical user interface, these menu choices are offered: run or halt a cluster, run or halt a node, and run or halt a System Multi-node Package.
If you are not sure which method is best for your environment, consult the technical white paper, Serviceguard Network Manager, posted at http://docs.hp.com/hpux/ha -> Serviceguard -> White Papers.

What Documents are Available for This Version

The following documents relate to Serviceguard A.11.16 and related products. They can be found on the web at http://docs.hp.com/hpux/ha.

• Managing Serviceguard, A.11.16 (B3936-90079).
• Writing Monitors for the Event Monitoring Service (B7611-90016) available from http://software.hp.com in the “High Availability” area.
Compatibility Information and Installation Requirements

Read this entire document and any other Release Notes or READMEs you may have before you begin an installation.

Compatibility

Serviceguard version A.11.16 is compatible with HP-UX 11i v1 (11.11) and 11i v2 (11.23). The first release of 11i v2 includes the Cluster Object Manager (COM) Version B.03.00.
until a newer version of Serviceguard becomes available. In order to receive fixes for any defects found in a feature release after a newer version is released, the customer will need to upgrade to the newer, supported version.

Patch

A patch to a release is issued in response to a critical business problem found by a Serviceguard customer.
Required Firmware Upgrades for FibreChannel SCSI Multiplexer Model A3308A

The Model A3308A FibreChannel SCSI Multiplexer should be upgraded to Firmware revision 3810 (980611) or newer. This firmware revision supports SCSI II Reserve/Release functionality. This is required in order to enforce exclusive access to tape devices.
Restriction on LUNs on 11i v2 September 2004 update

Configurations using VxVM or CVM with the 11i v2 September 2004 update are restricted to a maximum of 256 LUNs, pending fixes to JAGaf36081 and JAGaf36760. For updates on these issues, please check the web page http://www2.itrc.hp.com. Search the Technical Knowledge base for bug reports using the defect number as the keyword.
You can, however, create a cmclnodelist file to provide “bootstrap” monitor access. Bootstrap files are useful if you are doing a rolling upgrade, so that nodes with older versions can still access the newer cluster nodes.
Serviceguard Manager checks access at two points: once when the user logs in to the Session Server, and again when the Session Server contacts the target node. For more information about Serviceguard Manager policies, see the Serviceguard Manager Release Notes, or the online help. For versions earlier than A.11.16, the /.rhosts file must not allow write access by group or other. If /.
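The /.rhosts permission requirement above can be checked with a short script. The following is a minimal sketch in Python, not part of Serviceguard; the function name is hypothetical, and it only inspects the permission bits of whatever path it is given.

```python
import os
import stat


def group_or_other_writable(path):
    """Return True if `path` grants write permission to group or other.

    Hypothetical helper for illustration; it reads the file mode only and
    does not modify the file.
    """
    mode = os.stat(path).st_mode
    return bool(mode & (stat.S_IWGRP | stat.S_IWOTH))


if __name__ == "__main__":
    # Path taken from the text above; run as a user allowed to stat it.
    target = "/.rhosts"
    if os.path.exists(target) and group_or_other_writable(target):
        print(target, "is writable by group or other")
```

A mode such as 0600 passes this check, while 0664 or 0666 does not.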
NOTE Remember to tune the swap space and the HP-UX kernel parameters nfile, maxfiles and maxfiles_lim to ensure that they are set high enough for the number of packages you are configuring.

Port Requirements

Serviceguard uses the ports listed below. Before installing, check /etc/services and be sure no other program has reserved these ports.
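The check of /etc/services can be scripted. The following is a minimal sketch in Python, assuming the usual "name port/protocol" line layout of a services file. The function name, the sample entries, and the service names are hypothetical; port 5302 is used only because it appears in the firewall guidelines in this document, and the full port list should be taken from the release notes themselves.

```python
def find_port_conflicts(services_text, required_ports, expected_names=frozenset()):
    """Return (name, port, protocol) for entries that reserve a required port.

    services_text  -- contents of a services file as one string
    required_ports -- set of (port, protocol) pairs that must stay free
    expected_names -- service names that are allowed to own those ports
    """
    conflicts = []
    for raw_line in services_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments
        if not line:
            continue
        fields = line.split()
        if len(fields) < 2 or "/" not in fields[1]:
            continue  # not a "name port/proto" entry
        name = fields[0]
        port_text, proto = fields[1].split("/", 1)
        if not port_text.isdigit():
            continue
        key = (int(port_text), proto.lower())
        if key in required_ports and name not in expected_names:
            conflicts.append((name, key[0], key[1]))
    return conflicts


# Hypothetical sample data, not real service definitions.
SAMPLE = """
# sample entries
otherapp    5302/tcp    # some other program
sg-example  5302/udp
telnet      23/tcp
"""

if __name__ == "__main__":
    wanted = {(5302, "tcp"), (5302, "udp")}
    print(find_port_conflicts(SAMPLE, wanted, expected_names={"sg-example"}))
```

To check a real system, pass the text read from /etc/services instead of the sample string.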
General guidelines for using a system firewall with Serviceguard are listed below.
• to the cluster nodes
  - tcp on ports 5302 - and allow only packets with the SYN flag
  - udp on port 5302

Cluster Object Manager (COM) nodes

If you are using a Cluster Object Manager (COM) on a node outside of the cluster to provide connections to Serviceguard Manager or Continental Clusters clients, follow these rules.
Installing Software on HP-UX 11i v1

Following are the directions for installing Serviceguard A.11.16 on HP-UX 11i v1 (uname -a = 11.11). For instructions on installing on HP-UX 11i v2, see the section Installing Serviceguard on HP-UX 11i v2.

To install your software, run the SD-UX swinstall command. It will invoke a user interface that will lead you through the installation.
• ATS-CORE.ATS-RUN

NOTE There are files in CM-CORE that are reserved for HP support. Do not change these files. Do not move, alter, or delete the following:

• /usr/contrib/bin/cmcorefr
• /usr/contrib/bin/cmdumpfr
• /usr/contrib/bin/cmfmtfr
• /usr/contrib/lib/Q4/cmfr.pl
• /var/adm/cmcluster/frdump.cmcld.
Bad binary config file directory format.
Could not convert old binary configuration file.

These messages may safely be ignored.

Problems Installing EMS Software

• What is the problem? If you have already installed EMS software with a later version number than the version you are trying to install, you may see errors similar to the following in the swinstall logfile, /var/adm/sw/swagent.
2. From the Software menu, select Options, then Change. Click OK.
3. Scroll to view and un-click the box for Enforce Dependency Analysis Errors in Agent. Click OK.
4. From the Action menu, select Install. The Status returns: Ready with Errors. Products scheduled: less than the full set. Excluded: older version of EMS
5. Click OK. Install begins.
De-Installing Serviceguard

To deinstall your software, run the SD-UX swremove command. Before removing software, note the following:

1. Serviceguard must be halted (not running) on the node from which the swremove command is issued.
2. The system from which the swremove command is issued must be removed from the cluster configuration.
3. The swremove command should be issued from one system at a time.
Installing Serviceguard on HP-UX 11i v2

This section has directions for installing Serviceguard A.11.16 on HP-UX 11i v2 (uname -a = 11.23). To install on HP-UX 11i v1, see the section Installing Software on HP-UX 11i v1.

After you install HP-UX 11i version 2, use the swinstall command to install Serviceguard, product number T1905BA.
• /usr/contrib/bin/cmcorefr
• /usr/contrib/bin/cmdumpfr
• /usr/contrib/bin/cmfmtfr
• /usr/contrib/lib/Q4/cmfr.pl
• /var/adm/cmcluster/frdump.cmcld.x (where x is a digit)

The Cluster Object Manager (COM) product (B8324BA) is installed along with Serviceguard. The September 2004 update to HP-UX 11i v2 installs the COM version A.03.00.01. Earlier versions install COM version A.03.00.
NOTE If you did a swremove of an older version of Serviceguard before the swinstall, your system may be left with a zero-length binary configuration file (/etc/cmcluster/cmclconfig). This file should be removed before you issue the swinstall command. If you do not remove the zero-length binary configuration file.
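Whether a zero-length binary configuration file was left behind can be tested before running swinstall. The following is a minimal sketch in Python; the function name is hypothetical and the path is the one given in the note above. It only reports the condition and leaves removal to the administrator.

```python
import os

# Path taken from the note above.
CMCLCONFIG = "/etc/cmcluster/cmclconfig"


def is_zero_length_file(path=CMCLCONFIG):
    """Return True only if `path` is an existing regular file of size 0."""
    return os.path.isfile(path) and os.path.getsize(path) == 0


if __name__ == "__main__":
    if is_zero_length_file():
        print(CMCLCONFIG, "is empty; remove it before running swinstall")
    else:
        print("no zero-length", CMCLCONFIG, "found")
```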
different version of Serviceguard, you should check if this version of Serviceguard requires a newer version of EMS, and if so, remove the existing EMS version separately before re-installing Serviceguard.

If You Are Upgrading from Earlier Releases

NOTE A cold install of the operating system during a rolling upgrade is not supported.
Table 1-2 Rolling Upgrade Paths (Continued)

Serviceguard Release    HP-UX Release
Serviceguard A.11.15    Serviceguard A.11.15 is supported on HP-UX 11i v1 (11.11).
                        It is supported on HP-UX 11i v2 (11.23) original release.
                        It is supported on 11i v2 September 2004 update on Integrity
                        servers, but not on HP 9000 servers.
Patches and Fixes in this Version

The contents of Serviceguard releases A.11.01 through A.11.15 have been incorporated into A.11.16. This section describes patches that are required and defects that have been fixed in version A.11.16 of Serviceguard.

Required and Recommended Patches for HP-UX 11i v2

The following table lists patches required or recommended for Serviceguard A.11.16 on HP-UX 11i v2 (11.23).
Required and Recommended Patches for HP-UX 11i v1

The following table lists patches required or recommended for Serviceguard A.11.16 on HP-UX 11i v1 (11.11). This list is subject to change without notice. Contact your HP support representative for up-to-the-moment information. Patches can be superseded or withdrawn at any time, so always be sure to check the status of any patch before downloading it.
Table 1-4 Patches for HP-UX 11i v1 (11.11) (Continued)

Patch         Description
PHKL_29527    This is a required patch for all Serviceguard clusters.
              s700_800 11.11 filesystem buffer cache performance fix
PHKL_29704    Required if using PRM or WLM.
              s700_800 11.11 Psets Enablement, SCHED_NOAGE, FSS
PHKL_29981    Required if VxVM 3.5 is used.
              s700_800 11.11 VxVM 3.
Table 1-4 Patches for HP-UX 11i v1 (11.11) (Continued)

Patch         Description
PHKL_30833    Required if using JFS 3.3 (VxFS).
              s700_800 11.11 dmapi; fsadm; ACL; locking order
PHNE_24384    s700_800 11.11 gated (1M) patch
PHNE_28328    This is a required patch for all Serviceguard clusters.
              s700_800 11.11 inetd (1M) cumulative patch
PHNE_28778    Required for Auto-Port Aggregation.
              s700_800 11.11.
JAGaf13778: cmcld SIGSEGV on systems with serial heartbeat

• What was the problem? In a 2-node Serviceguard cluster using a serial heartbeat link, one code path references an uninitialized pointer, which can result in a cmcld abort. The code path is only executed if a serial heartbeat is configured.
memory, fills in the required information and sends it back to the command. Later it is supposed to free the memory that was allocated. In this case, the memory is not freed.

• What was the resolution? The fix is to free the memory if it was allocated successfully.
• What was the resolution? Even if the cmclconfd and cmcld daemons recalculate and set a new file descriptor limit, the original value is restored for child processes, so that they run with the original system limit.
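The general technique described in this resolution, where a parent process changes its own file descriptor limit but restores the original value for its children, can be illustrated as follows. This is a minimal sketch in Python for Unix-like systems, not the Serviceguard implementation. The function names are hypothetical, and the sketch lowers the parent's soft limit to 256 purely so the example needs no special privileges; the direction of the change does not matter for the illustration.

```python
import resource
import subprocess
import sys


def run_child_with_limit(original_limits):
    """Start a child whose RLIMIT_NOFILE is reset to `original_limits`.

    The child prints its own soft limit, which is returned as an int.
    """
    def restore_limit():
        # Runs in the child after fork and before exec.
        resource.setrlimit(resource.RLIMIT_NOFILE, original_limits)

    child_code = "import resource; print(resource.getrlimit(resource.RLIMIT_NOFILE)[0])"
    result = subprocess.run(
        [sys.executable, "-c", child_code],
        preexec_fn=restore_limit,
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout.strip())


def demo_restore_limit():
    """Change the parent's soft limit, run a child, then undo the change."""
    original_limits = resource.getrlimit(resource.RLIMIT_NOFILE)
    soft, hard = original_limits
    # Pick a different soft limit for the parent where possible.
    if soft == resource.RLIM_INFINITY or soft > 256:
        parent_soft = 256
    else:
        parent_soft = soft
    resource.setrlimit(resource.RLIMIT_NOFILE, (parent_soft, hard))
    try:
        child_soft = run_child_with_limit(original_limits)
    finally:
        resource.setrlimit(resource.RLIMIT_NOFILE, original_limits)
    return parent_soft, child_soft


if __name__ == "__main__":
    parent_soft, child_soft = demo_restore_limit()
    print("parent soft limit during run:", parent_soft)
    print("child soft limit:", child_soft)
```

The child reports the limit that was in effect before the parent changed its own, which is the behavior the fix describes.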
JAGae91702: CONCURRENT_VGCHANGE_OPERATIONS is not useful

• What was the problem? There is a perception that the CONCURRENT_VGCHANGE_OPERATIONS functionality is not useful in some situations. However, when a package failover test was run with 40 volume groups on an 8-processor system, a significant improvement was found in the time it takes a package to fail over.
The cmcheckconf and cmapplyconf commands have been enhanced so that they do not allow verification of a cluster if at least one of the netmasks in the cluster has been changed in the system. This prevents users from changing their network masks after the cluster has been up and running, which is an unsupported configuration change.
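The idea of comparing configured netmasks against the masks currently set on the system can be sketched as below. This is an illustration in Python only and makes no claim about how cmcheckconf or cmapplyconf perform the check. The function name, the interface names, and the addresses are hypothetical sample data.

```python
import ipaddress


def changed_netmasks(configured, current):
    """Return the sorted interface names whose netmask differs.

    Both arguments map an interface name to an "address/netmask" string.
    Interfaces present in only one of the two mappings are ignored.
    """
    changed = []
    for name, configured_value in configured.items():
        if name not in current:
            continue
        configured_mask = ipaddress.ip_interface(configured_value).netmask
        current_mask = ipaddress.ip_interface(current[name]).netmask
        if configured_mask != current_mask:
            changed.append(name)
    return sorted(changed)


if __name__ == "__main__":
    # Hypothetical sample data.
    configured = {
        "lan0": "192.168.1.10/255.255.255.0",
        "lan1": "10.0.0.5/255.255.0.0",
    }
    current = {
        "lan0": "192.168.1.10/255.255.255.0",
        "lan1": "10.0.0.5/255.255.255.0",
    }
    print(changed_netmasks(configured, current))
```

A non-empty result corresponds to the situation in which the enhanced commands refuse to verify the cluster.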
From:
Disks which do not have IDs cannot be included in the topology description. Use pvcreate(1M) to initialize a disk for LVM or, use vxdiskadm(1M) to initialize a disk for VxVM.

To:
Disks were discovered which are not in use by either LVM or VxVM. Use pvcreate(1M) to initialize a disk for LVM or, use vxdiskadm(1M) to initialize a disk for VxVM.
coordinator node does not send an SMN down trap. The SMN package current node MIB variable is also incorrect in the SNMP MIB table.

• What was the resolution? Enhancements allow the subagent and its API to identify SMN packages and treat them differently than regular packages.

JAGae61889: cmGetsatus for packages returns -10 after online reconfiguration.
JAGae60038: cmapplyconf prints bogus message when cl_disk_init fails

• What is the problem? A misleading error message is printed when trying to apply a configuration that needs to initialize the cluster lock disk when the cluster lock vg is not activated.
Known Problems and Workarounds

JAGaf36760: Uncorrectable write errors on vxvm volumes.

• What is the problem? Configurations using more than 256 LUNs with VxVM on the 11i v2 September 2004 update experience uncorrectable write errors.
• What is the workaround? Restrict configurations to a maximum of 256 LUNs. Please continue to check the status of this problem at http://www2.itrc.hp.com for updated information.
• What is the workaround? Restrict configurations to 256 LUNs or fewer. Please continue to check the status of this problem at http://www2.itrc.hp.com for updated information. Search the Technical Knowledge base for bug reports with the keyword JAGaf36081.

JAGaf32447: Windows: Avoid security risk posed by JRE bundle installed with Serviceguard Manager A.04.00.
• What is the workaround? For instructions on replacing the JRE on HP-UX, search the technical knowledge base for keyword = JAGaf32443 at your support site:
  http://us-support.external.hp.com (Americas and Asia Pacific)
  http://europe-support.external.hp.com (Europe)

JAGaf32449: Linux: Avoid security risk posed by JRE bundle installed with Serviceguard Manager A.04.
JAGaf24444: Unable to receive device query message

• What is the problem? Serviceguard configuration commands (cmquerycl, etc.) may fail if you have no LVM volume groups configured on one or more of your systems.
• What is the workaround? The workaround is to create a volume group and import it on all nodes that have no LVM volume groups. There is a patch for Serviceguard A.11.14: PHSS_31015. Patch numbers for A.
3. Now, when you are able to take the entire cluster down, run cmhaltcl -f on one of the nodes that is still running in the Serviceguard cluster.
4. After the cluster has successfully halted, start the Serviceguard cluster on the nodes that were upgraded to CVM 3.5. To do this, you must not run the normal cmruncl command to start up the Serviceguard cluster, since that will attempt to start all cluster nodes.
JAGaf08686: It is not possible to configure some combinations of roles

• What was the problem? Duplicate roles and conflicting roles are not allowed in Access Control Policies. This is especially problematic when wild cards are used. For example, if ANY_USER from ANY_SERVICEGUARD_NODE has a role, no other Access Control Policy can be created that would not conflict or be redundant.
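The effect of wild cards on policy conflicts can be illustrated with a simplified model. The following Python sketch is an assumption-laden illustration, not Serviceguard's actual validation logic: it treats each policy as a (user, host, role) triple, treats ANY_USER and ANY_SERVICEGUARD_NODE as matching everything, and reports a clash whenever a new policy covers a user and host that an existing policy already covers. The type and function names and the sample users and nodes are hypothetical.

```python
from collections import namedtuple

# Simplified, hypothetical model of an Access Control Policy.
Policy = namedtuple("Policy", "user host role")

ANY_USER = "ANY_USER"
ANY_NODE = "ANY_SERVICEGUARD_NODE"


def _overlaps(first, second, wildcard):
    """Two values overlap if they are equal or either one is the wildcard."""
    return first == second or first == wildcard or second == wildcard


def find_clash(existing, candidate):
    """Return the first existing policy that covers the same user and host.

    In this model a clash with the same role is a duplicate and a clash
    with a different role is a conflict; both are rejected.
    """
    for policy in existing:
        if _overlaps(policy.user, candidate.user, ANY_USER) and _overlaps(
            policy.host, candidate.host, ANY_NODE
        ):
            return policy
    return None


if __name__ == "__main__":
    existing = [Policy(ANY_USER, ANY_NODE, "Monitor")]
    candidate = Policy("alice", "node1", "Package Admin")
    print(find_clash(existing, candidate))
```

Under this model, once a policy for ANY_USER from ANY_SERVICEGUARD_NODE exists, every further policy clashes with it, which matches the example given above.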
JAGae87101: LOG_PERIODIC messages should not be logged at log level 1

• What is the problem? Because periodic messages show up very often, enabling this sort of debug logging will fill up syslog.log very quickly.
• What is the workaround? There is no workaround.

JAGae62205: Serviceguard package cannot be restarted after hardware monitoring is re-enabled.
- Halt Serviceguard on the node, then restart it. This will create the monitor request. Then start the package.

JAGad39695: User error can result in "ghost" services

• What is the problem? When the package was shut down, references for this service had already been removed, and the service failed while the package was halting. Since the service no longer exists, it cannot be halted manually.
- Another solution is to add a new package with a service name matching the one originally deleted and then halt the service with cmhaltserv. This would allow the problem to be resolved without halting the package, node, or cluster.
- Alternatively, if the cluster or node were halted and restarted, the problem would go away.
Software Availability in Native Languages

The command line interface for Serviceguard Version A.11.16 does not provide Native Language Support.