Managing Serviceguard 11th Edition, Version A.11.16, Second Printing June 2004

Chapter 8
Troubleshooting Your Cluster
Solving Problems
Problems with Serviceguard may be of several types. The following is a
list of common categories of problems:
Serviceguard Command Hangs.
Cluster Re-formations.
System Administration Errors.
Package Control Script Hangs.
Problems with VxVM Disk Groups.
Package Movement Errors.
Node and Network Failures.
Quorum Server Problems.
The first two categories of problems result from incorrect
configuration of Serviceguard. The Node and Network Failures category
contains “normal” failures, to which Serviceguard is designed to react
in order to ensure the availability of your applications.
Serviceguard Command Hangs
Many Serviceguard commands, including cmviewcl, depend on name
resolution services to look up the addresses of cluster nodes. When name
services are not available (for example, if a name server is down),
Serviceguard commands may hang, or may return a network-related
error message. If this happens, use the nslookup command on each
cluster node to see whether name resolution is correct. For example:
# nslookup ftsys9
Name Server: server1.cup.hp.com
Address: 15.13.168.63
Name: ftsys9.cup.hp.com
Address: 15.13.172.229
If the output of this command does not include the correct IP address of
the node, then check your name resolution services further.
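
The nslookup check described above can be scripted so that it is easy to repeat on every cluster node. The following is a minimal sketch, not part of Serviceguard: the function names answer_address and check_node are hypothetical, and the parsing assumes nslookup prints the answer as a "Name:" line followed by an "Address:" line, as in the example output above. Output formats vary between nslookup versions, so adjust the pattern if yours differs.

```shell
#!/bin/sh
# Sketch only: compare the address nslookup reports for a node with the
# address you expect. Function names are illustrative, not Serviceguard
# commands.

# Read nslookup output on stdin and print the first address that follows
# a "Name:" line. This skips the name server's own "Address:" line,
# which appears before the answer.
answer_address() {
    awk '/^Name:/ { seen = 1; next }
         seen && /^Address:/ { print $2; exit }'
}

# check_node NODE EXPECTED_IP
# Prints one status line; returns 0 if the resolved address matches
# the expected address, 1 otherwise (including when no answer is found).
check_node() {
    node=$1
    expected=$2
    actual=$(nslookup "$node" 2>/dev/null | answer_address)
    if [ "$actual" = "$expected" ]; then
        echo "OK       $node -> $actual"
        return 0
    fi
    echo "MISMATCH $node -> ${actual:-no answer} (expected $expected)"
    return 1
}
```

For example, after sourcing these functions on a cluster node, running check_node ftsys9 15.13.172.229 reports whether the node name from the example above resolves to the expected address. A MISMATCH line is a prompt to check your name resolution services further; it does not by itself identify the cause.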