Managing Serviceguard Fifteenth Edition, reprinted May 2008
Troubleshooting Your Cluster
Solving Problems
nslookup ftsys9
Name Server: server1.cup.hp.com
Address: 15.13.168.63
Name: ftsys9.cup.hp.com
Address: 15.13.172.229
If the output of this command does not include the correct IP address of
the node, then check your name resolution services further.
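The same check can be scripted when several node names have to be verified. The following is a minimal sketch in Python, not part of Serviceguard: the function names resolved_addresses and resolves_correctly are hypothetical, and the sketch assumes that the standard library call socket.gethostbyname_ex consults the same name resolution services as the node's applications.

```python
import socket


def resolved_addresses(hostname, resolver=socket.gethostbyname_ex):
    """Return the IPv4 addresses that the resolver reports for hostname.

    The resolver argument exists only so the lookup can be replaced in
    tests; by default the system resolver is used.
    """
    try:
        _name, _aliases, addresses = resolver(hostname)
    except OSError:
        # socket.gaierror and socket.herror are both subclasses of OSError.
        return []
    return list(addresses)


def resolves_correctly(hostname, expected_ip, resolver=socket.gethostbyname_ex):
    """True if expected_ip is among the addresses resolved for hostname."""
    return expected_ip in resolved_addresses(hostname, resolver)
```

For example, resolves_correctly("ftsys9", "15.13.172.229") would be expected to return True on a system whose name services match the nslookup output shown above. A False result is a reason to check name resolution further, not proof of a specific fault.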
In many cases, a symptom such as Permission denied... or
Connection refused... is the result of an error in the networking or
security configuration. Most such problems can be resolved by
correcting the entries in /etc/hosts. See “Configuring Name Resolution”
on page 203 for more information.
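Before editing /etc/hosts by hand, it can help to list which node names are missing or mapped to an unexpected address. The sketch below is illustrative only: parse_hosts and missing_or_wrong are hypothetical helper names, the parser handles only the common "address name alias..." line layout with # comments, and the expected addresses must come from your own cluster records.

```python
def parse_hosts(text):
    """Map each host name or alias to the addresses listed for it."""
    table = {}
    for line in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        address, names = fields[0], fields[1:]
        for name in names:
            table.setdefault(name, []).append(address)
    return table


def missing_or_wrong(text, expected):
    """Return the names in expected whose address is not listed in text.

    expected maps a node name to the address it should have.
    """
    table = parse_hosts(text)
    return sorted(
        name for name, address in expected.items()
        if address not in table.get(name, [])
    )
```

A typical use would be to read the file with open("/etc/hosts").read() and pass the result, together with a dictionary of node names and expected addresses, to missing_or_wrong. Any name it returns is a candidate for correction.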
Cluster Re-formations
Cluster re-formations may occur from time to time due to current cluster
conditions. Some of the causes are as follows:
• a local switch on an Ethernet LAN, if the switch takes longer to
complete than the cluster NODE_TIMEOUT value. To prevent this
problem, you can increase the cluster NODE_TIMEOUT value, or you
can use a different LAN type.
• excessive network traffic on heartbeat LANs. To prevent this, you
can use dedicated heartbeat LANs, or LANs with less traffic on
them.
• an overloaded system, with too much total I/O and network traffic.
• an improperly configured network, for example, one with a very large
routing table.
In these cases, applications continue running, though they might
experience a small performance impact during cluster re-formation.
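The relationship between heartbeat delays and NODE_TIMEOUT can be illustrated with a simplified model. The sketch below is not how Serviceguard itself decides to re-form a cluster. It assumes you have collected heartbeat arrival times by some means of your own, and it only reports the intervals that were longer than a timeout you supply. The function name gaps_exceeding_timeout is hypothetical, and both arguments are assumed to be in seconds, so convert the configured NODE_TIMEOUT value to seconds before passing it in.

```python
def gaps_exceeding_timeout(heartbeat_times, node_timeout):
    """Return (start_time, gap) pairs for heartbeat gaps above node_timeout.

    heartbeat_times is a sorted list of arrival times in seconds;
    node_timeout is the timeout in seconds (an assumption of this sketch).
    """
    gaps = []
    # Pair each arrival time with the one that follows it.
    for earlier, later in zip(heartbeat_times, heartbeat_times[1:]):
        gap = later - earlier
        if gap > node_timeout:
            gaps.append((earlier, gap))
    return gaps
```

If such gaps line up with LAN switches, traffic peaks, or periods of heavy system load, that would be consistent with the causes listed above, and it suggests which of the remedies (a larger NODE_TIMEOUT value or a dedicated heartbeat LAN) is worth trying first.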