Managing Serviceguard 13th Edition, February 2007
Troubleshooting Your Cluster
Solving Problems
Chapter 8 369
Next, deactivate the package volume groups. These are specified by
the VG[] array entries in the package control script.
# vgchange -a n <volume-group>
4. Finally, re-enable the package for switching.
# cmmodpkg -e <package-name>
If after cleaning up the node on which the timeout occurred it is
desirable to have that node as an alternate for running the package,
remember to re-enable the package to run on the node:
# cmmodpkg -e -n <node-name> <package-name>
The default Serviceguard control scripts are designed to take the
straightforward steps needed to get an application running or stopped. If
the package administrator specifies a time limit within which these steps
need to occur and that limit is subsequently exceeded for any reason,
Serviceguard takes the conservative approach that the control script
logic must either be hung or defective in some way. At that point the
control script cannot be trusted to perform cleanup actions correctly, thus
the script is terminated and the package administrator is given the
opportunity to assess what cleanup steps must be taken.
If you want the package to switch automatically in the event of a control
script timeout, set the NODE_FAIL_FAST_ENABLED parameter to YES. (If
you are using Serviceguard Manager, check the box for Node Fail
Fast.) In this case, Serviceguard will cause a TOC on the node where
the control script timed out. This effectively cleans up any side effects of
the package's run or halt attempt. In this case the package will be
automatically restarted on any available alternate node for which it is
configured.
Problems with Cluster File System (CFS)
If you have a system multi-node package for VERITAS CFS, you may not
be able to start the cluster until SG-CFS-pkg starts. Check
SG-CFS-pkg.log for errors.
You will have trouble running the cluster if there is a discrepancy
between the CVM cluster and the Serviceguard cluster. To check, enter
gabconfig -a command.
The ports that must be up are:
1. a - which is llt, gab