Managing HP Serviceguard A.11.20.00 for Linux, June 2012

ManualsBrandsHP ManualsSoftwareHP Serviceguard for Linux RH AS Cluster

231

232

233

234

235

236

237

238

239

240

vgchange -a n <volume-group>

4. Finally, re-enable the package for switching.

cmmodpkg -e <package-name>

If after cleaning up the node on which the timeout occurred it is desirable to have that node

as an alternate for running the package, remember to re-enable the package to run on the

node:

cmmodpkg -e -n <node-name> <package-name>

The default Serviceguard control scripts are designed to take the straightforward steps needed to

get an application running or stopped. If the package administrator specifies a time limit within

which these steps need to occur and that limit is subsequently exceeded for any reason, Serviceguard

takes the conservative approach that the control script logic must either be hung or defective in

some way. At that point the control script cannot be trusted to perform cleanup actions correctly,

thus the script is terminated and the package administrator is given the opportunity to assess what

cleanup steps must be taken.

If you want the package to switch automatically in the event of a control script timeout, set the

node_fail_fast_enabled parameter (page 160) to YES. In this case, Serviceguard will cause

a reboot on the node where the control script timed out. This effectively cleans up any side effects

of the package’s run or halt attempt. In this case the package will be automatically restarted on

any available alternate node for which it is configured.

Package Movement Errors (Legacy Packages)

These errors are similar to the system administration errors except they are caused specifically by

errors in the package control script. The best way to prevent these errors is to test your package

control script before putting your high availability application on line.

Adding a set -x statement in the second line of your control script will give you details on where

your script may be failing.

Package startup failure due to uncleaned LVM2 hosttags

When LVM2 hosttags feature is used in Volumegroup, Serviceguard ensures that the hosttags are

cleaned up on every package halt process. However, in case of Node power failure or crash

initiated by SERVICE_FAIL_FAST / NODE_FAIL_FAST feature, hosttags will not be cleaned up. In

such cases, hosttags have to be manually cleaned up before starting the package on other node.

Following messages can be seen in the package log where package failed to startup on other

node and it also provides procedure to clean up the hosttags.

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: ERROR: Function activation_check:

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: Error vg01 may still be activated on xyz.hp.com

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: To correct this situation, logon to "xyz.hp.com " and

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: execute the following commands:

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: vgchange -a n vg01

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: vgchange --deltag xyz.hp.com vg01

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: Once "vg01" has been deactivated from “xyz.hp.com",

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: this package may be restarted via either cmmodpkg (1M)

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: or cmrunpkg(1M).

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: In the event that "xyz.hp.com" is either powered off

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: or unable to boot, then "vg01" must be forced

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: to be activated on this node.

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: ******************* WARNING ***************************

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: Forcing activation can lead to data corruption if

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: "xyz.hp.com" is still running and has "vg01"

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: active. It is imperative to positively determine that

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: "xyz.hp.com" is not running prior to performing

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: this operation.

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: *******************************************************

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: To force activate "vg01", execute the following

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: command on the local system:

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: vgchange --deltag xyz.hp.com vg01

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: The package may then be restarted via either

Feb 11 17:18:36 root@abc.hp.com volume_group.sh[1871]: cmmodpkg (1M) or cmrunpkg (1M) commands.

236 Troubleshooting Your Cluster