Managing Serviceguard A.11.20, March 2013

cmmodnet -r -i <ip-address> <subnet>

where <ip-address> is the address in the Address column and <subnet> is the corresponding entry in the Network column for IPv4, or the prefix (which can be derived from the IPv6 address) for IPv6.

3. Ensure that package volume groups are deactivated. First unmount any package logical volumes that are being used for file systems. To find them, inspect the output of the command bdf -l. If any package logical volumes, as specified by the LV[] array variables in the package control script, appear under the “Filesystem” column, use fuser to kill any processes still using them, then use umount to unmount them:

fuser -ku <logical-volume>

umount <logical-volume>

Next, deactivate the package volume groups. These are specified by the VG[] array entries in the package control script.

vgchange -a n <volume-group>

4. Finally, re-enable the package for switching.

cmmodpkg -e <package-name>

If, after cleaning up the node on which the timeout occurred, you want that node to serve as an alternate for running the package, remember to re-enable the package to run on the node:

cmmodpkg -e -n <node-name> <package-name>
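The cleanup steps shown above can be collected into a small script. The following is a minimal sketch, not an HP-supplied tool. The package name, node name, IP address, subnet, logical volumes, and volume group are hypothetical placeholders, and the run helper with its DRY_RUN switch is an assumption added here so that the commands can be reviewed before anything is executed. It also assumes that every logical volume listed is actually mounted; check the output of bdf -l first.

```shell
#!/bin/sh
# Sketch: manual cleanup after a control script timeout.
# Every value below is a hypothetical placeholder; substitute your own.
PKG="pkg1"
NODE="node1"
IP="192.10.25.12"                        # relocatable package IP address
SUBNET="192.10.25.0"                     # matching Network column entry
LVS="/dev/vg01/lvol1 /dev/vg01/lvol2"    # from the LV[] entries
VGS="/dev/vg01"                          # from the VG[] entries

# DRY_RUN=1 (the default here) only prints the commands.
# Set DRY_RUN=0 to execute them on a real cluster node.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

cleanup_package() {
    # Remove the package IP address from the node.
    run cmmodnet -r -i "$IP" "$SUBNET"

    # Kill processes using each logical volume, then unmount it.
    for lv in $LVS; do
        run fuser -ku "$lv"
        run umount "$lv"
    done

    # Deactivate the package volume groups.
    for vg in $VGS; do
        run vgchange -a n "$vg"
    done

    # Re-enable package switching, then allow the package on this node.
    run cmmodpkg -e "$PKG"
    run cmmodpkg -e -n "$NODE" "$PKG"
}

cleanup_package
```

Run as written, the script only lists the commands in order. Running it with DRY_RUN=0 executes them, so review the placeholder values carefully before doing so.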

If the package failed because of a failure in a generic resource of evaluation type during_package_start, then the status of the generic resource will be 'down'. The switching parameter will be disabled on the node.

To enable switching:

• First fix the resource that went down.

• Once the resource is available, set the status of the generic resource to 'up' by running:

cmsetresource -r <resource_name> -s up

NOTE: If the resource is an extended resource, make sure you set a value for the generic resource that satisfies the generic_resource_up_criteria.

• Re-enable the package to run on the node by running:

cmmodpkg -e -n <node-name> <package-name>
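The two commands in this procedure can be chained so that the package is re-enabled only if the resource status was set successfully. This is a minimal sketch for a simple (not extended) generic resource. The resource, node, and package names are hypothetical, and the run helper with its DRY_RUN switch is an assumption added here so that the commands can be previewed first. Fixing the underlying resource remains a manual step that must come before this.

```shell
#!/bin/sh
# Sketch: re-enable a package after a generic resource failure.
# All names are hypothetical placeholders.
RESOURCE="res1"
NODE="node1"
PKG="pkg1"

# DRY_RUN=1 (the default here) only prints the commands.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

reenable_after_resource_fix() {
    # Mark the repaired resource as up; stop if that fails.
    run cmsetresource -r "$RESOURCE" -s up || return 1
    # Allow the package to run on the node again.
    run cmmodpkg -e -n "$NODE" "$PKG"
}

reenable_after_resource_fix
```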

The default Serviceguard control scripts are designed to take the straightforward steps needed to get an application running or stopped. If the package administrator specifies a time limit within which these steps must occur and that limit is subsequently exceeded for any reason, Serviceguard takes the conservative view that the control script logic must be either hung or defective in some way. At that point the control script cannot be trusted to perform cleanup actions correctly, so the script is terminated and the package administrator is given the opportunity to assess what cleanup steps must be taken.

If you want the package to switch automatically in the event of a control script timeout, set the node_fail_fast_enabled parameter (page 248) to yes. In this case, Serviceguard will cause the node where the control script timed out to halt (system reset). This effectively cleans up any side effects of the package's run or halt attempt (but remember that the system reset will cause all the packages running on that node to halt abruptly). The package will then be automatically restarted on any available alternate node for which it is configured. For more information, see “Responses to Package and Service Failures” (page 95).
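As a rough illustration of the parameter change described above, the following sketch rewrites an existing node_fail_fast_enabled line in a package configuration file. The function name enable_fail_fast and the file name pkg1.conf are hypothetical, and the sketch assumes a modular package configuration file that already contains an uncommented node_fail_fast_enabled line. The Serviceguard commands in the trailing comment are shown for context only and are not executed by the sketch.

```shell
#!/bin/sh
# Sketch: set node_fail_fast_enabled to yes in a package configuration file.
# enable_fail_fast is a hypothetical helper, not a Serviceguard command.

enable_fail_fast() {
    conf="$1"
    tmp="$conf.tmp.$$"
    # Replace the value on the existing parameter line, keeping its spacing.
    # A temporary file is used instead of in-place editing for portability.
    sed 's/^\([[:space:]]*node_fail_fast_enabled[[:space:]][[:space:]]*\).*$/\1yes/' \
        "$conf" > "$tmp" && mv "$tmp" "$conf"
}

# Possible use on a cluster node (pkg1 and pkg1.conf are placeholders):
#   cmgetconf -p pkg1 pkg1.conf     # save the current package configuration
#   enable_fail_fast pkg1.conf      # edit the parameter
#   cmcheckconf -P pkg1.conf        # verify the edited file
#   cmapplyconf -P pkg1.conf        # apply it to the cluster
```

Check the parameter description (page 248) for any restrictions on changing it while the package is running.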

342 Troubleshooting Your Cluster