HP Serviceguard Quorum Server Version A.03.00 Release Notes, December 2007

HP Serviceguard Quorum Server Version A.03.00 Release Notes

Configuring Serviceguard to Use the Quorum Server

Chapter 118

Configuring Serviceguard to Use the Quorum

Server

About the QS Polling Interval and Timeout Extension

Serviceguard probes the Quorum Server at intervals determined by the

QS_POLLING_INTERVAL parameter in the cluster configuration file. The

default value for QS_POLLING_INTERVAL is 5 minutes and the minimum

value is 10 seconds.

If the quorum server process goes down while its node is still up, the

Serviceguard cluster nodes can detect the halt in the quorum server

process. Serviceguard will try to re-connect to the quorum server every

10 seconds until the quorum server is back up and the connection is

successful. If the quorum server is needed as a tie-breaker during this

downtime, the cluster will halt.

However, Serviceguard cannot immediately detect the loss of connection

to the process if the quorum server’s node goes down. Serviceguard will

continue to poll at the configured interval, and will not discover that the

quorum server connection is down until the next polling is done. If a

cluster reformation starts before the next polling has occurred,

Serviceguard assumes the Quorum Server is down. Because it requires

the Quorum Server as a tie-breaker, it will halt the cluster. (Even if the

Quorum Server comes back up before or during reformation,

Serviceguard will not know that it has until the next polling.)

The minimum value for the polling interval is 10 seconds. Reducing the

QS_POLLING_INTERVAL means Serviceguard will detect Quorum Server

failures sooner, but it will also increase the load on the Quorum Server. If

you set a short interval, you may have to reduce the number of clusters

or nodes using the Quorum Server to reduce the load. This is especially

important if you are using Serviceguard Extension for Faster Failover

(SGeFF) because in that case the lock acquisition time value is also set

very low. Test very low settings carefully to fine-tune all timing

parameters, and do the tests in an environment that imitates the actual

production environment as closely as possible.