Using Serviceguard Extension for RAC, 5th Edition, June 2007
Serviceguard Configuration for Oracle 10g RAC
Cluster Communication Network Monitoring
Chapter 2
• First network with primary for SG-HB #1 (lan1).
• Second network with primary for SG-HB #2 (lan2).
• Third network with primary and standby for CSS-HB and RAC-IC
(lan3/lan4).
• Single failure is protected by primary/standby.
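The network layout in the list above could be expressed, for one node, roughly as in the following cluster configuration fragment. This is only a sketch: the node name and all IP addresses are hypothetical, and it assumes that lan3 is configured as a monitored non-heartbeat subnet (STATIONARY_IP) and that lan4, listed without an address, acts as its standby.

```
# Hypothetical node entry; node name and addresses are placeholders.
NODE_NAME node1
  # First network: primary for SG-HB #1
  NETWORK_INTERFACE lan1
    HEARTBEAT_IP 192.168.1.1
  # Second network: primary for SG-HB #2
  NETWORK_INTERFACE lan2
    HEARTBEAT_IP 192.168.2.1
  # Third network: primary for CSS-HB and RAC-IC
  # (assumed here to carry no Serviceguard heartbeat)
  NETWORK_INTERFACE lan3
    STATIONARY_IP 192.168.3.1
  # Standby for lan3: no IP address is assigned
  NETWORK_INTERFACE lan4
```

Each of the other nodes would carry a matching entry with its own addresses on the same three subnets.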
If the subnet carrying CSS-HB fails and subnet monitoring is not in use,
CSS resolves the interconnect subnet failure with a CSS cluster
reconfiguration. Before handling the CSS-HB subnet failure, CSS waits
for the CSS misscount interval (an Oracle Clusterware configuration
parameter). The default value of CSS misscount in SGeRAC
configurations is 600 seconds.
CLUSTER_INTERCONNECT_SUBNET can be used together with the
NODE_FAIL_FAST_ENABLED option to monitor the CSS-HB network. A
failure of the CSS-HB subnet on a node should be handled by bringing
down that node, so set NODE_FAIL_FAST_ENABLED to YES for the package
monitoring the CSS-HB subnet. When Oracle Clusterware is configured
as a multi-node package and CLUSTER_INTERCONNECT_SUBNET is used to
monitor the CSS-HB subnet, a failure of the CSS-HB subnet on a node
brings down both the instance of the multi-node package on that node
and the node itself. If the CSS-HB subnet fails on all nodes, the
multi-node package fails on the nodes one by one, taking each node
down with it, until a single node and its instance of the multi-node
package remain to provide services to the clients.
NOTE Do not configure CLUSTER_INTERCONNECT_SUBNET in the RAC Instance
package, because the RAC-IC network is the same as the CSS-HB network.
Use a separate package to monitor the CSS-HB subnet alone, and make the
Oracle Clusterware multi-node package depend on the package
monitoring the CSS-HB subnet. NODE_FAIL_FAST_ENABLED is set to
NO for the Oracle Clusterware package and to YES for the package
monitoring the CSS-HB subnet (the Oracle Cluster Interconnect Subnet
package).
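As a rough sketch of the two-package arrangement described above, the fragments below show how the relevant parameters might appear in legacy-style package configuration files. The package names (CRS-PKG, CI-PKG), the dependency name, and the subnet address 192.168.3.0 are hypothetical placeholders. A real package file needs further parameters (node list, control scripts, and so on) that are omitted here.

```
# Oracle Cluster Interconnect Subnet package (hypothetical name CI-PKG):
# monitors the CSS-HB subnet alone and halts the node if that subnet fails.
PACKAGE_NAME                  CI-PKG
PACKAGE_TYPE                  MULTI_NODE
NODE_FAIL_FAST_ENABLED        YES
CLUSTER_INTERCONNECT_SUBNET   192.168.3.0   # placeholder CSS-HB subnet

# Oracle Clusterware package (hypothetical name CRS-PKG):
# depends on CI-PKG being up on the same node, with no node fail-fast of its own.
PACKAGE_NAME                  CRS-PKG
PACKAGE_TYPE                  MULTI_NODE
NODE_FAIL_FAST_ENABLED        NO
DEPENDENCY_NAME               CI-PKG-DEP
DEPENDENCY_CONDITION          CI-PKG=UP
DEPENDENCY_LOCATION           SAME_NODE
```

Keeping the subnet monitor in its own package means only the package with NODE_FAIL_FAST_ENABLED set to YES triggers the node halt, while the Oracle Clusterware package simply follows through its dependency.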