Managing Serviceguard Eighteenth Edition, September 2010
Responding to Cluster Events ..........................................................................................397
Single-Node Operation ....................................................................................................397
Disabling Serviceguard.....................................................................................................398
Removing Serviceguard from a System...........................................................................398
8 Troubleshooting Your Cluster....................................................................................................399
Testing Cluster Operation ................................................................................................399
Start the Cluster using Serviceguard Manager...........................................................399
Testing the Package Manager .....................................................................................399
Testing the Cluster Manager .......................................................................................400
Testing the Network Manager ....................................................................................400
Monitoring Hardware ......................................................................................................401
Using Event Monitoring Service.................................................................................401
Using EMS (Event Monitoring Service) Hardware Monitors.....................................402
Hardware Monitors and Persistence Requests............................................................402
Using HP ISEE (HP Instant Support Enterprise Edition)...........................................402
Replacing Disks.................................................................................................................402
Replacing a Faulty Array Mechanism.........................................................................402
Replacing a Faulty Mechanism in an HA Enclosure...................................................403
Replacing a Lock Disk.................................................................................................404
Replacing a Lock LUN.................................................................................................405
Online Hardware Maintenance with In-line SCSI Terminator ...................................406
Replacing I/O Cards..........................................................................................................406
Replacing SCSI Host Bus Adapters.............................................................................406
Replacing LAN or Fibre Channel Cards...........................................................................407
Offline Replacement....................................................................................................407
Online Replacement....................................................................................................407
After Replacing the Card.............................................................................................408
Replacing a Failed Quorum Server System......................................................................408
Troubleshooting Approaches ...........................................................................................409
Reviewing Package IP Addresses ...............................................................................410
Reviewing the System Log File ..................................................................................410
Sample System Log Entries ...................................................................................411
Reviewing Object Manager Log Files .........................................................................411
Reviewing Serviceguard Manager Log Files ..............................................................412
Reviewing the System Multi-node Package Files........................................................412
Reviewing Configuration Files ...................................................................................412
Reviewing the Package Control Script .......................................................................412
Using the cmcheckconf Command..........................................................................412
Using the cmviewconf Command.............................................................................413
Reviewing the LAN Configuration ............................................................................413
Solving Problems .............................................................................................................413
Serviceguard Command Hangs..................................................................................414
16 Table of Contents