Managing HP Serviceguard A.12.00.00 for Linux, June 2014

8.1.2.4 Listing the Existing Session.................................................................................259
8.1.3 Managing the Cluster...............................................................................................259
8.1.4 Managing the Nodes in the Simulated Cluster.............................................................259
8.2 Simulation Scenarios for the Package................................................................................260
8.2.1 Creating a Simulated Package..................................................................................260
8.2.2 Running a Package .................................................................................................260
8.2.3 Halting a Package...................................................................................................261
8.2.4 Deleting a Package.................................................................................................261
8.2.5 Enabling or Disabling Switching Attributes for a Package.............................................261
8.3 Simulating Failure Scenarios.............................................................................................261
9 Cluster Analytics.....................................................................................263
9.1 Installing the Cluster Analytics Software..............................................................................265
9.1.1 Pre-requisites............................................................................................................265
9.1.2 Installing serviceguard-analytics Software....................................................................265
9.1.3 Verifying serviceguard-analytics Installation..................................................................265
9.1.4 Removing serviceguard-analytics Software...................................................................265
9.2 Starting Cluster Analytics Daemon.....................................................................................265
9.2.1 Cluster Event Message Consolidation..........................................................................266
9.3 Stopping Cluster Analytics Daemon...................................................................................266
9.4 Verifying Cluster Analytics Daemon...................................................................................267
9.5 Command to Retrieve KPIs................................................................................................267
9.6 Limitations......................................................................................................................268
10 Troubleshooting Your Cluster..................................................................269
10.1 Testing Cluster Operation ..............................................................................................269
10.1.1 Testing the Package Manager ..................................................................................269
10.1.2 Testing the Cluster Manager ...................................................................................270
10.2 Monitoring Hardware ...................................................................................................270
10.3 Replacing Disks............................................................................................................271
10.3.1 Replacing a Faulty Mechanism in a Disk Array..........................................................271
10.3.2 Replacing a Lock LUN............................................................................................271
10.4 Revoking Persistent Reservations after a Catastrophic Failure...............................................271
10.4.1 Examples..............................................................................................................272
10.5 Replacing LAN Cards....................................................................................................272
10.6 Replacing a Failed Quorum Server System.......................................................................273
10.7 Troubleshooting Approaches .........................................................................................274
10.7.1 Reviewing Package IP Addresses .............................................................................274
10.7.2 Reviewing the System Log File .................................................................................275
10.7.2.1 Sample System Log Entries ..............................................................................275
10.7.3 Reviewing Configuration Files .................................................................................276
10.7.4 Using the cmquerycl and cmcheckconf Commands.....................................................276
10.7.5 Reviewing the LAN Configuration ............................................................................276
10.8 Solving Problems .........................................................................................................276
10.8.1 Name Resolution Problems......................................................................................277
10.8.1.1 Networking and Security Configuration Errors.....................................................277
10.8.2 Halting a Detached Package..................................................................................277
10.8.3 Cluster Re-formations Caused by Temporary Conditions..............................................277
10.8.4 Cluster Re-formations Caused by MEMBER_TIMEOUT Being Set too Low.......................277
10.8.5 System Administration Errors ..................................................................................278
10.8.5.1 Package Control Script Hangs or Failures .........................................................279
10.8.6 Node and Network Failures ..................................................................................280
10.8.7 Troubleshooting the Quorum Server.........................................................................280
12 Contents