Providing Open Architecture High Availability Solutions
6
Providing Open Architecture High Availability Solutions
9.1.4 Consistent Programmatic Response......................................................87
9.1.5 Avoidance of Arbitrary Limits..................................................................87
9.1.6 Appropriate panic() Behavior .................................................................88
9.1.7 Handling of Spurious Events..................................................................88
9.2 Notification ..........................................................................................................88
9.3 HA-Enhanced OS Services.................................................................................88
9.3.1 Memory Protection .................................................................................89
9.3.2 Process Handling Overview ...................................................................89
9.3.3 I/O Device Drivers ..................................................................................90
9.3.4 Signal IPC Mechanism...........................................................................90
9.3.5 Management Access to Kernel Information ...........................................90
9.3.6 Configurable Restart/Reboot Behavior...................................................91
9.4 Hot-Swap Software Requirements......................................................................91
9.4.1 Device Resources Abstraction ...............................................................92
9.4.2 Stateful Device Driver Model..................................................................92
9.5 Support of the Application Layer .........................................................................92
10.0 Layer-Specific Capabilities – Management Middleware.............................................93
10.1 Collect System Data in Real Time ......................................................................94
10.2 Configuring and Maintaining State-Aware Model of the Total System................95
10.3 Checkpointing Data to Redundant Components.................................................96
10.4 Detecting, Diagnosing and Isolating Faults.........................................................97
10.5 Performing Rapid Recovery ................................................................................98
10.6 Dynamically Managing Configuration and Dependencies of All Components ....98
10.7 Providing Administrative Access and Control .....................................................99
10.8 Providing a Consistent Interface to Applications.................................................99
10.9 Alignment with Standards ...................................................................................99
11.0 Layer-Specific Capabilities – Applications................................................................100
11.1 Status................................................................................................................100
11.2 Notification ........................................................................................................100
11.3 State Preservation.............................................................................................101
11.4 Recovery...........................................................................................................101
11.5 Resilience..........................................................................................................102
12.0 Glossary........................................................................................................................103
13.0 Bibliography .................................................................................................................111