HP XC System Software Administration Guide Version 4.0

For more information on the resmon utility, see resmon(1).
7.10 The kdump Mechanism and the crash Utility
7.10.1 The kdump Mechanism
This release introduces kdump, a reliable mechanism for saving crash dumps. The two major
components of the kdump mechanism are:
• A minimal kernel
• initrd, a RAM disk file system that contains the drivers and the initialization code that
  enable the kernel to operate during the crash dump
These components are loaded into a reserved area of memory at boot time. If a node crashes, the
minimal kernel initializes the network drivers it needs and begins to dump the crash data. The
minimal kernel does not depend on the state of the system at the time of the crash.
A kdump service is part of the node_management and avail_node_management roles. In
the high availability configuration, the file system for the crash dumps resides on the shared
storage.
The kdump facility has a client piece and a server piece. The client prepares the minimal kernel
and initrd and reserves the memory area for them. The server side is an NFS export.
On the server side, you must set up the file system area that receives the crash dumps beforehand.
HP recommends that you use a dedicated file system with enough disk space to store a specified
number of dump files plus some additional space.
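The sizing recommendation above can be sketched as a small calculation. This is only an illustration: the function name dump_fs_size_mb, the per-dump size, and the headroom percentage are assumptions, not values from this guide. Choose figures that match the memory size of your nodes and your retention policy.

```shell
#!/bin/sh
# Hypothetical helper: estimate the size (in MB) of a dedicated
# crash dump file system.
#   $1 = number of dump files to keep
#   $2 = assumed size of one dump file, in MB
#   $3 = additional headroom, as a percentage
# Uses integer arithmetic, so the result is rounded down.
dump_fs_size_mb() {
    echo $(( $1 * $2 * (100 + $3) / 100 ))
}

# Example: keep 5 dumps of 4096 MB each, with 25 percent headroom.
dump_fs_size_mb 5 4096 25
```

With these assumed inputs, the estimate is 25600 MB.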
NOTE: In the high availability configuration, the file system for the crash dumps resides on the
shared storage. Only one server at a time, the kdump server, has the file system mounted locally
and exports it to the clients. There is an IP alias for it, and HP ServiceGuard handles the failover.
7.10.2 The crash Utility
The crash utility is a self-contained tool that you can use to investigate live systems or to examine
kernel core dumps created with the kdump package.
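As a rough sketch of the two uses described above, the following helper prints a crash command line for either a live system or a saved dump. The crash utility takes a kernel namelist (a vmlinux file with debugging information) and, optionally, a dump file. The function name crash_command is hypothetical, and the vmlinux location is an assumption based on where Red Hat style kernel-debuginfo packages usually install it; verify the path on your system. The dump file path in the example is a placeholder.

```shell
#!/bin/sh
# Hypothetical helper: print a crash command line.
#   $1 = kernel release of the system being examined (for example, from uname -r)
#   $2 = optional path to a dump file; omit it to examine the live system
# ASSUMPTION: vmlinux is under /usr/lib/debug/lib/modules/<release>/.
crash_command() {
    vmlinux="/usr/lib/debug/lib/modules/$1/vmlinux"
    if [ -n "$2" ]; then
        echo "crash $vmlinux $2"
    else
        echo "crash $vmlinux"
    fi
}

# Live system: use the release of the running kernel.
crash_command "$(uname -r)"

# Saved dump: use the release of the kernel that crashed (placeholder values).
crash_command kernel_release /path/to/vmcore
```

The helper only prints the command so that you can review it before running it as superuser.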
7.10.3 Installing kdump and crash
The kdump mechanism and the crash utility are installed during the installation of the HP XC
System Software. The RPMs for these facilities are:
kexec-tools-rev_number.platform.rpm
busybox-rev_number.platform.rpm
crash-rev_number.platform.rpm
kernel-debuginfo-rev_number.platform.rpm
kernel-debuginfo-common-rev_number.platform.rpm
mkinitrd-debuginfo-rev_number.platform.rpm
where:
rev_number
Is the revision number of the software.
platform
Is the platform code.
Use x86_64 for CP4000 systems.
Use ia64 for CP6000 systems.
If you did not install them during the HP XC installation, use the rpm command to install them
on the HP XC system:
1. Log in as superuser (root) on the head node.
2. Use the rpm command with the -ihv options to install the RPMs.
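The procedure above can be sketched as follows. The helper functions platform_code and kdump_rpm_command are hypothetical and only print the rpm command for review; they do not install anything. The rev_number placeholder is left as is because each package has its own revision number.

```shell
#!/bin/sh
# Hypothetical helper: map an HP XC platform name to its RPM platform code.
platform_code() {
    case "$1" in
        CP4000) echo x86_64 ;;
        CP6000) echo ia64 ;;
        *) echo "unknown platform: $1" >&2; return 1 ;;
    esac
}

# Hypothetical helper: print the rpm command for the kdump and crash
# packages on the given platform. Nothing is installed.
kdump_rpm_command() {
    code=$(platform_code "$1") || return 1
    printf 'rpm -ihv'
    for pkg in kexec-tools busybox crash kernel-debuginfo \
               kernel-debuginfo-common mkinitrd-debuginfo; do
        printf ' %s-rev_number.%s.rpm' "$pkg" "$code"
    done
    printf '\n'
}

# Example: show the command for a CP4000 system.
kdump_rpm_command CP4000
```

Before running the printed command as superuser on the head node, replace each rev_number with the revision number in the actual RPM file name.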