HP-UX HB v13.00 Ch-08 - Crash Dumps

HP-UX Handbook Rev 13.00 Page 20 (of 38)
Chapter 08 Crash Dumps
October 29, 2013
that) they get decompressed automatically during the execution of crashinfo. This can take a while. Be sure to
have enough space left in the crash directory.
With the help of the webcourse mentioned above it should be possible to solve most of the
problems.
Anyway in some cases you might need information that is beyond the standard output of
crashinfo. In this case you can use one of crashinfo’s options or use the P4 debugging
environment to perform a deeper analysis.
At this point you have to decide whether to ship the debugging tools to the customer and provide
a remote connection for a HP RCE or to ship the dump to the Response Center, either via ftp
upload or on a DDS tape or CD-ROM by mail. Which of the above possibilities (remote login,
ftp upload, ship by mail) is appropriate depends on availability of remote login, the size of the
dump and the severity of the problem.
If remote login is not possible, ship the dump to the Response Center
If you choose to analyze the dump on the customers system, obtain the tools either from the
Ktools server (refer to the Additional information section below). Select p4, then shared, then
internet or via anonymous ftp from
ftp://tahoe.grc.hp.com/dumpreading/dump_analyse.tar.gz (HP internal)
Size is about 7MB.
Unpack the files e.g. below /usr/contrib/dumpreading on the system, where the dump is located.
Before starting you need to set some path variables:
export P4_ROOT=/usr/contrib/dumpreading
export PATH=”${P4_ROOT}/bin:${P4_ROOT}/p4:${PATH}”
export SHLIB_PATH=”${P4_ROOT}/bin:${P4_ROOT}/p4”
Either put the above lines in /etc/profile or simply source the included set_env file in order
to set these variables:
# cd /usr/contrib/dumpreading
# . ./set_env
If remote login is not possible, ship the dump to the Response Center
If you choose to ship the dump to the Response Center additional information from the
customers system depending on the type of the crash is needed.
Examine crashinfo output to determine which of the crash event types this is:
PANIC the system ran into an unhandable condition and paniced
TOC the system was hung and you TOCed it
SG TOC the TOC was initiated by MC/ServiceGuard via the safety timer mechanism
HPMC High Priority Machine Check. The crash was caused by a HW failure
Example:
# cat ci.out | grep "Note: Crash"
Note: Crash event 0 was a PANIC !