Using High Availability Monitors (June 2003)

Troubleshooting HA Monitors
Steps to Obtain EMS Data to Reproduce an EMS Problem
Appendix B 93
4. Set up everything for reproduction.
Backup the /etc/opt/resmon/ tree:
# mkdir -p /tmp/RESMON/before
# cp -Rp /etc/opt/resmon/* /tmp/RESMON/before
Save logfiles (if needed):
# cd /etc/opt/resmon/log
# tar cvf /tmp/oldlogs.tar *
# cd /var/opt/resmon/log
# tar rvf /tmp/oldlogs.tar diskmond.log
Clear logfiles:
# for i in /etc/opt/resmon/log/*log*
/var/opt/resmon/log/diskmond.log
> do
> rm $i
> done
5. Restart EMS and reproduce the problem.
# kill <pid of diskmond>
# kill <pid of registrar>
# date > /tmp/RESMON/startdate; kill <pid of p_client>
# <steps to reproduce>
6. Check if you have successfully reproduced the problem.
7. Collect the data.
# cd /etc/opt
# tar cvf /tmp/RESMON/after.tar ./resmon
# cd /var/opt/resmon/log
# tar rvf /tmp/RESMON/after.tar diskmond.log
8. Disable debug logging.
rm /etc/opt/resmon/debug
Edit the diskmond.dict file.
Kill diskmond, registrar, p_client.
NOTE If you are not troubleshooting diskmond, make sure that you include the
specific logfiles of the EMS Monitor you are looking at. Refer to the
documentation of the EMS Monitor to find the location of the logfile.