HP Insight Management Agents User Guide Part number: 308447-005a Fifth edition: June 2009
Legal notices © Copyright 2002, 2009 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Confidential computer software.
Contents 1 HP Insight Management Agents for Servers…………………………………………………………………………5 Browser requirements ............................................................................................................................... 5 Java Virtual Machine requirements ........................................................................................................ 5 Updating Netscape Communicator for Tru64 UNIX workstations....................................................................
3 Subsystem specific to a NetWare operating system………………………………………………………………79 Operating system overview ..................................................................................................................... 79 Summary page ...................................................................................................................................... 79 File System page.........................................................................................................................
1 HP Insight Management Agents for Servers The HP System Management Homepage version 8.25 and later acts as the Web server for the Management Agents. For additional information, see the System Management Homepage Online Help. Browser requirements The minimum browser requirements include support for tables, frames, Java™, JavaScript, and Java Development Kit (JDK) 1.1.
Obtaining JVM from Microsoft Java support for Internet Explorer can be downloaded from Microsoft at http://www.microsoft.com/java/sdk, where the JVM is contained in the Microsoft Software Development Kit (SDK) for Java. Microsoft Windows Update, http://windowsupdate.microsoft.com, also provides a Microsoft virtual machine (VM) (or updated version of the VM if available).
After you enter the URL, the System Management Homepage appears. Security The HP Insight Management Agents allow SNMP sets for some system parameters. This capability requires security that includes the three predefined users. Agents running on Microsoft and Linux operating systems have no default passwords. On a fresh install the administrator password, operator password, and user passwords are configured during installation.
• The WEBAGENT.INI file is located in the directory /var/opt/CPQIMddd/web/im/webagent if the pathname is /var/opt/CPQIMddd/bin/cpqthresh_mib, where the value ddd indicates the version of the Management Agents installed on the system. The Web Agent service must be stopped and restarted for any changes to take effect for Tru64 UNIX operating systems. Deploying the configurations to servers running Linux The configuration settings for the Management HTTP Server are stored in three files.
administrators if the file system enables it. For private key security reasons, HP recommends that the Management HTTP Server be installed on Windows NT® file systems (NTFS). NOTE: For Microsoft Windows operating systems, the \compaq\wbem subdirectory must exist on an NTFS file system for the private key to have administrator-only access through the file. If the private key has been compromised, the administrator can delete the \compaq\wbem\cert.pem file and restart the server.
NOTE: If you have enabled anonymous access, then you can access the System Management Homepage displays without asking for any credentials locally. 3. Enter the user name and password. If the login fails, specify the domain and user name username in the User Name field (domain name\username). 4. Click Sign In. The System Management Homepage appears. NOTE: For the Version Control Repository Manager, the anonymous login (if enabled) and the user login enable you to access all pages.
Header frame The header frame is always visible regardless of which tab you are viewing. A link, located in the top section, displays the path you are currently viewing along with the System Management Homepage tabs.
Host Name Displays full computer name of the system. System model The System Model displays the model of the device. In some cases, the System Model might display “Unknown” if the HP Insight Management Agents are not installed on the device. Management Processor – Integrated Lights Out Provides access to the HP Integrated Lights Out page associated with the system. Data Source Indicates which agents are populating data to the System Management Homepage.
Refresh Click on the Refresh link to refresh the System Management Homepage. . Body frame The body of the window displays the status for all HP management system details. System Management Homepage tabs The System Management Homepage displays up to five tabbed pages that enable you to access and configure settings related to participating HP web-enabled System Management software.
System Status The System Status window displays an icon for the overall status of the system. The system status is determined by the status of the system components. For example, if a system component has a status of “major,” then the overall system status is set to “major” even if all other components have a status of “minor.” Overall Status Summary The Overall Status Summary window displays the components that have a status of minor, major or critical.
• Logical Disks • Processors • Server • Memory • Network • Physical Disks • Processes • Processor Utilization Storage The Storage window displays the storage related information, this lists out the controller depending on the type of controller connected which may be any of the following types: • IDE Controllers • Drive array controllers • External Array Storage Systems • Fibre channel tape controller • SCSI controllers • SAS Host Bus Adapters System Configuration The System Conf
SNMP Webagent The HP Management Agents page is used to view and set the HP Management Agents configuration. It allows settings for Server Role, Data Collection Interval, SNMP Sets and Remote Reboot settings. If you make Management Agents Configuration changes, you must click the Restart Agents button for changes to the HP Management Agents (Server Agents, Foundation Agents, Storage Agents, and NIC Agents) settings to take effect on the server.
Management HTTP Server section The Management HTTP Server section provides links allowing you to configure your Management HTTP Server settings.
System Management Homepage. Some web applications restrict viewing of critical information from individuals with User access. • Local Server Certificate • Current Certificate—SMH allows setting a certificate with alternative names in addition to the Common Name (CN). Server names are separated by semi-colons without blank spaces.
Tasks Tab The Tasks tab displays links to task-oriented pages provided by participating HP Web-Enabled System Management software. NOTE: If the HP Web-Enabled System Management software provides no tasks, the Tasks tab is not visible. Logs tab The Logs tab includes various log information like Integrated Management Log, HP Version Control Agent Log, Integrated Lights-Out Log, and System Management Homapage Log.
Support tab The Support tab contains of links to available support services like – ProLiant Essential Software Info, Integrity Essential Software Info, Support Links, and Forum links. Help tab The Help tab contains information about the System Management Homepage infrastructure and its configuration and log pages. The rest of the entries in this column link directly to the help systems associated with the webapps installed in the system (that provide a help system).
2 Agent information Management Host agent The Management Host agent gathers data for the HP Host OS MIB.
System Agent Software version information The software version section displays the versions of the system software installed on this machine: BIOS, drivers, and agents. This section also displays a string that specifies the version of HP Insight Management Agents running on the system. Cluster information The cluster information section displays the overall status of a cluster.
• • Client/Internal—The network is used to connect client systems and for internal cluster communication. • None—The network is not used by the cluster for communication. State—Displays the current state of the network, which can be one of the following: • Online—The network is online and functioning normally. • Offline—The network is offline. • Partitioned—The network is operational, but two or more nodes on the network cannot communicate. Typically, a path-specific problem has occurred.
• Orring Diode Failed • Brownout • Give Up On Startup • NVRAM Invalid • Calibration Table Invalid • If the power supply is absent, the status is "Not Applicable" • Used Capacity (%)—Represents the current power supply capacity which is a percentage of its maximum capacity. • Used Capacity (W)—Represents the current power supply capacity in watts. • Max Capacity—Represents the maximum capacity of the power supply in watts. • Model—Represents the power supply model name.
NOTE: A Failed condition does not occur in a client PC since the power supply for the client is cut off if thermal condition reaches a permanently damaging level. • Unknown— If the Server Agents or the Server Agent cannot determine the status of the device, you may need to upgrade your driver software. If you are managing a client with an unknown temperature status, the client may not support thermal detection. • Sensor—The number that uniquely specifies the temperature sensor description.
• Disabled—ECC memory correction is supported, but errors are not logged for this device. • When a certain rate of errors is exceeded the health driver automatically disables logging of these errors, and sends an alarm. The errors are corrected, but are no longer logged. Logging is re-enabled when the driver is reloaded or the operating system restarts. • Not Supported—Logging of correctable memory errors is not available for this device.
• Mirroring—This system is configured for Mirrored Memory Protection. All memory banks are duplicated in Mirrored Memory, as opposed to only one for Online Spare Memory. If enough ECC errors occur, the spare memory is activated and the memory experiencing the errors is disabled. • RAID-XOR—This system is configured for Advanced Memory Protection using the XOR engine. • Advanced ECC—This system is configured for Advanced Memory Protection using the AdvancedEcc engine.
• Other—Memory type can not be determined. • Board—The value determine the memory module is permanently mounted (not modular) on a system board or memory expansion board. • CPQ Single Width Module • CPQ Double Width Module • SIMM • PCMCIA • Compaq Specific • DIMM • Small Outline DIMM • RIMM • SRIMM • DDR2 FB-DIMM • Size - Displays the Size of the memory Module in Mega Bytes. • Technology - Displays the technology of Memory Module.
Management Processor Integrated Lights-Out (iLO) NIC The NIC section displays the following information about the NIC in the iLO. Not all fields are supported by all models of Remote Insight Board and/or NIC. • Model—Displays the NIC model. • DNS Name—Displays the fully qualified DNS name assigned to this iLO. • Type displays if the NIC is embedded or pcmcia and whether it is ethernet or token ring. • IP Address—Displays the IP address for this NIC.
• Pending Alarm—Displays if the alert is unable to determine the state of the iLO, or if all alerts have been delivered or if there are alerts pending that still need to be sent. If your system supports the Integrated Lights-Out Edition, the following alerts are available: • Remote Insight Alerts—Allow users to enable or disable the alerts by clicking the button. • Host Alerts allow users to enable or disable the alerts by clicking the button.
Self Test Results The Self Test Results section indicates various error-status depending on the Remote Insight model.
Select Reset to Original Values to return to the original threshold values, or the values from the last time the thresholds were saved. This option also clears the Synchronize thresholds for all volumes checkbox so that thresholds can be set individually. Select Save Thresholds to save any thresholds that have been modified and delete any disabled thresholds.
interrupts and DPCs. A high rate of privileged time might be attributable to many interrupts generated by a failing device. This counter displays the average busy time as a percentage of the sample time. • % DPC Time—Percentage of time that the processor spent receiving and servicing deferred procedure calls (DPCs) during the sample interval. DPCs are interrupts that run at a lower priority than standard interrupts.
• Context Block Queue/sec—Rate per second at which the work context blocks must be placed on the FSP queue of the server to await server action. • % Total PageFile Usage (Thresholds Supported) —Amount in percent of the Page File instance in use. For details, see the Process Object: Page File Bytes information. • Available KBytes —Amount of physical memory available to processes running on the computer. It is calculated by summing space on the Zeroed, Free, and Stand-by memory lists.
Nonpaged Bytes, so it might not equal Process: Pool Nonpaged Bytes: _Total. This counter displays the last observed value only; it is not an average. • Cache Copy Reads/sec—Frequency of reads from pages of the file system cache that involve a memory copy of the data from the cache to the application's buffer. • Cache Copy Read Hits %—Percentage of cache copy read requests that hit the cache, that is, they did not require a disk read to provide access to the page in the cache.
• Page Faults/sec—Rate at which the page faults occur in the executing threads within this process. A page fault occurs when a thread refers to a virtual memory page that is not in its working set in main memory. This does not cause the page to be fetched from disk if it is on the standby list and already in main memory, or if it is in use by another process with whom the page is shared.
• • Firmware Primary/Secondary—The array controller is using a primary/secondary algorithm implemented in the controller firmware and the operating system driver. • Unknown—Indicates that the Storage Agents cannot determine the redundancy type for the controller. You may need to upgrade the Storage Agents. Redundancy Error—Displays the redundancy error for the controller. The following values are valid: • No Failure—No failures have been detected.
• • High—Indicates the rebuild priority is high. • Unknown—Indicates that the rebuild priority is not recognized. You may need to upgrade the Storage Agents. Expand Priority—Displays the logical drive expand priority of the controller. The following values are valid: • Low—Indicates the expand priority is low. • Medium—Indicates the expand priority is medium. • High—Indicates the expand priority is high. • Unknown—Indicates that the expand priority is not recognized.
temporarily suspended until the batteries are fully charged. Array accelerator operations automatically resume when charging is complete. • Degraded—The battery pack is still operating, but one of the batteries in the pack has failed to recharge properly. Your board should be serviced as soon as possible. • Not Present—The battery pack is not present. Some controllers do not have a battery-backed cache. • Unknown—The Storage Agents do not recognize the battery status.
• POST ECC Errors—Indicates that write cache operations have been permanently disabled. The cache has been disabled due to a large number of ECC errors detected while testing the cache during the Power On Self Test (POST). • Battery Hot Removed—Indicates that write cache operations have been permanently disabled. The cache has been disabled because a battery has been hot removed. • Unknown—The Storage Agents do not recognize the error code. You may need to update your software.
Physical Drive Information This section provides an overview of all disk drives attached to the controller. Each physical drive is listed as a separate entry in the Mass Storage submenu. The information displayed next to the physical drive includes the condition of the drive, the location of the drive and drive size. Select any of the physical drives from the Mass Storage submenu to display more information about the drive.
• Current Speed—Displays the current negotiated data transfer rate for a SCSI physical drive. The possible values are: • Asynchronous—The negotiated data transfer rate for this drive is asynchronous. • Fast—The negotiated data transfer rate for this drive is 20 megabytes per second. • Ultra—The negotiated data transfer rate for this drive is 40 megabytes per second. • Ultra2—The negotiated data transfer rate for this drive is 80 megabytes per second.
• • Status—Indicates the status of the data path. Possible values are: • OK—The path is operational. • Link down—The path is not operational. • Unknown—The path status cannot be determined. Role—Indicates the role of this path in the configuration. Possible values are: • Active—This path is the preferred data path to the physical drive. • Alternate—This path is the alternate data path to the physical drive. • Unknown—The role of this path cannot be determined.
Predictive Indicators Use the Predictive Indicators to predict when a drive, which is now operating normally, may need to be replaced. • S.M.A.R.T. Status—Displays the S.M.A.R.T. status as reported by the physical drive. This is only displayed if the drive supports S.M.A.R.T. predictive failure. The possible values are: • Other—The Storage Agent is unable to determine the status of S.M.A.R.T. predictive failure monitoring for this drive. • OK—Indicates the drive is functioning properly.
IMPORTANT: Never turn off a storage system when the attached system is still turned on. • Check the physical proximity of the system to other electrical devices. Since electrical noise may cause this error, check the AC circuit for other electrical devices. Timeouts can be caused when two or more drives are set to the same SCSI ID. Ensure that the storage system and system SCSI IDs do not conflict. • On a storage system, check the SCSI ID cable on the drive tray.
• Reallocation Aborts—When the physical drive is failed due to an error that occurred when the controller was trying to reallocate a bad sector, a Reallocation Abort error occurs. Because of the nature of magnetic disks, certain sectors on a drive may have media defects. The reallocation area part of the drive is set aside to compensate for these defects. The array controller writes information addressed from unusable sectors to available sectors in the reallocation area.
• Hard Read Errors—Displays the number of read errors that could not be recovered by a physical drive's Error Correction Code (ECC) algorithm or through retries. Over time, a drive may produce these errors. If you receive these errors, a problem may exist with your drive. The severity of these errors depends on whether the managed system is running in a fault tolerant mode. With fault tolerance, the controller can remap data to eliminate the problems caused by these errors.
• Shutdown—Indicates that the drive array enclosure that contains the logical drive has overheated. The logical drive is no longer functioning. • Expanding—Indicates that the logical drive is currently doing Automatic Data Expansion. During Automatic Data Expansion, fault tolerance algorithms redistribute logical drive data to the newly added physical drive. • Not available—Indicates that the logical drive is currently unavailable.
• RAID 50—Distributed data guarding (RAID 5) with multiple parity groups. • RAID 60—Advanced data guarding (RAID 6) with multiple parity groups. • Unknown—You may need to upgrade your software. • Capacity—Displays the size of the logical drive. • Accelerator—Indicates whether the logical drive has an Array Accelerator board configured and enabled. The following values are valid: • Enabled—The Array Accelerator board is configured and enabled for this logical drive.
• Serial Number—Displays the unit serial number for the tape library. Use this value for identification purposes. • Protocol Type—Displays the protocol used to communicate with the tape library. The following values are valid: • • • Parallel SCSI—The protocol is Parallel SCSI. • Serial ATA—The protocol is Serial ATA. • SAS—The protocol is Serial Attached SCSI. • Unknown—The Storage Agents are unable to determine the protocol.
• Missing - Was Offline —Indicates that a tape drive that was located in a system and had a status of offline has been removed. • Unknown—Indicates that the state of the tape drive cannot be determined. You may need to upgrade the Storage Agents. • Model—Displays the model name of the tape drive. Use this value for identification purposes. • Firmware Revision—Displays the firmware revision level of the tape drive. Use this value for identification purposes.
• Tape Errors—Displays the total number of read and write errors encountered. This value is maintained from the moment the Tape Hardware Interface driver was loaded. Tape errors may occasionally occur. If this value rises dramatically, clean the device. If you continue to have errors, you may have a problem.
• Fan Status—Displays the status of the fan subsystem in the drive enclosure, or box. The following values are possible: • OK—The fan subsystem is working properly. • Failed—A fan has failed and there are not enough fans in the fan subsystem to keep the enclosure cool. Check your fan subsystem as soon as possible. Continued operation may cause failure of the drives. • Degraded—A fan has failed but there are still enough fans in the fan subsystem to keep the enclosure cool.
• Path—Indicates the path to the storage system. Each path is identified by a descriptor. For example, "Port 2E Box 1" indicates a path from the host adapter external port number 2 ("Port 2E") to the first box. • Status—Indicates the status of the data path. Possible values are: • OK—The path is operational. • Link down—The path is not operational. • Unknown—The path status cannot be determined. • Role—Indicates the role of this path in the configuration.
the channel, then “Channel unknown” displays. If the device position cannot be determined, then “Device unknown” displays and driver software or the Storage Agents might need to be updated. Select any of the disk drives from the submenu to display more information about the disk drives. The following information is displayed for all disk drives: • Model—Displays the model of the disk drive. • Status—Displays the current status of the disk drive.
• • • Rebuilding—Indicates that the logical drive is rebuilding a physical drive. When complete, the logical drive returns to normal operation. • Failed—Indicates that more physical drives have failed than the RAID level of the logical drive can handle without data loss. • Unknown—The agent cannot determine the logical drive status. You might need to upgrade your software. Fault Tolerance—Displays the fault tolerance mode of the logical drive.
• • Compaq 32-Bit Fast-Wide SCSI-2/E Controller • Compaq 32-Bit Fast-Wide SCSI-2/P Controller • Compaq Wide-Ultra SCSI Controller • Compaq Wide-Ultra2 SCSI Controller • Compaq 64-Bit Dual Channel Wide-Ultra2 SCSI Controller • Compaq Wide Ultra3 SCSI Adapter • HP 64-Bit/133MHz PCI-X 2CH Ultra320 HBA • The StorageWorks Library Adapter • Third-party SCSI Controller Model • Unknown—The driver software or storage agents might need to be upgraded, or you have a SCSI controller in the system th
• Power Drive CD-ROM—A storage device that can read from a CD and write to or read from an optical disk. • CR3500 RAID Controller—A three-channel SCSI RAID controller. • Scanner—A scanning device. • Optical—An optical memory or storage device. • Jukebox—A media-changer device, such as a jukebox. • Tape Library—A tape library or autoloader device. • Communications Device—A communications device, such as a LAN bridge. • Unknown—You might need to upgrade your support software or Storage Agents.
• • Not supported—Indicates that the library cannot detect or report redundancy status. • Unknown—The redundancy status of the tape library cannot be determined. Ensure the latest drivers and Storage Agents are installed. Hot Swap—Displays the tape library hot swap status which denotes the presence of hot swappable internal components, such as drives, fans, power supplies, etc.
• • Model—Displays a description of the SCSI tape autoloader media changer model as returned by the SCSI inquiry command. • Firmware Version—Displays the firmware revision level of the tape autoloader media changer as returned by the SCSI inquiry command. • Serial Number—Displays the serial number assigned to the tape autoloader media changer. This value is based on thee serial number as returned by the SCSI inquiry command.
• Vendor—Displays the vendor name for the CD Library. This item can be used for identification purposes. • Model—Displays the model name of the CD Library. This value can be used for identification purposes. • Serial Number—Displays the serial number of the CD Library. This value can be used for identification purposes. • Firmware Version—Displays the firmware revision of the CD Library.
• • Duplex Top—This storage system is the top part of a duplexed unit. • Duplex Bottom—This storage system is the bottom part of a duplexed unit. • None—A duplex option is not installed. Power Supply Status—Displays the status of the Redundant Power supply. The following values are possible: • OK—All component power supplies that make up the redundant power supply are in normal working order. • Degraded—One of the component power supplies that make up the redundant power supply has failed.
• • Available—This drive supports predictive failure monitoring. • Unknown—The Storage Agents cannot determine if the drive supports predictive failure monitoring. You might need to upgrade your driver or Storage Agents. Placement—Indicates if the physical drive is in an internal or external storage system. The following values are valid: • Internal—The physical drive is in an internal storage system. • External—The physical drive is in an external storage system.
• Failure Indicator—Use this utility to determine the cause of failure for a failed drive. If the drive has failed and this counter is non-zero, replace the drive. If the drive condition is OK and the failure indicator is not zero, the drive might have an intermittent problem and you might have to replace it. There is no other corrective action for this error. • Self-Test Errors—Displays the number of times that a physical drive failed its self-test.
SCSI logical drives Select a SCSI logical drive from the SCSI controller submenu to display the following information. The following values are valid: • Status—Shows the status of the physical drive selected. • OK—The logical drive is in normal operation mode. No user action is required. • Failed—There are more failed physical drives than the fault tolerance mode of the logical drive can handle without data loss. • Unconfigured—The logical drive is not configured.
Spare drives A list of spare drives that can be used by this logical drive to replace a failed drive. Select one of the listed spare drives to see more information about the drive. SCSI bus information Select a SCSI device from the SCSI controller submenu to display more information about the device. The following information might appear depending on the type of device: • Parity Errors—Displays the number of parity errors that occurred on the SCSI bus while the bus was processing commands.
NOTE: If the current data transfer width is Narrow (8 bits) then the speed in megabytes per second is equal to the million transfers per second speed. If the current width is Wide (16 bits) then the speed in megabytes per second is twice the million transfers per second speed. For example, if the current speed is Ultra and the width is Wide then the speed would be 40 megabytes per second.
Use the ASR Reset Limit feature in conjunction with the ASR Reset Count feature in the same window. The ASR Reset Count feature displays the number of times that ASR has rebooted the server. If the ASR Reset Count is approaching the reset limit, immediately investigate the server for problems by checking the Critical Error Log and running Diagnostics. • ASR Reset Count—Displays how many times the ASR feature has rebooted the server. ASR reboots (or resets) the server a limited number of times.
• Serial Number—Displays the Physical serial number of the device or client system board. Use this number for identification and registration purposes. N/A appears if you do not have a device or client that supports the asset management feature. Use the System Configuration Utility (or the appropriate utility for your device or client) to enter a system serial number if one does not appear and you have a device or client that supports the asset management feature.
• Other—The system does not support Advanced Memory Protection or the Management Agent cannot determine the status. • Not Protected—This system supports Advanced Memory Protection but the feature is disabled. • Protected—The system supports Advanced Memory Protection. The feature is enabled but not engaged. • Degraded—The system was protected, but the Advanced Memory Protection has been engaged; therefore Advanced Memory Protection is no longer available.
• Advanced Memory Protection Condition—displays the current condition of the Advanced Memory Protection subsystem. The following states are supported: • Other—The system does not support fault tolerant memory or the Management Agent cannot determine the state. • OK—This system is operating normally. • Degraded— The Advanced Memory Protection sub-system has been engaged. Schedule server down time to replace the deactivated memory.
• FPM (Fast-Page Mode) • EDO (Extended Data Out) • BEDO (Burst Extended Data Out) • Synch DRAM (Synchronous DRAM) • Unknown System Resources This section lists the resources in use by the device or client workstation in the following order: • IRQ Numbers—The Interrupt Request number displays, followed by the slot number of the board that is using this interrupt. • Port Address—The port address range displays, followed by the slot number of the board that is using this I/O port range.
• Monitor Serial Number displays the serial number for the monitor. Use this number for identification purposes. • Monitor Manufacture Date displays the monitor's date of manufacture date Enclosure Information • If your system is a blade server based system, the navigation frame displays the Enclosure Information link under the Configuration section. The middle Navigation frame is the Enclosure Information window and it displays the detailed information about the enclosure.
• Enclosure Model • Asset Tag • Part Number • Spare Pat Number • Serial Number • Firmware Revision • Previous Enclosure Name • Next Enclosure Name • Height \ Width \ Depth • Unit Identification LED • Previous Enclosure Serial Number • Next Enclosure Serial Number • Address • Maximum Blade Number • Enclosure Blade Information • Name • Part Number • Spare Part Number • Position (Slot Number) • Height \ Width \ Depth • Number of Bays Occupied • Enclosure Fuses • Con
Namespace: root\default Object Class: CPQ_System_Performance Subclasses: • • • • • • • CPQ_System • ContextSwitchRate • CpqQueueLength • Processes • RegistryUsage • SystemUpTime • TotalThreads CPQ_Server • AccessPermissionErrors • ContextBlockQueueRate • GrantedAccessErrors • LogonErrors • ServerSessions • SessionsErroredOut • TotalByteRate CPQ_Processor • CpuTimePercent • CpuUserTimePercent • InterruptRate • PercentDPCTime • PercentInterruptTime • PrivelegedCpuTi
• • • • • DiskWriteBytesPersec • DiskWritesPersec • PhysicalDisk CPQ_LogicalDisk • DiskQueueLength • DiskTimePercent • FreeMegabytes • FreeSpacePercent • LogicalDisk CPQ_NetworkInterface • BytesReceivedPersec • BytesSentPersec • CurrentBandwidth • NetworkInterface • OutputQueueLength • PacketOutboundErrs • PacketRate • PacketReceiveErrs • PacketsReceivedPersec • PacketsSentPersec • TotalByteRate CPQ_Tcp (CPQ_Tcpv4 for Windows Server 2003) • ConnectionFailures • C
IMPORTANT: To mark entries as corrected, the agents must have sets enabled and the SNMP Community string must be correct. The following list describes errors that might be logged. If any of these errors occur, run System Diagnostics on the system or see the software documentation. • Abnormal Program Termination—A device has detected a fatal software error resulting in a device failure. • ASR Base Memory Parity Error—The system detected a data error in base memory following a reset because an ASR timeout.
• Server Manager Failure—An error occurred in the server interface with the Server Manager. • UPS A/C Line Failure/Shutdown or Battery Low—The device has initiated a UPS or operating system shutdown, or the battery is almost depleted after an AC line failure. The Last Failure Message on this window displays the last failure message associated with a critical error. Correctable errors This alarm indicates that a block of memory has failed or is failing and might need to be replaced.
3 Subsystem specific to a NetWare operating system Operating system overview HP now provides operating system management for NetWare environments. The NetWare operating system information on the server being monitored displays when running NetWare 4.x with NetWare Management Agent from Novell or NetWare 5.0,.
File System page The File System page provides the file system information for the server. The page is divided into the following three sections: • File System • File Volume • Open Files File System The File System section displays the following information: • Reads (Kbytes)—The total number of Kilobytes read by the file system. This value provides a measure of server activity. • Writes (Kbytes)—The total number of Kilobytes written by the file system.
General information The General Information section displays the following information: • User Count—The number of entries in the User Account Table. • Logged-In Users—The number of licensed connections (logins) in this file server. • Maximum Logins—The maximum number of licensed connections (logins) supported by this file server. The value is zero if the maximum number is unlimited. • Connection Count—The current number of entries in the Connection Table.
• Memory (bytes) (sortable)—The total memory, in bytes, used by the NLM. This value is a composite of Short Term Memory, Semi-Permanent Memory, and Non-movable Cache Memory allocated by the NLM, plus the sizes of the code and data sections of this instance of an NLM. • Description (sortable)—A brief description of the NLM. • Version (sortable)—The major and minor version numbers of the NLM. Server Parameter page The Server Parameter page includes a table of the NetWare set parameters.
• InwDos • Other • Description (sortable)—A brief description of this partition. • Size (sortable)—The size (in Kilobytes) of this physical partition. Adapter Information page The Adapter Information page displays general information for each adapter board in the host. The Adapter table is sorted by the Description value by default. Description—A description of the hardware information for the adapter. The description usually includes manufacturer, model, and version information.
4 CR3500 RAID Array SCSI controller Mass storage RAID Array This section displays RAID array information. Five banners appear in this section. RAID array status • • Status—Displays the status of the RAID array. The following conditions are valid: • Good—The RAID array is fully operational. • Reduced—The RAID array is operating in a degraded or reduced state. One or more of the physical drives that make up the RAID array are either missing or failed.
• Formatting—The drive is currently being formatted.
• Critical—The controller temperature sensor has detected a critical temperature condition in the primary enclosure. • Non-critical—The controller temperature sensor has detected an abnormal temperature condition in the primary enclosure. • Unknown—The enclosure temperature information is unavailable. • Primary enclosure temperature—Displays the current primary enclosure temperature in degrees Celsius. • Primary enclosure fan status—Displays the current primary enclosure fan status.
5 Where to go for additional help In addition to this guide, the following information sources are available: • HP Insight Management Agents 7.30 Installation Guide • HP Insight Manager software Telephone numbers For the name of the nearest HP authorized reseller, see the HP website http://www.hp.com/service_locator. For HP technical support: • In North America: • Call 1-800-HP-INVENT (1-800-474-6836). This service is available 24 hours a day, 7 days a week.
Appendix A Troubleshooting Insight Management Agents for Servers issues Inability to perform remote reboot on a server from the Management Console 1. Load CPQAGIN. 2. Verify that Remote Reboot is enabled. 3. Verify that Sets are enabled.
When attempting to browse to Web-Enabled System Management software on port 2381, the system is not accessible because of an invalid password This issue is due to either of the following conditions: • This system is not fully configured and is not accessible because there is not a valid administrator password. An administrator password must be configured by either reinstalling the Web-enabled System Management software or following the instructions in the Security white paper located at http://www.hp.
d. Click Reset>OK. • There are known browser issues in the Management Agents for Tru64 UNIX. However, many have been corrected in later releases. For additional information, see the Management Agents for AlphaServers for Tru64 UNIX Reference Guide. SNMP community string issues • In NetWare, the community string can be changed through the INETCFG utility. After the community string has been changed, the administrator must restart the server at a convenient time to have the change take effect.
4. Click OK. When prompted to restart the Management Agents, click Yes. Insight Manager cannot manage a system If the Microsoft® SNMP service was installed after installing the Management Agents for Servers, run Install from the Management CD, and select Express to automatically update the Management Agents. Cannot delete the CPQMGMT.CPL file when uninstalling or upgrading When running Windows® 2000, close the Control Panel before deleting the CPQMGMT.CPL file.
NOTE: The HP Diagnostics utility initializes physical drives attached to HP Drive Arrays. Disk Subsystem button disabled in the Disk Storage window This condition might result because the Drive Array Agent is not loaded. Open the Management Agents by selecting the icon from the Windows® Control Panel. Verify that the Drive Array Information is located in the Active Agents lists. SCSI Adapter button is disabled This condition might result because the HP device driver is not loaded.
The HP Foundation Agents service could not terminate agent "CPQMHOST." The data contains the error code. The HP Foundation Agents service could not start agent "CPQMHOST." The data contains the error code. • NIC information does not appear. • Server Status Information appears incorrectly. For example, the Server Status information is highlighted in green. However, when the customer drills down to mass storage, the Server Status information is highlighted in red.
System restart to disk-based utilities fails for NetWare If you click Reboot on the Insight Manager display, and select the option to boot to the disk-based utilities, the server might fail to start the disk-based utilities. Use the System Configuration utility to verify that the utilities are actually installed on the system partition. Inability to view Web pages on the NetWare server 1. Verify that the latest Novell operating system patch is installed on the server to provide a recent version of TCPIP.
Glossary Automatic Server Recovery (ASR) A server feature designed to automatically restart the server after a critical hardware or software error. If a critical error occurs, the server records the error in the Server Health Logs, reboots the system, and pages (if a modem is installed in the server). Client A computer connected to a server on the network. Community String The SNMP Community String is similar to a password, offering a limited amount of protection for the SNMP data.
Simple Network Management Protocol (SNMP) SNMP defines a set of commands that a management application uses to retrieve or change the values of items a management agent makes available. SNMP Multiplexing (SMUX) The protocol defining a mechanism for communication between SNMP agent and multiple user daemons (called SMUX peers). The SMUX protocol has the advantage of being vendor-independent.
Index A F nodes, cluster, 22 Novell NetWare: community string settings, 90 accounts, user, 8 Adapter Information Page, 83 array controllers, 84 ASR POST failure, 93 File System Page, 80 Foundation Agents: configuration subsystem, 22 O B browser, 89 browser requirements, 5 C Client data collection, 21 clusters: Foundation Agents, 22; RAID Array controller, 85 colors in browser, 89 community strings, SNMP, 90, 93 community strings, SNMP, 91 Compaq CR3500 RAID Array SCSI Controller: clustered controller
software, cluster, 23 spare drives: RAID Array, 84 subsystem button, disabled, 92 Summary Page, 79 System Management Homepage, 9, 11 system restart problems, 94 T tape device errors, 93 telephone numbers, 87 thresholds: SNMP, 92 trap messages: troubleshooting, 93 troubleshooting: Web-enabled Management Agents, 89 troubleshooting, 90 U user accounts, 8 User Information Page, 80 V values, inability to change, 93 W WEBAGENT.