Maintenance Guide HP bh5700 ATCA 14-Slot Blade Server First Edition Manufacturing Part Number : AD171-9605A June 2006
Legal Notices The information in this document is subject to change without notice. Hewlett-Packard makes no warranty of any kind with regard to this manual, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. Hewlett- Packard shall not be held liable for errors contained herein or direct, indirect, special, incidental or consequential damages in connection with the furnishing, performance, or use of this material. Restricted Rights Legend.
Contents 1. Field Replaceable Unit Removal and Replacement Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Required Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Power-Off Field Replaceable Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents HP bc2100 ATCA Server Blade Hard Disk Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Removing the Hard Disk Drive and Mounting Kit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Replacing the Hard Disk Drive and Mounting Brackets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . HP bc2100 ATCA Server Blade Memory Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents Network Link and Speed LEDs (10, 100, 1000 Mbps Ethernet) . . . . . . . . . . . . . . . . . . . . . . . . . . . . User-Programmable LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ethernet Controller Port State (Base Ethernet Activity/Link) LEDs . . . . . . . . . . . . . . . . . . . . . . . . Hot-Swap LED (DS10) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents Power-On Self Test (POST). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Online Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . shmm_uart_test [-e] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . shmm_eth_test [-a ] . . . . . . . . . . . . . . . . . . . . . .
Contents Module. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Device . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parameter . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents /MINUTES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . /LOOPS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . /N . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . /QUIT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents 9
Contents 10
Contents 11
Contents 12
Tables Table 1-1. HP bh5700 ATCA 14-Slot Blade Server Field Replaceable Units . . . . . . . . . . . . . . . . . . 22 Table 2-1. Troubleshooting the Ethernet Switch Blade with Status Indicators. . . . . . . . . . . . . . . . 78 Table 2-2. Solution Table for LED Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Table 2-3. Shelf Alarm Display Panel LED Indicators. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Table 2-4.
Tables Table 3-10. Test Sequence Pane Button Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Table 3-11. Test Message Pane Button Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Table 3-12. Test Summary Pane Button Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Table 3-13. Device Node Field Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Table 3-14.
Figures Figure 1-1. Electrostatic Discharge Wrist Strap Connection Sockets. . . . . . . . . . . . . . . . . . . . . . . . 28 Figure 1-2. Power Entry Module (PEM) A, Power Cables and Returns Attached . . . . . . . . . . . . . . 31 Figure 1-3. PEM with Fuses installed (Showing Backplane Connector Power Contacts) . . . . . . . . 33 Figure 1-4. Shelf Manager (ShMM) Front Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 Figure 1-5.
Figures Figure 3-1. Diagnostics Boot Option . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Figure 3-2. Default Device Selection for Diagnostics Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Figure 3-3. Network Configuration Agent Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Figure 3-4. XTC Program Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
About This Document 17
Publishing History The document printing date and edition number indicate the document’s current edition. The printing date will change when a new edition is produced. Document updates may be issued between editions to correct errors or document product changes. The latest version of this document is available on line at: docs.hp.com.
HP Encourages Your Comments HP encourages your comments concerning this document. We are truly committed to providing documentation that meets your needs. Please send comments to: http://docs.hp.com/en/feedback.html. Please include document title, manufacturing part number, and any comment, error found, or suggestion for improvement you have concerning this document. Also, please include any comments about what you think we did right, so that we can incorporate it into other documents.
1 Field Replaceable Unit Removal and Replacement Chapter 1 21
Field Replaceable Unit Removal and Replacement Introduction Introduction This section provides detailed procedures for removing and replacing all HP bh5700 ATCA 14-Slot Blade Shelf Field Replaceable Units (FRUs), including important safety information. Field Replaceable Units (FRUs) for the HP bh5700 Telco Blade 14-Slot Shelf include those shown in Table 1-1.
Field Replaceable Unit Removal and Replacement Introduction Table 1-1 HP bh5700 ATCA 14-Slot Blade Server Field Replaceable Units (Continued) FRU Description Adapter Cable, RS-232 Serial Connection, RJ-45 to DB-9 Manufacturing P/N Replacement P/N Exchange P/N (if repairable) A6900-63006 The above part numbers are current as of the publication date of this manual. Check http://partsurfer.hp.com/cgi-bin/spi/main for the most current part numbers.
Field Replaceable Unit Removal and Replacement Required Tools Required Tools The following tools are required to safely remove and replace components within the chassis • ESD safe mat • Torx T-20 driver • Torx T-15 driver • #1 Phillips screwdriver • #2 Phillips screwdriver • 10-mm nut driver 24 Chapter 1
Field Replaceable Unit Removal and Replacement Power-Off Field Replaceable Units Power-Off Field Replaceable Units Field Replaceable Units are considered to be power-off FRUs if electrical power must first be removed from the 14-Slot Shelf before the FRU can be removed and replaced.
Field Replaceable Unit Removal and Replacement Hot-Swap Field Replaceable Units Hot-Swap Field Replaceable Units Hot-swap FRUs are considered such if they can be removed from the 14-Slot Shelf and be replaced while the Shelf remains fully operational, and if they require no software intervention before being removed.
Field Replaceable Unit Removal and Replacement Safety and Environmental Considerations Safety and Environmental Considerations Safety and environmental considerations include electrical safety, communications interference, and electrostatic discharge hazards to equipment. Electrical Safety Electrical safety precautions must be followed at all times when working on the 14-Slot Shelf. WARNING High Voltage. There are no system power controls on the 14-Slot Shelf.
Field Replaceable Unit Removal and Replacement Safety and Environmental Considerations Figure 1-1 Electrostatic Discharge Wrist Strap Connection Sockets 1. ESD Socket, Lower Left Front 2. Shelf Manager (ShMM) 3. ESD Socket, Lower Left Rear 4. Power Entry Module, (PEM) B Take these ESD precautions: • Prepare an ESD-safe work surface large enough to accommodate the various assemblies to be handled during servicing.
Field Replaceable Unit Removal and Replacement Removing Power from the Shelf Removing Power from the Shelf In a telecommunications environment, the 14-Slot Shelf is typically installed with a separate High Availability (HA) four-domain input power supply configuration to each of the 14-Slot Shelf Power Entry Modules (PEMs). In this configuration input power can be removed from either or both PEMs on the Shelf, as required. WARNING WARNING: High Voltage.
Field Replaceable Unit Removal and Replacement Power Entry Modules (PEMs) and PEM Fuses Power Entry Modules (PEMs) and PEM Fuses The two 14-Slot Shelf PEMs are located at the lower right and left corners at the rear of the Shelf. The PEM is a hot-swap unit. Power Entry Module fuses are mounted in the main body of each PEM, and are accessible only after the respective PEM has been removed from the Shelf. WARNING Electrical Shock Hazard to Personnel.
Field Replaceable Unit Removal and Replacement Power Entry Modules (PEMs) and PEM Fuses Figure 1-2Power Entry Module (PEM) A, Power Cables and Returns Attached Step 3. With the blue hot-swap LED steadily illuminated and all input power disabled/disconnected, remove the PEM Terminal Block Cover (#5, Figure 1-2. Step 4. Remove and label each terminal block power and return cable for replacement hookup. Step 5. Loosen both PEM fixing screws (#7, Figure 1-2). Step 6.
Field Replaceable Unit Removal and Replacement Power Entry Modules (PEMs) and PEM Fuses Step 3. Connect the power cables and return lines, ensuring that each power domain cable and return line is attached to the correct power terminal lug (Figure 1-2). It may be easier to connect all PEM cables before actually replacing the PEM into the 14-Slot Shelf chassis. Step 4. Position the terminal block cover over the terminal block, then tighten the terminal block cover retaining screws (Figure 1-2). Step 5.
Field Replaceable Unit Removal and Replacement Power Entry Modules (PEMs) and PEM Fuses Figure 1-3PEM with Fuses installed (Showing Backplane Connector Power Contacts) Step 4. Replace the PEM, following the replacement procedures provided above. Step 5. Re-enable or re-connect all power domain circuit breakers, and apply power to the reinstalled PEM. Step 6. With all fuses either replaced or having tested as functional, the green PEM OK LED (#4, Figure 1-2) should illuminate steadily. Step 7.
Field Replaceable Unit Removal and Replacement Shelf Manager Shelf Manager The 14-Slot Shelf contains two redundant Shelf Managers (ShMMs) as hot-swap FRUs. These units are located in the lower left and right corners at the front of the Shelf, and protrude into the air intake plenum. Both ShMMs are accessible from the outside front of the Shelf. WARNING Potential Electrical Shock Hazard to Personnel.
Field Replaceable Unit Removal and Replacement Shelf Manager Step 2. When the blue H/S LED illuminates continuously, pull the extraction handle as far as it will go. This will release the ShMM unit from the 14-Slot Shelf. Step 3. Remove the ShMM by pulling it out of its 14-Slot Shelf mounting slot, and place it in an anti-static bag. Replacing a Shelf Manager CAUTION Electrostatic Discharge Hazard. Observe all ESD safety precautions while completing this procedure.
Field Replaceable Unit Removal and Replacement Ethernet Switch Blade Ethernet Switch Blade The Ethernet Switch Blade (typically installed in 14-Slot Shelf physical hub slots 7 and 8) is a hot-swap FRU and can be removed and replaced while the 14-Slot Shelf is fully operational. WARNING Electrical Shock Hazard. System power is applied to the 14-Slot Shelf at all times during this hot-swap removal and replacement procedure. Observe all electrical safety procedures while working on this hot-swap unit.
Field Replaceable Unit Removal and Replacement Ethernet Switch Blade Figure 1-5Ethernet Switch Blade Front Panel (Lower Half), Showing Blue Hot Swap LED Step 3. With the blue hot-swap LED illuminated continuously, simultaneously pull both extraction handles to the full-out extract position. This will disengage the Ethernet Switch Blade backplane connectors from the 14-Slot Shelf chassis. Step 4. Carefully pull the Ethernet Switch Blade out of its mounting slot.
Field Replaceable Unit Removal and Replacement Ethernet Switch Blade Step 2. Carefully align the Ethernet Switch Blade in its assigned slot, and slide the blade approximately half-way into the slot. Step 3. Holding the two extraction levers in the fully extracted position (away from each other), carefully slide the Blade into the slot until the backplane connectors properly mate with the 14-Slot Shelf connectors and the blade is fully seated.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade HP bc2100 ATCA Server Blade The HP bc2100 ATCA Server Blade is a hot-swap FRU and can be removed and replaced while the 14-Slot Shelf is fully operational. WARNING Electrical Shock Hazard. System power is applied to the 14-Slot Shelf at all times during this hot-swap removal and replacement procedure. Observe all electrical safety procedures while working on this hot-swap unit.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Figure 1-6HP bc2100 ATCA Server Blade, Front Panel Indicators Step 3. With the blue hot-swap LED illuminated continuously, pull both extraction handles to the full-out (extract) position. This will disengage the HP bc2100 ATCA Server Blade backplane connectors from the 14-Slot Shelf chassis. Step 4. Carefully pull the HP bc2100 ATCA Server Blade out of its mounting slot. CAUTION Possible equipment damage due to excessive heat.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Step 6. Place the removed blade into an anti-static bag. Replacing an HP bc2100 ATCA Server Blade CAUTION Electrostatic Discharge Hazard. Observe all ESD safety precautions while completing this procedure. Failure to follow ESD safety precautions could result in damage to the 14-Slot Shelf and equipment. To replace an HP bc2100 ATCA Server Blade, complete the following steps: Step 1.
Field Replaceable Unit Removal and Replacement Fan Trays Fan Trays The 14-Slot Shelf has three interchangeable cooling fan trays that can be hot-swapped. The fan trays are individual plug-in modules, located at the top rear of the Shelf. WARNING Electrical Shock Hazard. System power is applied to the 14-Slot Shelf at all times during this hot-swap removal and replacement procedure. Observe all electrical safety procedures while working on this hot-swap unit.
Field Replaceable Unit Removal and Replacement Fan Trays Step 3. Raise and hold the fan tray retention lever (#6, Figure 1-7) while pulling on the fan tray extraction handle (#5, Figure 1-7), and then remove the fan tray from the 14-Slot Shelf enclosure. Step 4. Place the fan tray in an anti-static bag. Replacing a Fan Tray To replace a fan tray, complete the following steps: Step 1. Carefully insert the fan tray into the empty fan tray enclosure of the Shelf. Step 2.
Field Replaceable Unit Removal and Replacement Air Filter Air Filter The 14-Slot Shelf air filter is located near the bottom front of the Shelf, immediately beneath the blade slots (Figure 1-8). The air filter is a hot-swap unit, and can be removed and replaced while the 14-Slot Shelf is in normal operation. There is no voltage present at the air filter element and filter tray. WARNING Figure 1-8 Electrical Shock Hazard.
Field Replaceable Unit Removal and Replacement Air Filter To remove the 14-Slot Shelf replaceable air filter, complete the following steps: Step 1. Using the filter tray handle (Figure 1-8), pull the filter tray out, and remove it from the 14-Slot Shelf. Step 2. Remove the air filter element from the filter tray and discard the used element. Replacing the Air Filter To replace the air filter, complete the following steps: Step 1.
Field Replaceable Unit Removal and Replacement Backplane SEEPROM Assemblies Backplane SEEPROM Assemblies The redundant 14-Slot Shelf Backplane SEEPROM Assemblies contain chassis configuration information for the system, and are power-off, cold-swap FRUs. Backplane SEEPROM Assemblies (also known as Chassis Data Modules CDM1 and CDM2) are small, pluggable circuit board modules that are mounted on the back side of the 14-Slot Shelf chassis backplane.
Field Replaceable Unit Removal and Replacement Backplane SEEPROM Assemblies Removing a 14-Slot Shelf SEEPROM Assembly WARNING Electrical Shock Hazard to Personnel. Shut down all normal operation of the 14-Slot Shelf and disable/disconnect all external power supply power domain circuit breakers in the OFF (O) position before completing any of the following steps. Failure to disable/disconnect all power supplied to the 14-Slot Shelf at the power source may result in personal injury.
Field Replaceable Unit Removal and Replacement Backplane SEEPROM Assemblies Step 5. This completes the SEEPROM Assembly replacement procedure. IMPORTANT If replacing a single backplane SEEPROM Assembly, the active ShMM will copy all existing 14-Slot Shelf system configuration information from the remaining installed SEEPROM Assembly into the new replacement SEEPROM Assembly when power is applied to the 14-Slot Shelf.
Field Replaceable Unit Removal and Replacement Alarm Display Panel and Alarm Connector Panel Alarm Display Panel and Alarm Connector Panel This section provides information on removing and replacing the Alarm Display Panel and the Alarm Connector Panel. Removing and Replacing the Alarm Display Panel The Alarm Display Panel is a hot-swap FRU, and is mounted in the upper, left front corner of the Shelf. There is no hot-swap button or blue hot-swap LED for the Alarm Display Panel.
Field Replaceable Unit Removal and Replacement Alarm Display Panel and Alarm Connector Panel Figure 1-10Alarm Display Panel, Front View Step 2. Pull the Alarm Display Panel unit from the chassis and place it in an anti-static bag. Replacing the Alarm Display Panel WARNING Electrical Shock Hazard to Personnel.
Field Replaceable Unit Removal and Replacement Alarm Display Panel and Alarm Connector Panel Step 1. Carefully insert the Alarm Display Panel assembly into its chassis mounting slot, using the fiber board tabs to align the display connector pins (Figure 1-11). Figure 1-11Alarm Display Panel, Rear View Step 2. Ensure that all connector pins are properly aligned with the 14-Slot Shelf receptacle. Using firm pressure, push the Alarm Display Panel into the chassis connector until it is fully seated. Step 3.
Field Replaceable Unit Removal and Replacement Alarm Display Panel and Alarm Connector Panel Figure 1-12Alarm Connector Panel, Front View Step 2. Pull the Alarm Connector Panel assembly from the 14-Slot Shelf chassis and place it in an anti-static bag. Replacing the Alarm Connector Panel WARNING Electrical Shock Hazard to Personnel.
Field Replaceable Unit Removal and Replacement Alarm Display Panel and Alarm Connector Panel Step 2. Ensure that all connector pins are properly aligned with the 14-Slot Shelf receptacle. Using firm pressure, push the Alarm Connector Panel into the chassis connector until it is fully seated. Step 3. Tighten the two retaining thumb screws to secure the Alarm Connector Panel to the chassis.
Field Replaceable Unit Removal and Replacement Cable Trays and Rack Mounting Flange Kit Cable Trays and Rack Mounting Flange Kit This section provides procedures for removing and replacing the 14-Slot Shelf rack mount Flanges, front and rear Cable Trays, and front and rear blade cage Filler Panels. Removing and Replacing Rack Mount Flanges To remove the 14-Slot Shelf rack mounting Flanges, complete the following steps: Step 1.
Field Replaceable Unit Removal and Replacement Cable Trays and Rack Mounting Flange Kit Step 3. Slide each flange down until the flanges drop out of the five retaining clips that hold them to the Shelf. Step 4. Set the flanges aside. To replace the rack mounting flanges, complete the following steps: Step 1. Position each flange over the five retaining clips on each side of the 14-Slot Shelf chassis for either the 4-post or 2-post mounting position, as required for your installation.
Field Replaceable Unit Removal and Replacement Cable Trays and Rack Mounting Flange Kit Figure 1-1514-Slot Shelf, Front View (Blade Cage Empty) 56 Chapter 1
Field Replaceable Unit Removal and Replacement Cable Trays and Rack Mounting Flange Kit Figure 1-16 14-Slot Shelf, Rear View (Blade Cage Empty) Step 2. Set the front/rear cable trays and attaching screws aside. To replace the 14-Slot Shelf front and rear cable trays, complete the following steps: Step 1. Position the front cable tray (upper front of the blade cage, shown as #7 in Figure 1-15), and insert and tighten the tray mounting screws. Step 2.
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane 14-Slot Shelf Chassis and Backplane Although the 14-Slot Shelf chassis with backplane is replaced as an individual FRU, the replacement 14-Slot Shelf chassis and backplane will contain a full complement of chassis components when delivered.
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane Figure 1-17 Chapter 1 14-Slot Shelf, Front View (Blade Cage Empty) 59
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane Figure 1-18 14-Slot Shelf, Rear View (Blade Cage Empty) Step 1. Shut down normal operation of the HP bh5700ATCA 14-Slot Blade Server. Step 2. Disconnect/disable all external input power sources to both PEMs. WARNING Electrical Shock Hazard. Power is always present at the PEM power terminals unless the external power source is turned off.
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane Step 5. Remove all Ethernet Switch Blades, HP bc2100 ATCA Server Blades, and any other installed blades from the 14-Slot Shelf blade cage, and individually set aside in anti-static bags for reinstallation in the replacement chassis (follow removal procedures in “Ethernet Switch Blade” on page 36). Step 6.
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane Step 4. Remove both PEMs from the replacement chassis. Step 5. Remove both Backplane SEEPROM Assemblies from the replacement chassis and set aside for installation into the defective chassis for return to Hewlett-Packard. Step 6. Install the two Backplane SEEPROM Assemblies you removed from the defective chassis into the replacement chassis. Step 7. Reinstall both PEMs in the replacement chassis.
Field Replaceable Unit Removal and Replacement 14-Slot Shelf Chassis and Backplane Step 16. The replacement 14-Slot Shelf should be ready to reapply external power and resume normal operation.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Hard Disk Drive HP bc2100 ATCA Server Blade Hard Disk Drive This section provides information on removing and replacing the HP bc2100 ATCA Server Blade optional Hard Disk Drive and Mounting Kit. Removing the Hard Disk Drive and Mounting Kit CAUTION Electrostatic Discharge Hazard. Observe all ESD safety precautions while completing this procedure.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Hard Disk Drive Figure 1-19 HP bc2100 ATCA Server Blade, Access Cover Panel Installed Step 3. Remove the four countersunk retaining screws that secure the blade cover panel, and remove the access cover panel. Set the cover panel and the retaining screws aside for reuse. Step 4. Locate the HDD (Figure 1-20), and remove the four pan head screws holding the HDD mounting brackets to the blade circuit board stand-off posts.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Hard Disk Drive Figure 1-20 HDD Installed Step 5. Grasp the HDD body with one hand, holding it at the mounting brackets. Gently pull the HDD body away from the blade SAS connector (in the opposite direction of the arrows in Figure 1-21) to disconnect the HDD.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Hard Disk Drive Figure 1-21Hard Disk Drive (Arrows Point Toward the Blade Connector) Step 6. When disconnected from the blade SAS connector, lift the HDD body with mounting brackets free of the HP bc2100 ATCA Server Blade circuit board. Step 7. Lay the HDD body with mounting brackets flat on an ESD-safe working surface with the circuit-side facing upward (see Figure 1-22).
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Hard Disk Drive Replacing the Hard Disk Drive and Mounting Brackets CAUTION Electrostatic Discharge Hazard. Observe all ESD safety precautions while completing this procedure. Failure to follow ESD safety precautions could result in damage to the 14-Slot Shelf and equipment. To replace the HDD and HDD mounting brackets, complete the following steps: Step 1.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Memory Modules HP bc2100 ATCA Server Blade Memory Modules The HP bc2100 ATCA Server Blade Dual In-line Memory Modules (DIMMs) are always installed in pairs. Each individual DIMM within a pair must be identical in storage capacity, revision, and part number to the other DIMM in the pair.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Memory Modules 3. Remove the countersunk retaining screws that secure the blade access cover panel and remove the cover panel. Set the cover panel and the retaining screws aside for reuse. 4. To release a DIMM from its socket, pull the two locking levers outward and away from the DIMM ends (see Figure 1-25). Figure 1-25 Unlocking DIMM in DIMM Socket U5 5. Remove the DIMM by angling it away from the socket and gently pulling on it.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Memory Modules IMPORTANT DIMM sockets U5 and U12 must always be populated (Figure 1-26). Figure 1-26 DIMM Socket Assignments If you are only loading one DIMM pair, load the first DIMM into socket U12. This will ease the installation of the second DIMM into socket U5. If you are populating all four DIMM sockets, load the DIMMs in this order: Socket U17, U15, U12, and then U5. 1. Angle the selected DIMM over the empty socket. 2.
Field Replaceable Unit Removal and Replacement HP bc2100 ATCA Server Blade Memory Modules 72 Chapter 1
2 LED Indicators Chapter 2 73
LED Indicators Introduction Introduction Many of the components of the HP bh5700 have Light Emitting Diodes (LEDs) that are a visual diagnostic for the health of the server. The usual colors used for LEDs are amber (yellow), green, red, and blue. Each component has a variety of LEDs that have different server health meanings. Listed are the LEDs for the Ethernet Switch Blade, the 14-slot Shelf Manager (ShMM), and the HP bc2100 ATCA Server Blade.
LED Indicators Ethernet Switch Blade Ethernet Switch Blade The following information covers normal and abnormal LED activity.
LED Indicators Ethernet Switch Blade Normal LED Activity Figure 2-1 76 Ethernet Switch blade LED Display Chapter 2
LED Indicators Ethernet Switch Blade During a successful boot, the following sequence of LED activity should be observed: 1. All LEDs are off as the board is released from reset (Initial insertion into chassis, or pressing the front panel reset button). 2. IPMC turns on the OUT OF SERVICE and HEALTHY LEDs on the front panel. 3. HEALTHY LED turns green when the IPMC is up and running. 4.
LED Indicators Ethernet Switch Blade Abnormal LED Activity Use the table below as a guide to troubleshoot the LED indicators after the Ethernet Switch Blade has been powered up in the chassis for at least 60 seconds and allowed to go through the normal boot process. Table 2-1 translates the LED indicators on the front panel of the Ethernet Switch Blade into a Tip number. Use the tip number listed in Table 2-1 to find a solution in Table 2-2.
LED Indicators Ethernet Switch Blade Table 2-2 Solution Table for LED Indicators Tip 1 Problem No power Solution 1. Examine other blades in the front card cage. If all blades indicate no power, verify that the power is applied to the chassis and that the Power Entry Modules are operational. 2. Verify that the Switch is fully inserted into the front card cage. If not, fully seat the switch. 3. Verify that the switch is in either chassis slot 7 or 8. If in another slot, move it to either slot 7 or 8. 4.
LED Indicators Ethernet Switch Blade Table 2-2 Tip 3 Solution Table for LED Indicators (Continued) Problem Solution Voltage or temperature sensors are outside of threshold 1. Examine the other Blades in the Shelf. If all of the healthy LEDs are illuminated red, there may be a power failure on one of the two redundant power busses. Check that both power busses to the ATCA are on (energized) and check both Power Entry Modules as described in the PEM section. 2.
LED Indicators Ethernet Switch Blade Table 2-2 Tip 7 Solution Table for LED Indicators (Continued) Problem Board held in reset Solution 1. The Shelf Manager is holding the Switch in RESET. Follow the troubleshooting tips in "Diagnosing a failed Shelf Manger Activation" in the Ethernet Switch Blade Manual to extract the Shelf Manager information for further analysis. 2. If, after troubleshooting the Shelf Manager information the Switch is still being held in RESET, replace the Switch.
LED Indicators 14-Slot Shelf 14-Slot Shelf This section provides LED indicator and LED connection information on the HP bh5700 Telco Blade 14-Slot Shelf and its components. Shelf Alarm display The Alarm Display Panel (ADP) is located at the right front top side of the Shelf. It provides the Telco Alarm connector (DB15-male). The I²C-bus devices on the ADP are connected to the Master-Only I²C-bus of both Shelf Managers. Only the active Shelf Manager has access to the SAP.
LED Indicators 14-Slot Shelf Figure 2-2 Alarm Display Panel The LEDs USER (1, 2, 3) are user definable and connected to the I²C-bus I/O port of the PCA 9555 on the ADP. Troubleshooting the Shelf Alarm Display Panel The Shelf Alarm Display Panel has a low failure rate.
LED Indicators 14-Slot Shelf Table 2-3 Shelf Alarm Display Panel LED Indicators (Continued) User LED 2 (Green) User LED 3 (Amber ) Off Off Off 2 - - - - 3 - - - - - 4 Yellow - - - - - 5 - - Red - - - - 6 - - - - - Red - - 7 - - - - - - Green - 8 - - - - - - - Amber 9 Fan Tray Alarm (Red) Fan Tray OK (Green) Critical Alarm (Red) Major Alarm (Amber ) Minor Alarm (Yellow ) Off Off Off Off Off Red - - - - Amber - - - - Table 2-4 Ti
LED Indicators 14-Slot Shelf Table 2-4 Tip 2 Shelf Alarm Display Panel LED Tips (Continued) Symptom All LEDs are off Action 1. Check the Power Entry Module for proper operation. 2. Check the Shelf Alarm Display for proper seating. 3. Cycle both primary power busses to the system at the same time. During the initial boot cycle all 12 LEDs should turn on during the LED check cycle. If all LEDs turn on the Shelf Alarm Display is good. Look for other error conditions and LEDs 4.
LED Indicators 14-Slot Shelf Connection Between the Shelf Manager and ADP The following figures show the connection between the Shelf Manager and the ADP, and block diagrams of the Shelf Manager and the ADP (Figure 2-3, Figure 2-4, and, Figure 2-5).
LED Indicators 14-Slot Shelf Figure 2-4 Chapter 2 Alarm Display Panel Block Diagram (1) 87
LED Indicators 14-Slot Shelf Figure 2-5 Alarm Display Panel Block Diagram (2) Shelf Alarm Display Horizontal Board Connector Figure 2-6 shows the connector for the Shelf Alarm Display. Table 2-5 lists the pin assignments for the SAD.
LED Indicators 14-Slot Shelf Figure 2-6 Alarm Display Panel Horizontal Board Connector Table 2-5 Alarm Display Panel Horizontal Board Pin Assignment Pin Signal Name Description 1 GND Ground 2 GND Ground 3 GND Ground 4 RXD0_ACB1 Receive Data to primary Shelf Manager 5 RXD0_ACB2 Receive Data to secondary Manager 6 TXD0_ACB1 Transmit Data from primary Shelf Manager 7 TXD0_ACB2 Transmit Data from secondary Shelf Manager 8 DSR_ACB1 Data Set Ready to primary Shelf Manager 9 DSR_AC
LED Indicators 14-Slot Shelf Table 2-5 Alarm Display Panel Horizontal Board Pin Assignment (Continued) Pin Signal Name Description 22 RST_PB Signal from Alarm Cutoff Push Button 23 LED_FT-FAIL_1 Signal to Fan Tray Left Alarm LED 24 LED_FT_FAIL_2 Signal to Fan Tray Center Alarm LED 25 LED_FT_FAIL_3 Signal to Fan Tray Right Alarm LED 26 LED_FT_OK_1 Signal to Fan Tray Left OK LED 27 LED_FT_OK_2 Signal to Fan Tray Center OK LED 28 LED_FT_OK_3 Signal to Fan Tray Left OK LED Alarm Displ
LED Indicators 14-Slot Shelf The push button only activates the Alarm Cutoff (ACO) state, but does not clear the alarm completely. NOTE Telco Alarm Connector on the Shelf Alarm Panel The Shelf Alarm Panel (SAP) is a user interface that provides a Telco Alarm connector for troubleshooting.
LED Indicators 14-Slot Shelf Troubleshooting the Shelf Alarm Panel The Shelf Alarm Panel is a pass-through device with no logical circuits. Problems with the SAP are generally mechanical damage to the Telco Alarm Connector or the connector between the Shelf Alarm Panel and the Horizontal Board. Table 2-8 Tip 1 Shelf Alarm Panel Troubleshooting Symptom Action Communicat ion problems to the Shelf thorough the SAP connector 1.
LED Indicators 14-Slot Shelf Table 2-9 Pin Alarm Connector Panel Horizontal Board Connector Pin Assignment Signal Name Description 3 LED_USER3 Signal to User definable LED3 on Shelf Alarm Display 4 LED_MIN Signal to Minor Alarm LED on Shelf Alarm Display 5 LED_MAJ Signal to Major Alarm LED on Shelf Alarm Display 6 LED_CRIT Signal to Critical Alarm LED on Shelf Alarm Display 7 RST_PB Signal from Alarm Cutoff Push Button on Shelf Alarm Display 8 GND Ground 9 SCL_CH0 Serial Clock Master
LED Indicators 14-Slot Shelf Figure 2-9 94 Fan Tray, Front and Rear View Chapter 2
LED Indicators 14-Slot Shelf Figure 2-10 Fan Tray Number Scheme Table 2-10 Fan and Fan Tray Numbering Scheme 0 Fan #0 4 Fan #4 8 Fan #8 1 Fan #1 5 Fan #5 10 Fan Tray #0 (Left) 2 Fan #2 6 Fan #6 11 Fan Tray #1 (Center) 3 Fan #3 7 Fan #7 12 Fan Tray #2 (Right) Fan Tray Signals The Fan Tray provides signals for: • Voltage monitoring • Switching off the fans • Status of the DC/DC converter Chapter 2 95
LED Indicators 14-Slot Shelf These signals are controlled by the PCA9555 I²C device on the Fan Tray PCB. The Shelf Manager has access to these signals via Channel 3 of the Master-Only I²C-bus. The fan signals are shown in Table 2-11, and Figure 2-11. Table 2-11 Fan Tray Signals Signal Description -48 V_A bus voltage Indicates the presence of the -48 V_A/VRTN_A at the Horizontal Board Connector. -48 V_A fan tray voltage Indicates the presence of the -48 V_A/VRTN_A after the fan tray’s main fuse.
LED Indicators 14-Slot Shelf Fan Tray Temperature Sensor The temperature sensors (LM75) in the Fan Trays measure the exhaust temperatures of the Shelf. The temperature sensors are connected to Channel 3 of the Master-Only I²C-bus. Fan Tray Control Board SEEPROM The SEEPROM on the Fan Tray control board stores the FRU data and is connected to Channel 3 of the Master-Only I²C-bus.
LED Indicators 14-Slot Shelf Table 2-13 Fan Tray Backplane Connector Pin Assignment (Continued) Pin Number Signal Pin Number Signal 9 FAN_TK2 21 -48 V_B 10 FAN_TK1 22 -48 V_A 11 FAN_Speed 23 FAN_24V 24 FAN_PWR_RTN 12 Troubleshooting Fan Trays Table 2-14 Hot-Swap (Blue) Fan Tray LED Indicators Ran Tray Alarm (Red) Fan Tray OK (Green) Tip - - Green Normal Blue Red Green 1 - - - 2 - Red - 3 Flashing Blue - - 4 Blue - - 5 Table 2-15 Tip 1 Fan Tray LED Indicator
LED Indicators 14-Slot Shelf Table 2-15 Tip 2 Fan Tray LED Indicator Tips (Continued) Symptom No Fan Tray indicators on. Action 1. Check the Fan Tray for proper seating. 2. Verify that the Fan Tray is installed right side up. The Extraction Handle be down. 3. Remove Fan Tray and examine the Horizontal Board Connector on the Fan Tray for bent or broken pins. If pins are bent or damaged, replace the Fan Tray. 4.
LED Indicators 14-Slot Shelf To indicate the presence of the PEM, a PEM presence signal is grounded by the PEM body. A Blue Hot-Swap LED and a Hot-Swap Push Button provide Hot-Swap functionality. A red (power failure) and a green (OK) LED provide status indication. Each of the four redundant power-feeds supply power to a separate part of the ATCA Backplane.
LED Indicators 14-Slot Shelf Figure 2-14 PEM Block Diagram PEM I/O Device The PEM I/O device (PCA9555): • Controls the status of the LEDs. • Reads the status of the Hot-Swap push button. • Reads the status of the -48 VDC inputs. Table 2-17 PCA9555 I/O Pin 0.0 Chapter 2 PEM PCA Pin Assignments Function Power input 2 Backplane connector present State -48 V present = 0 -48 V absent = 1 (3.
LED Indicators 14-Slot Shelf Table 2-17 PEM PCA Pin Assignments (Continued) PCA9555 I/O Pin Function 0.1 State Power input 2 behind the fuse present -48 V present = 0 Power input 1 at Backplane connector present -48 V present = 0 Power input 1 behind the fuse present. -48 V present = 0 0.4 N/C Pulled High 0.5 N/C Pulled High 0.
LED Indicators 14-Slot Shelf Table 2-18 Power Entry Modules LED Indicators (Continued) PEM Alarm (Red) Hot Swap (Blue) PEM OK (Green Tip - - - 2 - Red - 3 Flashing Blue - - 4 Blue - - 5 Table 2-19 Tip 1 Power Entry Module Troubleshooting Actions Symptom LED Test Action 1. The LEDs are tested when primary power is first applied to the chassis by turning on all LEDs, regardless of the true status, for a period of up to 15 seconds. 2.
LED Indicators 14-Slot Shelf Table 2-19 Power Entry Module Troubleshooting Actions (Continued) Tip 5 Symptom Blue LED on solid Action PEM is ready for hot-swap. PEM may be safely removed from the chassis. Shelf Manager LEDs Front Panel Components Figure 2-15 Shelf Manager Front Panel Components Ethernet Channels The Shelf Manager Module provides two 10/100 Ethernet interfaces.
LED Indicators 14-Slot Shelf Hardware Redundancy Interface The Shelf Manager supports redundant operation with automatic switchover using redundant Shelf Managers. In a configuration where two Shelf Manager are present, one acts as the active Shelf Manager and the other as a standby. The Shelf Managers monitor each other and either can trigger a switchover if necessary.
LED Indicators 14-Slot Shelf Hot-Swap LED The Shelf Manager provides a blue Hot-Swap LED. This LED indicates when it is safe to remove the Shelf Manager from a powered Shelf. Table 2-20 Hot-Swap LED LED State Condition Off The Shelf Manager is not ready to be removed/disconnected from the Shelf. Blue The Shelf Manager is ready to be removed/disconnected from the Shelf. Long Blink The Shelf Manager is activating itself. Short Blink Deactivation has been requested.
LED Indicators 14-Slot Shelf Table 2-21 ETH1 Speed Shelf Manager Status LEDs (Continued) ETH1 Activity (Green) ETH0 Speed (Yellow) ETH0 Activity (Green) Shelf Manager Status (Red) Shelf Manager Status (Green) Hot Swap (Blue) Tip - - - - Red - - 4 - - - - - - Flashing Blue 5 - - - - - - Blue 6 Table 2-22 Tip 1 Shelf Manager Troubleshooting Tips Symptom LED test Action 1. Does the shelf Manager have an LED test? 2.
LED Indicators 14-Slot Shelf Table 2-22 Tip 4 Shelf Manager Troubleshooting Tips (Continued) Symptom Something is seriously wrong with the Shelf Manager and the Shelf Manager was able to indicate the failure. Action 1. Observe the other Shelf Manager to determine if it is in the Normal Active mode. 2. Press the reset button on the Shelf Manager exhibiting the problem using a pen or paperclip. 3.
LED Indicators HP bc2100 ATCA Server Blade HP bc2100 ATCA Server Blade This section provides information on the HP bc2100 ATCA Server Blade front panel hard reset button and the LED indicators.
LED Indicators HP bc2100 ATCA Server Blade Figure 2-17 110 HP bc2100 ATCA Server Blade Front Panel Chapter 2
LED Indicators HP bc2100 ATCA Server Blade Out of Service (OOS) LED The HP bc2100 ATCA Server Blade supports one bi-color Out of Service (OOS) LED, mounted on the front faceplate. The OOS LED is identified by the following graphical legend on the faceplate, as shown in Figure 2-17: The OOS LED can be driven to display a red or amber color. When this LED is lit, it indicates that the board is not in service. Its back-end (payload) power could be OFF or ON.
LED Indicators HP bc2100 ATCA Server Blade The LED’s error state color default to red, but the color can be overridden using PICMG 3.0-defined commands. NOTE Table 2-24 Health LED LED Status Meaning Solid Green Healthy Solid Amber/Red Fault or error condition The default color and override capabilities of the LED follow the LED management requirements defined in Section 3.2.5 of the PICMG 3.0 Specification.
LED Indicators HP bc2100 ATCA Server Blade The Gigabit Ethernet (GbE) Network Speed activity LEDs are identified by the following graphical legends located on the Server Blade front panel: Network Speed activity LED states are defined as shown in Table 2-27.
LED Indicators HP bc2100 ATCA Server Blade The user-programmable LEDs are connected to the GPIO pins on the ICH device, as shown in Table 2-29. Table 2-29 GPIO Pin Connections LED Pin User_Prog_LED1_Red# GPIO20 User_Prog_LED1_GRN# GPIO21 User_Prog_LED2_Red# GPIO28 User_Prog_LED2_GRN# GPIO23 Ethernet Controller Port State (Base Ethernet Activity/Link) LEDs The front panel of the HP bc2100 ATCA Server Blade provides a bi-color LED for each Ethernet channel that indicates the Ethernet port state.
LED Indicators HP bc2100 ATCA Server Blade The CMM or higher level software can reject the request to move to the M6 state. If this occurs, the Hot-Swap LED returns to a solid off condition, indicating that the HP bc2100 ATCA Server Blade has returned to M4 state.
LED Indicators HP bc2100 ATCA Server Blade Table 2-32 HP bc2100 ATCA Server Blade LED States (Continued) Healthy (Red / Green) Hot Swap (Blue) Out of Service (Red) Tip Off Red - 3 - - Red 4 Table 2-33 Tip 1 HP bc2100 ATCA Server Blade LED Tips Symptom No Power Action 1. Examine other blades in the front card cage. If all Blades indicate no power, verify that the power is applied to the chassis and the Power Entry Modules are operational 2.
LED Indicators HP bc2100 ATCA Server Blade Table 2-33 Tip 3 HP bc2100 ATCA Server Blade LED Tips (Continued) Symptom Voltage or temperature sensors are outside of threshold Action 1. Examine the other Blades in the Shelf. If all of the Healthy LEDs are illuminated red, there may be a power failure on one of the two redundant power busses. Make sure both power busses to the ATCA are on and Check both Power Entry Modules for proper operation. 2.
LED Indicators Shelf Manager Redundant Operation Shelf Manager Redundant Operation The active Shelf Manager exposes the ShMC device (address 20h) on IPMB, manages IPMB and the IPM controllers, and interacts with the System Manager over RMCP and other shelf-external interfaces. It maintains an open TCP connection with the backup Shelf Manager. It communicates all changes in the state of the managed objects to the backup Shelf Manager.
3 Diagnostics Chapter 3 119
Diagnostics Introduction Introduction The HP bh5700 ATCA 14-Slot Blade Server component diagnostics are to be run in offline mode. The operating system must not be loaded and the component to be tested must not be in operation. The diagnostics described in this document are intended to be used by trained Hewlett-Packard field engineers or Network Equipment Providers (NEPs). CAUTION Entering these diagnostic commands on a running system can cause the system to halt operations with data loss.
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) General ATCA Diagnostics for the Shelf Management Module (ShMM) The ATCA specification defines provisions for diagnosing ATCA components through the Shelf Management Module, or “Shelf Manager” (ShMM). The ShMM functionality includes a command line interface (clia) that can provide detailed diagnostic and management functions for all other configured ATCA components within the HP bh5700 ATCA 14-Slot Blade Server.
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) Table 3-1 ShMM Diagnostic Commands (Continued) Command Function shelf Display information about various shelf subsystems. showunhealthy Show all unhealthy FRUs. board [physical slot #] board is used to print status and information about a board in the chassis. The command displays the Device ID String, the How Swap State, and the reason the previous Hot Swap State change was made.
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) debuglevel [mask] The debuglevel command raises or lowers the debug level for the ShMM. If the status of the ShMM is suspect, raising the debug level may assist in debugging the problem. The default debug level is 0x0007 (Errors, Warnings, and Informational Messages).
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) Hot Swap State: M4 (Active), Previous: M3 (Activation In Process), Last State Change Cause: Normal State Change (0x0) Device ID String: "IPM Sentry ShM" 12: FRU # 0 Entity: (0xf0, 0x60) Hot Swap State: M7 (Communication Lost), Previous: M4 (Active), Last State Change Cause: Unknown (0xf) Device ID String: "IPM Sentry ShM" 20: FRU # 0 Entity: (0xf0, 0x1) Hot Swap State: M4 (Active), Previous: M3 (Activation In Process), Last State
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) Manufacturer Name = ZNYX Networks Product Name = ZX7120-HP Product Part / Model# = 700-0174-002 Product Version = Product Serial Number = 01CS00111HWP Asset Tag = FRU Programmer File ID = 03 Multi Record Area: PICMG Board Point-to-Point Connectivity Record (ID=0x14) Version = 0 PICMG Board Point-to-Point Connectivity Record (ID=0x14) Version = 0 CLI> frucontrol The frucontrol command sends
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) Type: Threshold (0x01), "Temperature" (0x01) Lower Non-Critical Threshold, Raw Data: 0x0a, Processed Data: 10.000000 degrees C Lower Critical Threshold, Raw Data: 0x05, Processed Data: 5.000000 degrees C Lower Non-Recoverable Threshold, Raw Data: 0x00, Processed Data: 0.000000 degrees C Upper Non-Critical Threshold, Raw Data: 0x28, Processed Data: 40.000000 degrees C Upper Critical Threshold, Raw Data: 0x3c, Processed Data: 60.
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) 0x0089: Event: at Jan 10 02:12:33 1970; from:(0x88,0,0); sensor:(0x22,14); event:0x6f(asserted): 0x0C 0xFF 0xFF 0x008A: Event: at Jan 10 04:30:24 1970; from:(0x88,0,0); sensor:(0x1b,10); event:0x3(asserted): 0x01 0xFF 0xFF 0x008B: Event: at Jan 10 04:30:29 1970; from:(0x88,0,0); sensor:(0x1b,10); event:0x3(asserted): 0x00 0xFF 0xFF 0x008C: Event: at Jan 10 04:30:29 1970; from:(0x88,0,0); sensor:(0x22,14); event:0x6f(asserted): 0x0C
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) Assertion Mask: 0x0001 System Firmware Error Deassertion Mask: 0x0001 Settable / Readable Mask: 0x0001 System Firmware Error sensordata [ [[lun: ]] The sensordata command displays the current value for a sensor.
Diagnostics General ATCA Diagnostics for the Shelf Management Module (ShMM) • BDSelGrounded 1/0 (1 means Enabled, 0 means Disabled) • PwrCapability • PwrDelay <10ths_of_second> • Allowance • PwrReorder before/after • info_refresh • info_force_update showunhealthy The showunhealthy command displays information on failing or substandard FRUs in the system.
Diagnostics Shelf Manager (ShMM) Functional Specifications Shelf Manager (ShMM) Functional Specifications . ShMM Diagnostics Functionality and Features The ShMM has three levels of diagnostics. The first level runs during Power-On-Self-Test (POST), and runs a self-check on the board. The results of this test are available using the dmesg command from the ShMM’s Linux operating environment. The second level is a CPU error-monitoring daemon that runs in the background and reports all CPU errors to the SEL.
Diagnostics Shelf Manager (ShMM) Functional Specifications • Manually executed tests The categories that are run and when the tests run are controlled by the U-boot environment variables post_normal and post_poweron. These variables are accessible from the pre-boot firmware menu, over a serial console connection. The variables are described below: post_normal (U-boot environment variable) post_normal determines the list of POST tests that are run on each boot-up.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface Shelf Manager (ShMM) Diagnostics User Interface ShMM Diagnostics Overall Interface Description This section describes the ShMM diagnostics interface. Power-On Self Tests Power-On Self-Test (POST) diagnostics run at power-on and during boot.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface > mem02 > sleep 5 > done Additional information on the memory tests is found in the section “Memory tests” on page 136. CPU Error Daemon The CPU error monitoring daemon runs in the background and reports CPU errors to /var/log/messages. It is part of the ShMM standard operating environment.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface Online Tests The online tests discussed in this section are available in the /bin directory. If these utilities do not exist in the /bin directory on the ShMM, please contact Hewlett-Packard support for assistance. shmm_uart_test [-e] NOTE The external test (-e) requires a loopback cable. Before running the test verify that the loopback cable is properly installed.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface The shmm_eth_test checks the ethernet interfaces on the ShMM. Three tests are run to verify proper operation of the interfaces: internal checks, PHY and ARP tests. The internal and PHY tests are run automatically and the ARP test is run when the -a flag is used. When the –a argument is added with a valid IP address, the ARP test will send ARP packets to the target IP address and wait for a response. Two Ethernet ports are exercised by this test.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface For example: # shmm_crc_test Performing CRC Checksum Tests... Testing MTD IMAGE 0: FAILURE: Unknown type of image. Testing MTD IMAGE 1: FAILURE: Unknown type of image. Testing MTD IMAGE 2: PASSED. Testing MTD IMAGE 3: PASSED. Memory tests Online memory tests are not installed on the system delivered from the factory.
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface # mem01 Free Mem: 35 Mb Free Swap: 0 Mb Total Free: 35 Mb Total Tested: 35 Mb mem01 0 INFO : touching 35MB of malloc'ed memory (linear) mem01 1 PASS : malloc - alloc of 35MB succeeded : touching 35MB of malloc'ed memory (random) # mem01 -r -v Free Mem: 35 Mb Free Swap: 0 Mb Total Free: 35 Mb Total Tested: 35 Mb mem01 0 INFO .................................................................................... ..........
Diagnostics Shelf Manager (ShMM) Diagnostics User Interface Table 3-2 CPU Error Types (Continued) Exception Description TLBL TLB exception (load or instruction fetch) TLBS TLB exception (store) AdEL Address error exception (load or instruction fetch) AdES Address error exception (store) Sys Syscall exception BP Breakpoint exception RI Reserved instruction exception CpU Coprocessor Unusable exception Ov Arithmetic Overflow exception Tr Trap exception NOTE Many of these exceptions can
Diagnostics Ethernet Switch Blade Functional Specifications Ethernet Switch Blade Functional Specifications Ethernet Switch Blade Diagnostics Features There are no executable diagnostics for the ethernet switch blade.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications HP bc2100 ATCA Server Blade Functional Specifications HP bc2100 ATCA Server Blade Diagnostics Features The diagnostics are based on the Intel® Modular Test Architecture (Intel® MTA) for Linux*, an execution environment and framework for Intel® architecture (IA) platforms running on a Linux Service OS. This framework consists of test executives and test modules written as Linux applications.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications Server Blade Diagnostics Valid Configurations The diagnostic service OS is delivered in the form of an ISO image. The following are the recommended options for deploying the ISO image: PXE Boot The ISO image is downloaded onto the PXE server and the compressed kernel and RAMdisk image (vmlinuz and initrd.gz) are extracted into the tftpboot directory. The diagnostics package is integrated with the Service OS.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications 2. Choose the correct default device for the selected boot option. Available options for the Default PXE device will be based on which ports were enabled for PXE in the PCI configuration screen. The user may select a specific PXE-enabled port to be used for diagnostic boot based on their support configuration. Refer to Figure 3-2.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications Selecting the Diagnostic Boot Sequence When the default device(s) to be used for diagnostic boot have been configured, the Server Blade will respond to IPMI requests to set the system boot options and boot from one of the configured diagnostic boot devices. The following table summarizes the IPMI commands to invoke the one-time diagnostic boot sequence.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications There are two default users in the Service OS, “root” and “bdp”. It is recommended to use “bdp” as the user when running the diagnostics and other operations. The following are the default passwords for these users:. Refer to “Customizing the Service OS ISO Image” on page 192 for information on customizing the Service OS.
Diagnostics HP bc2100 ATCA Server Blade Functional Specifications The script modifies the ifcfg file for the base interface. The system name can be generated from the system hardware address (that is, the MAC address can be read by running ifconfig and reading the output). By default, the host name is set to bdp-XX-XX-XX-XX-XX-XX, where XX-XX-XX-XX-XX-XX denotes the MAC address of the “eth0” Ethernet interface. Figure 3-3 shows the operation of the network configuration agent.
Diagnostics HP bc2100 ATCA Server Blade User Interface HP bc2100 ATCA Server Blade User Interface HP bc2100 ATCA Server Blade Diagnostics Interface Description The diagnostics package (referred to as the “Board Diagnostics Package (BDP)” from this point forward) is provided as an ISO image. The ISO image is a bootable CD image that can be installed in the production environment to boot using various boot methods.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI BDP CLI The BDP Command Line Interface (CLI) can be accessed by logging into the Service OS using telnet or ssh. To run the diagnostics, use “bdp” as the user name (default password “bdp”). CLI Usage This section describes the various command line options available in the BDP CLI utility.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-5 BDP Command Options (Continued) Option Description -n Specifies the number of times the test will be repeated. If 0 run forever. The default is 1. -o Redirect all output to the specified file. -rv Redirect verbose output to the specified files. -re Redirect error messages to the specified file. -v Display BDP and MTA version number.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI [n]:preferred Where n is the configuration number. NOTE You must create a new configuration “preferred” as described below for this new configuration to work properly. The BDP application uses /opt/intel/bdp/MPCBLXXXX/Configuration/bdp_MPCBLXXXX.pkg to load the configuration data. The configurations are listed by name in this file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI NOTE • If you are having problems viewing “t” command output, use the -BG white option for a white terminal background or the -BG black option for a black terminal background. Import all module configuration files (.snp extension) into a new package file (default name: “t.pkg”). ./t -autoi • To import only one module configuration file, type the name of the file after the “autoi” option: ./t -autoi lxCPU.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI NOTE If the current directory is not included in the “PATH” environment variable, replace all “t” references with “./t”. The test directory for the Service OS is /opt/intel/bdp/common/iMTA-1.5/ Both runlist and flags are optional. If “t” is run with no command line arguments, all enabled tests are executed in the order they appear in the module definition list.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Otherwise lists “-”. The second column shows the test, flow or group number that can be used on the command line. The third column shows the name of a test, flow or group. The following are some examples of running “t” (assuming a t.pkg file exists from previous steps): Table 3-7 t Command Examples Command Line Description t Run all test in main section of package file. t 2 Run all test of “memory” device in package file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-8 t Command Line Interface Flags (Continued) Flag Range Description Valid SNP file name in current directory Imports SNP file(s) into the package file. If it is used with the PKG flag, it imports files into the specified PKG file. Otherwise, it uses the default PKG file. If the file already exists, the user is given the option to override the current file or append to it.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-8 t Command Line Interface Flags (Continued) Flag Range Description /PC [configname] Any valid string. It must match the configuration name in the package file. Whether running tests or displaying a list of tests, it refers to the indicated “configname” section of the package file. /PKG filename Any valid Linux filename Use the specified file name as the test package file. The default is “T.PKG”.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-8 Flag t Command Line Interface Flags (Continued) Range Description /V [maskvalue] 32-bit integer. Can be negative. Maskvalue is a 32-bit value that is bitwise ANDed with the VerboseMask parameter of any Verbose Test Library function calls the tests make. If the result is non-zero for a particular call, the calls message is displayed. The default maskvalue is zero.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI t Summary File The summary file (by default called summary.txt) is a text file that records the cumulative test results for all invocations of the Test Executive. In the file, there is a line for each test that summarizes the number of passes, failures, and errors that each test has reported. The format of the file is as follows: ---Module.Subtest--------------------Pass Fail Errors Last Updated:: Modulename.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI 3. Add the Server Blade host to the X-server’s access control list. $ xhost +[IP ADDRESS OF Server Blade] 4. Use telnet or ssh to login to the Server Blade. The default user name is “bdp”; the default password is “bdp”. 5. Once logged into the Server Blade, set the DISPLAY environment variable: -bash-3.0# export DISPLAY=[IP ADDRESS OF X-SERVER] 6. Now, change directory to iMTA-1.5 and start the XTC program: -bash-3.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI • Overivew • Basic Concepts • Document File Types Overview XTC is short for “X-Windows Test Control” and is the graphical user interface for the Intel® Modular Test Architecture for Linux. XTC tests are organized into modules to accommodate the wide range of system configurations available. A group of test modules configured for a system is called a test package.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Basic Concepts This section describes all of the basic components of a test package and the overall concepts of XTC which include: • Test Package • Autoconfig Template • Module • Device • Test • Parameter • Test Flow Groups • Test Flow Tree • Configuration Tree • Message Types • Summary File Test Package A test package is a collection of configurations.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI • lxModule - The executable file containing the test code. • lxModule.hlp - A help file describing all the tests, parameters and error messages. • lxModule.snp - A configuration tree branch representing the tests and parameters. • *.so.* - Any required libraries. The name of the module in the configuration tree does not have to be the same as the file name. Device Tests within modules are logically grouped by devices.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Message Types Various message types can be sent from test modules. Each message type can be individually filtered from the message window. Each type can also be redirected to a separate file. The following are the supported message descriptions and corresponding color code: • Error messages are sent when a failure condition is detected by the test or the test executive.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI User Interface The User Interface consists of the following major components. • Test Modules Pane. • Test Sequence Pane. • Message Redirection Pane. • Test Message Pane. • Test Summary Pane. • Property Dialog Boxes. • Menus and Toolbars. • Status Bar. • Right-Click Menu. Test Modules Pane The Test Modules pane, shown in Figure 3-5, holds a configuration’s supported tests.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-9 Test Modules Pane Button Descriptions (Continued) Run autoconfigure on all devices in the template configuration, and create a new configuration. If there is a flow in the template configuration, the flow is copied to the new configuration. Run autoconfigure on all devices in the current configuration, and create a new configuration.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-10 Test Sequence Pane Button Descriptions (Continued) Jump to the next failing test node in the flow tree. This is handy for large flows that are collapsed. This button expands all nodes leading to the first failing test. If the flow has multiple failing nodes, clicking it again jumps to the next failed node.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI The Test Message pane, shown in Figure 3-8, displays messages of different types. Figure 3-8 Test Message Pane The buttons on the test message pane are described in Table 3-11. Table 3-11 Test Message Pane Button Descriptions Expand this window vertically. Restores back to the original height if pressed again. Save all of the current messages into a file. Search dialog box. Verbose mask dialog box.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-11 Test Message Pane Button Descriptions (Continued) Turn advisory messages on or off. Turn error messages on or off. NOTE Turning status, advisory, and error messages off does not affect message redirection to a file. Turning verbose messages off does affect message redirection. Test Summary Pane The Test Summary pane, shown in Figure 3-9, shows the status of the various tests.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-12 Test Summary Pane Button Descriptions (Continued) The number of times the test passed The number of times the test failed. The first word is the number of test library calls made. The second word is for the module developer’s use. How far along the test is. Property Dialog Boxes Double-click a node to pop up a “property” dialog box.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Double-click Figure 3-10 to display the device node dialog. Device Node Table 3-13 describes the fields for the device node dialog. Table 3-13 Device Node Field Descriptions Name This field is the name of the device. Device names are not editable from a package file. Binary This field is the name of the binary. The binary name and the device name do not have to match.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Double-click Figure 3-11 to display the test node dialog. Test Node Dialog Table 3-14 describes the fields for the test node dialog. Table 3-14 Test Node Field Descriptions. Field Description Name The test name. Test names are Binary The binary name of the module in which the subtest belongs. Number The subtest number. Timeout This timeout overrides the device's timeout.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-14 Test Node Field Descriptions. (Continued) Field Description Disabled The test will not run. By default, destructive tests should be disabled. Requires The test requires external hardware or special setup before it can run. Flowgroup Node Dialog Box Double-click Figure 3-12 or to display the Flowgroup node dialog. Flowgroup Node Dialog Table 3-15 describes the fields for the flowgroup node dialog box.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-15 Flowgroup Node Field Descriptions (Continued) Field Description On Error Action to take on error: Continue: Errors do not interrupt test flow. Halt: All tests are aborted on 1st error. Param Node Dialog Box Double-click to display the param node dialog. Figure 3-13 Param Node Dialog Table 3-16 describes the fields in the param node dialog box.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Double-click Figure 3-14 and click the Param View tab to view the parameters that a test is using. Param View Tab The Param View tab provides the list of the parameters that a particular test receives, and also shows where each parameter resides in the package file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Menus and Toolbars The main menu and toolbar are shown in FIG HERE.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-17 Menu Command Save File Menu (Continued) Shortcuts Alt-F Alt-S Ctl-S Description Saves the current package file. If the package file has not been previously named, the Save As dialog box is displayed. Save As Alt-F Alt-A Ctl-A Saves the current package file as a different name. Exit Alt-F Alt-E Ctl-E Exits the application if tests are not running.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI View Menu Table 3-19 describes the contents of the view menu. Table 3-19 Menu Command Status View Menu Shortcuts Alt-V Alt-M Description Toggles the status messages on/off in the Test Messages pane. This does not affect status redirection to a file. Alt-S Advisory Alt-V Alt-M Toggles the advisory messages on/off in the Test Messages pane. This does not affect advisory redirection to a file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-20 describes the contents of the insert menu. Table 3-20 Menu Command Add Device Insert Menu Shortcuts Alt-l Alt-D Ctl-D Description Adds a device node to the root of the configuration tree (Test Modules Window) when editing a snippet file. You must explicitly open a file with the .
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Table 3-21 Action Menu (Continued) Menu Command Shortcuts Description Import Tests Alt-A Alt-I F9 Imports one or more snippet files into the current package file. The end result is one or more new device nodes under the Template Configuration’s configuration tree. Switch Configurati on Alt-A Alt-W Ctl-W Moves focus to the configuration drop-down list in the Options pane.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI This button creates a new package file. Open File Button This button opens an existing package (.pkg) or snippet (.snp) file. Save File Button This button saves the current package or snippet file. If the file has not been previously saved the Save As dialog box is displayed. Configuration Drop-Down This drop-down list allows you to navigate between configurations within the package file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI mer. Figure 3-16 Right-Click Menu Invocation Parameters These parameters can be entered on the command line invocation of XTC to control various aspects of XTC. The forward slash token may be replaced by a hyphen. Example - The following commands both invoke autoconfigure: xtc /auto xtc –auto NOTE Flags are case insensitive. The flags can be used with a preceding dash (-) or double-dash (--) slash (/).
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI • /SUMMARY - Write summary file to specified file name • /SF - Same as /SUMMARY above. • /V - Control verbose messaging. • /PORT - Specify the socket for executive communications. • /Q - Quit mode. Turns off most messages to screen. • /L - Loud mode. Forces error, status, and advisory on. • /W - Turns off interactive tests. • Message Redirection parameters - Redirect selected message types to a file. file.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Command Line: /RUN This switch causes XTC to run tests when the application starts. The tests run are those in the flow tree of the test package specified on the command line. If the /AUTO switch is used in combination with this switch, autoconfig runs before testing is started. /CFG Command Line: /CFG configname This switch causes XTC to load a specific configuration tree when it starts.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI This switch forces testing to stop when an error occurs. When tests are stopped, a signal is sent to all running tests. Each test is responsible to check the signal, stop testing, clean up and execute. The amount of time it takes to stop is varies when tests are running in parallel. /FOREVER Command Line: /FOREVER This switch determines the name of the summary file as specified by the filename parameter.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI This switch forces advisory status and error messages to be displayed in the test message. /W Command Line: /W This switch forces interactive tests off. Default values provided by the module are used in place of the users input. Message Redirection The following switches can be used to redirect test message types to files. Any combination of the parameters is allowed. Command Line:.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Click “Configure” to start the autoconfigure process. The ‘template configuration’ is shown above. These are all of the devices that will run autoconfigure. When autoconfiguration completes a new configuration is created. See Figure 3-18. Figure 3-18 Autoconfiguration Complete A new configuration was created and automatically named “Configuratino_0”. The configuration name can be changed in the Options pane.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI In certain situations, a device may need to disable a test. For example, the disk drive module disables all physical write tests because they are destructive. Tests that are disabled can be easily re-enabled by double-clicking the test and clearing the disable check box. Disabled tests are represented by a faded blue icon.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Add New Parameters New parameters may also show up in the configuration tree. Autoconfigure Flow Tree Updates The second tree within a configuration is the flow tree. The flow tree controls test sequencing and duration for a particular configuration. Autoconfigure copies the flow tree from the template configuration into the new configuration, or creates a new serial flow if a template flow is not available.
Diagnostics HP bc2100 ATCA Server Blade Diagnostics Syntax and Semantics of UI Flow After Autoconfigure Re-Configure Re-Configure is useful when you have a configuration with only a subset of the devices from the template configuration tree, or if your configuration’s flow tree differs from the template flow. Autoconfigure runs the autoconfigure process on all devices from the template’s configuration tree, and copies the template's flow tree into the new configuration.
Diagnostics BIOS Settings BIOS Settings Tests affected by BIOS settings and optionally present hardware are given below. Table 3-23 BIOS Settings and Description Test Module Description lxBMC Not affected lxCPU Affected by CPU speed settings in the BIOS. Affected by enabling or disabling HyperThreading in the BIOS lxICHx Disabled if no Serial Ata Ports autosensed. Affected by the presence or absence of PMCs.
Diagnostics BIOS Settings Table 3-23 BIOS Settings and Description (Continued) Test Module Description System, Memory, Cache, CacheWindowed, CacheCoherency, WindowedCacheC oherency, CacheConflict, and CustomStressTest Many of these have “memory size” checks, which are affected by adding or removing memory after the module is configured.
Diagnostics Using ipmitool to Request Diagnostic Boot Using ipmitool to Request Diagnostic Boot For accessing the HP bc2100 ATCA Server Blade diagnostic Service OS, it is possible to override the default boot options and boot device priority (configured in the BIOS setup), and instead invoke a diagnostic boot sequence using an IPMI command to set the system boot options.
Diagnostics Using ipmitool to Request Diagnostic Boot 0x04 - "force PXE" bits 0001b in request data byte 3[5:2] Example 2: Sending an IPMI-over-LAN request to PXE boot blade in slot 4 on next boot ipmitool -I lan -H 0x04 0x00 0x00 192.168.1.2 -U root -P rootpass -t 0x8e raw 0x00 0x08 0x05 0x80 Where: "-I lan" specifies the LAN interface should be used by ipmitool "-H 192.168.1.
Diagnostics Customizing the Service OS ISO Image Customizing the Service OS ISO Image To create a customized Service OS ISO image with customized BDP configurations and any other changes, follow these instructions: 1. Copy the ISO image to a Linux machine. 2. Mount the ISO image using the Loopback interface. # mkdir /mnt/loop # mount -o loop ReleasedImage.iso /mnt/loop 3. Copy the entire directory structure of the mounted image to the new location. # cp -a /mnt/loop/* /home/CopiedImageTree 4.
4 System Event Log Decoding This chapter describes how to access and decode events logged in the System Event Log (SEL). For an in-depth discussion of SEL entries see the IPMI v1.5 documentation.
System Event Log Decoding Introduction Introduction This chapter provides information to decode and interpret the System Event Log (SEL). Accessing the SEL The System Event Log is accessed through the ShMM clia using the sel command. The sel command needs 2 arguments. The first argument specifies the IPMB address of the component that maintains the SEL (20 in the example). The second argument identifies the number of entries from the end of the SEL to be displayed (5 in the example).
System Event Log Decoding Introduction Table 4-1 Byte Raw SEL Event Record Data (Continued) Field Description 4 5 6 7 Timestamp Time when the event was logged. Least significant byte first. 8 9 Generator ID RqSA & LUN if the event was generated from the IPMB. Software ID if the event was generated from the system software. Byte 1 [7:1] - 7 bit I2C. Slave Address or 7-bit system software ID [0] - 0b = ID is IPMB Slave Address [0] - 1b = System Software ID Byte 2 [7:4] = Channel number.
System Event Log Decoding Introduction Table 4-1 Byte 13 Raw SEL Event Record Data (Continued) Field Event Dir | Event Type Description Event Dir [7] - 0b = Assertion event. [7] - 1b = Deassertion event. Event Type Type of trigger for the event, such as a critical threshold going high, a state asserted, etc. It also indicates the class of the event, such as discrete, threshold, or OEM. The Event Type field is encoded using the Event/Reading Type Code.
System Event Log Decoding Introduction Example SEL Entry Figure 4-1 shows an SEL entry with the record fields identified.
System Event Log Decoding Introduction Decoding the Example SEL Entry Decoding an SEL entry consists of identifying the values of the fields in the event record and using the value tables to decode their meaning. Table 4-2 shows how to decode and SEL entry. The example uses the SEL entry from Figure 4-1. Table 4-2 SEL Field Decoding Example Value Decode SEL ID 0x0273 Unique identifier of this SEL entry. Timestamp Apr 19 07:25:31 Date and time of the event.
System Event Log Decoding Introduction Table 4-2 SEL Field Sensor Definition Decoding Example (Continued) Value 0x01,55 Decode The Sensor Definition identifies the sensor type and the specific sensor that caused the event. Sensor types are defined in Table 4-5 (Table 4-5 reproduces Table 36-3 from the IPMI v1.5 documentation). Specific sensor information is dependent on the component identified in the Event Generator field.
System Event Log Decoding IPMB Address to Physical Slot Mapping for SEL Decoding IPMB Address to Physical Slot Mapping for SEL Decoding Table 4-3 is used to map the IPMB address found in the SEL event to the physical slot to identify the blade that caused the SEL event to be recorded.
System Event Log Decoding Decoding Tables Decoding Tables Table 4-4 Generic Event/Reading Type Code Generic Event/Reading Type Codes Event/Reading Class Generic Offset (Data 1) Description THRESHOLD BASED STATES 01h Threshold 00h 01h 02h 03h 04h 05h 06h 07h 08h 09h 0Ah 0Bh Lower Non-critical - going low Lower Non-critical - going high Lower Critical - going low Lower Critical - going high Lower Non-recoverable - going low Lower Non-recoverable - going high Upper Non-critical - going low Upper Non-cr
System Event Log Decoding Decoding Tables Table 4-4 Generic Event/Reading Type Code Generic Event/Reading Type Codes (Continued) Event/Reading Class Generic Offset (Data 1) Description AVAILABILITY STATUS STATES 08h ‘digital’ Discrete 00h 01h Device Removed / Device Absent Device Inserted / Device Present 09h ‘digital’ Discrete 00h 01h Device Disabled Device Enabled 0Ah Discrete 00h 01h 02h 03h 04h 05h 06h 07h 08h transition to Running transition to In Test transition to Power Off transition
System Event Log Decoding Decoding Tables Table 4-4 Generic Event/Reading Type Codes (Continued) Generic Event/Reading Type Code Event/Reading Class 0Bh Discrete Generic Offset (Data 1) 00h Chapter 4 Description Redundancy States Fully Redundant (formerly “Redundancy Regained”) Indicates that full redundancy has been regained. 01h Redundancy Lost Entered any non-redundant state, including Non-redundant: Insufficient Resources.
System Event Log Decoding Decoding Tables Table 4-4 Generic Event/Reading Type Codes (Continued) Generic Event/Reading Type Code Event/Reading Class 0Ch Discrete Generic Offset (Data 1) ACPI Device Power States 00h 01h 02h 03h 204 Description D0 Power State D1 Power State D2 Power State D3 Power State Chapter 4
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Sensor Type Codes (Table 36-3 from the IPMI v1.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Sensor Type Codes (Table 36-3 from the IPMI v1.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Specific Offset (Data 1) Sensor Type Code Event POST Memory Resize 0Eh - - System Firmware Progress (Formerly POST Error) 0Fh 00h System Firmware Error (POST Error) The Event Data 2 field can be used to provide an event extension code, with the following definition: Event Data 2: 00h Unspecified.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type System Firmware Progress (Formerly POST Error) Continued 208 Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code Sensor Specific Offset (Data 1) 02h Event System Firmware Progress The Event Data 2 field can be used to provide an event extension code with the following definition: Event Data 2: 00h Unspecified. 01h Memory initialization.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Event Logging Disabled Watchdog 1 Chapter 4 Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code 10h Sensor Specific Offset (Data 1) Event 00h Correctable Memory Error Logging Disabled 01h Event ‘Type’ Logging Disabled. Event Logging is disabled for the following event/reading type and the offset has been disabled.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type System Event 210 Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code 12h Sensor Specific Offset (Data 1) Event 00h 01h 02h System Reconfigured OEM System Boot Event Undetermined system hardware failure (this event would typically require system-specific diagnostics to determine FRU / failure type) 03h Entry added to Auxiliary Log (see 25.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Sensor Type Codes (Table 36-3 from the IPMI v1.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Sensor Type Codes (Table 36-3 from the IPMI v1.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Slot / Connector Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code 21h Sensor Specific Offset (Data 1) Event 00h 01h 02h Fault Status asserted Identify Status asserted Slot / Connector Device installed/attached [This can include dock events] 03h Slot / Connector Ready for Device Installation - Typically, this means that the slot power is off.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type System ACPI Power State Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code 22h Sensor Specific Offset (Data 1) 00h 01h 02h 03h 04h 05h 06h 07h 08h 09h 0Ah 0Bh 0Ch 0Eh 214 Event S0 / G0 “working” S1 “sleeping with system h/w & processor context maintained” S2 “sleeping, processor context lost” S3 “sleeping, processor & h/w context lost, memory retained.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Watchdog 2 Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Type Code Sensor Specific Offset (Data 1) Event This sensor is recommended for new IPMI v1.0 and later implementations.
System Event Log Decoding Sensor Type Codes (Table 36-3 from the IPMI v1.5 Documentation) Table 4-5 Sensor Type Entity Presence Sensor Type Codes (Table 36-3 from the IPMI v1.5 Document) Sensor Specific Offset (Data 1) Sensor Type Code 25h Event This sensor type provides a mechanism that allows a management controller to direct system management software to ignore a set of sensors based on detecting that presence of an entity.
System Event Log Decoding Component-Specific SEL Decoding Tables Component-Specific SEL Decoding Tables The component-specific SEL decoding tables are available for download from http://www.hp.com/ To download the decoding tables go to http://www.hp.com/. Select the Software & Drivers link. Enter your product in the dialog box and select ‘go’. Select the desired decoding table and follow the instructions for downloading.