HP ProLiant DL585 Server Maintenance and Service Guide July 2004 (Second Edition) Part Number 358708-002
© 2004 Hewlett-Packard Development Company, L.P. AMD, Opteron, and HyperTransport are trademarks of AMD Corporation in the United States and other countries. Microsoft, Windows, and Windows NT are U.S registered trademarks of Microsoft Corporation. Hewlett-Packard Company shall not be liable for technical or editorial errors or omissions contained herein. The information in this document is provided “as is” without warranty of any kind and is subject to change without notice.
Contents About This Guide Audience Assumptions.................................................................................................................................ix Technician Notes..........................................................................................................................................ix Where to Go for Additional Help.................................................................................................................xi Integrated Management Log ...
Contents Power Button/LED assembly ............................................................................................................2-28 PCI-X Expansion Boards.........................................................................................................................2-29 Performance Balancing .....................................................................................................................2-30 PCI-X Expansion Boards ................................................
Contents Other Information Resources .................................................................................................................. 5-13 Chapter 6 Specifications Index List of Figures 1-1 1-2 2-1 2-2 2-3 2-4 2-5 2-6 2-7 2-8 2-9 2-10 2-11 2-12 2-13 2-14 2-15 2-16 2-17 2-18 2-19 2-20 2-21 2-22 2-23 2-24 2-25 2-26 2-27 2-28 2-29 2-30 2-31 2-32 2-33 2-34 2-35 2-36 2-37 2-38 2-39 Mechanical components exploded view ...........................................................................
Contents 2-40 2-41 4-1 4-2 4-3 4-4 4-5 4-6 4-7 4-8 4-9 4-10 4-11 4-12 4-13 4-14 4-15 4-16 4-17 Removing the power supply connectors............................................................................................2-47 Removing the AC filters....................................................................................................................2-47 Rear panel components ....................................................................................................................
Contents 5-6 5-7 5-8 6-1 Is the Monitor Displaying Information? ............................................................................................. 5-8 Problems After Initial Boot................................................................................................................. 5-9 Troubleshooting Resources............................................................................................................... 5-13 Server Specifications .......................................
About This Guide This maintenance and service guide can be used for reference when servicing the HP ProLiant DL585 server. WARNING: To reduce the risk of personal injury from electric shock and hazardous energy levels, only authorized service technicians should attempt to repair this equipment. Improper repairs can create conditions that are hazardous. Audience Assumptions This guide is for service technicians.
About This Guide CAUTION: To properly ventilate the system, you must provide at least 7.6 cm (3.0 in.) of clearance at the front and back of the server. CAUTION: The computer is designed to be electrically grounded (earthed). To ensure proper operation, plug the AC power cord into a properly grounded AC outlet only. NOTE: Any indications of component replacement or printed wiring board modifications may void any warranty.
About This Guide Where to Go for Additional Help In addition to this guide, the following information sources are available: • User documentation • Service Quick Reference Guide • Service training guides • Service advisories and bulletins • QuickFind information services • Insight Manager software Integrated Management Log The server includes an integrated, nonvolatile management log that contains fault and management information.
1 Illustrated Parts Catalog Mechanical Components Exploded View Figure 1-1: Mechanical components exploded view HP ProLiant DL585 Server Maintenance and Service Guide 1-1
Illustrated Parts Catalog Mechanical Components Spare Parts List Table 1-1: Mechanical Components Spare Parts List Item Description Spare Part Number 1 Access panel 359772-001 2 Front bezel 243669-001 3 SCSI hard drive blank 122759-001 4 SCSI hard drive 4a 18.2-GB SCSI hard drive, U320 15K* 289240-001 4b 36.4-GB SCSI hard drive, U320 10K* 289041-001 4c 36.4-GB SCSI hard drive, U320 15K* 289241-001 4d 72.8-GB SCSI hard drive, U320 10K* 289042-001 4e 72.
Illustrated Parts Catalog System Components Exploded View Figure 1-2: System components exploded view HP ProLiant DL585 Server Maintenance and Service Guide 1-3
Illustrated Parts Catalog System Components Spare Parts List Table 1-2: System Components Spare Parts List Item Description Spare Part Number Media Storage Devices 1 Diskette drive, 12.
Illustrated Parts Catalog Table 1-2: System Components Spare Parts List continued Item Description Spare Part Number 14 5i Plus memory module 260741-001 15 Battery-Backed Write Cache Module, 4.
2 Removal and Replacement Procedures You need the following items for some procedures: • Torx T-15 tool • Flathead screwdriver • Diagnostics Utility on the HP SmartStart CD Safety Considerations Before performing service procedures, review the following safety information. Electrostatic Discharge A discharge of static electricity can damage static-sensitive devices or micro-circuitry. Proper packaging and grounding techniques are necessary precautions to prevent damage.
Removal and Replacement Procedures Server Warnings and Cautions WARNING: Do not exceed the level of repair specified in the procedures in the product documentation. All troubleshooting and repair procedures are detailed to allow only subassembly or module-level repair. Because of the complexity of the individual boards and subassemblies, do not attempt to make repairs at the component level or to make modifications to any printed wiring board. Improper repairs can create a safety hazard.
Removal and Replacement Procedures Preparation Procedures To access some components and perform certain service procedures, you must do one or more of the following: • Extend the server from the rack. If you are performing service procedures in an HP, Compaq branded, or third-party rack cabinet, you can use the locking feature of the rack rails to support the server and gain access to internal components. For more information about telco rack solutions, refer to the RackSolutions.com website: www.
Removal and Replacement Procedures Powering Down the Server WARNING: To reduce the risk of personal injury, electric shock, or damage to the equipment, remove the power cord to remove power from the server. The front panel Power On/Standby button does not completely shut off system power. Portions of the power supply and some internal circuitry remain active until AC power is removed. IMPORTANT: If installing a hot-plug device, it is not necessary to power down the server. 1. Back up the server data. 2.
Removal and Replacement Procedures Locating and Removing the Torx T-15 Tool Many hardware procedures in the server are toolless, but a few require the removal of Torx T-15 screws that have been installed for shipping or security reasons. A Torx T-15 tool ships with the server for the removal of these screws. 1. Locate the Torx T-15 tool on the back of the server. 2. Slide the tool upward out of the retaining clips.
Removal and Replacement Procedures Extending the Server from the Rack 1. Loosen the thumbscrews that secure the server to the front of the rack. Figure 2-3: Loosening the front panel thumbscrews 2. Extend the server on the rack rails until the server rail-release latches engage.
Removal and Replacement Procedures WARNING: Be careful when pressing the rail-release levers and sliding the component into or out of the rack. The sliding rails could pinch your fingertips. 3. After performing the installation or maintenance procedure, press the rail-release levers at the front of both server rails and slide the server into the rack. Figure 2-5: Sliding the server into the rack 4. Secure the server by tightening the thumbscrews.
Removal and Replacement Procedures Removing the Access Panel WARNING: Pressing the Power On/Standby button sets the server to the standby position, which removes power from most areas of the server. However, portions of the power supply and some internal circuitry remain active until the AC power cord is removed. WARNING: To reduce the risk of personal injury from hot surfaces, allow the internal system components to cool before touching. CAUTION: Electrostatic discharge can damage electronic components.
Removal and Replacement Procedures Removing the Server from the Rack WARNING: The server is very heavy, up to 44.5 kg (98 lb). To reduce the risk of personal injury or damage to the equipment: • Remove all hot-plug power supplies to reduce the weight of the server before lifting it. • Observe local occupational health and safety requirements and guidelines for material handling. • Get help to lift and maneuver the server. To remove the server from the rack: 1. Power down the server.
Removal and Replacement Procedures Processor Memory Boards The server supports up to four processor memory boards with AMD Opteron processors. CAUTION: Processor memory boards 1and 2 must always be installed. The system will not boot if either board is missing. CAUTION: Update the ROM to be sure that the system ROM recognizes the new processor you are installing. For the most recent ROMPaq, refer to the HP website and follow the support link on the product website.
Removal and Replacement Procedures Figure 2-8: Locating the processor memory boards Table 2-1: Processor Memory Boards Item Description 1 Processor memory board slot 1 (processor) 2 Processor memory board slot 2 (boot processor) 3 Processor memory board slot 3 (air baffle) 4 Processor memory board slot 4 (air baffle) HP ProLiant DL585 Server Maintenance and Service Guide 2-11
Removal and Replacement Procedures To remove a processor memory board: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory board. Figure 2-9: Removing a processor memory board Reverse the steps to install a processor memory board.
Removal and Replacement Procedures Processor Power Module (PPM) To remove a processor power module: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory board. Refer to “Processor Memory Boards” earlier in this chapter. 5.
Removal and Replacement Procedures Processor To remove a processor: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory board. Refer to “Processor Memory Boards” earlier in this chapter. 5. Remove the processor heatsink.
Removal and Replacement Procedures 6. Remove the processor. Figure 2-12: Removing a processor IMPORTANT: Align the processor to the socket before installing it. Reverse the steps to install a processor.
Removal and Replacement Procedures Memory Options Minimum Memory Requirements • Two DIMMs must be installed in bank 1 on the processor memory board in slot two. • All DIMMs on a processor memory board must have the same part number. NOTE: Processor memory boards in slots 1, 3, and 4 may be installed without memory.
Removal and Replacement Procedures To remove a DIMM: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory board. Refer to “Processor Memory Boards” earlier in this chapter. 5. Remove the DIMM.
Removal and Replacement Procedures Hard Drive Blanks To remove a hard drive blank: 1. Push the sliding release button to unlock the blank (1). 2. Pull the drive blank out of the drive cage (2). Figure 2-15: Removing a drive blank NOTE: Keep the blank for future use. Reverse the steps to replace a drive blank.
Removal and Replacement Procedures Hot-Plug SCSI Hard Drives Drive Replacement Precautions Be aware of the following guidelines cautioning unsafe hot-plug replacement. • Do not remove a degraded drive if any other member of the array is offline (the online LED is off). No other drive in the array is hot-pluggable without data loss. The exception to this is the use of RAID 1+0 as a fault-tolerant configuration. In this case, drives are mirrored in pairs.
Removal and Replacement Procedures To remove a non-functioning hot-plug SCSI hard drive: CAUTION: Remove or replace a hard drive only when the drive failure LED is amber. Data loss can occur if a drive is removed when the drive online LED is green. Refer to “Hot-Plug SCSI Hard Drive LEDs” in Chapter 4, “Server Component Identification,” for more information. CAUTION: Remove or replace only one hard drive at a time. The controller relies on other drives to reconstruct data on the replaced drive.
Removal and Replacement Procedures Power Supply Blank To remove a power supply blank: 1. Press the lever-release button on the handle of the power supply blanks (1). 2. Pull on the lever to release the blank (2). 3. Remove the power supply blank from the server (3). Figure 2-17: Removing a power supply blank NOTE: Keep the power supply blank for future use. Reverse the steps to replace the power supply blank.
Removal and Replacement Procedures Hot-Plug Power Supplies For information on power supply diagnosis, refer to “Hot-Plug Power Supply LEDs” in Chapter 4, “Server Component Identification.” CAUTION: Do not remove a hot-plug power supply unless there are two power supplies installed. If second power supply is not installed, the system must be powered down to remove the power supply.
Removal and Replacement Procedures Hot-Plug Fans The server supports redundant hot-plug fans in a 7+1 configuration. To replace a hot-plug fan: 1. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 2. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 3. Identify the malfunctioned fan by checking for an amber LED on the QuickFind Diagnostic Display or on the failed fan. 4. Remove the malfunctioned fan.
Removal and Replacement Procedures 5. Install the fan. 6. Be sure that the LED on the fan is green. 7. Reinstall the access panel. 8. Restore the server to its operating position in the rack. CAUTION: Do not operate the server for extended periods without the access panel installed. Operating the server without the access panel results in improper airflow that can lead to thermal damage. For information on hot-plug fan diagnosis, refer to “Hot-Plug Fan” in Chapter 4, “Server Component Identification.
Removal and Replacement Procedures Figure 2-21: Removing a drive from bay 2 To replace a universal media drive, slide the drive into the slot until it engages the connector. IMPORTANT: For the diskette drive to be bootable, it must be installed in the top universal media drive bay. For the CD-ROM or DVD-ROM drive to be bootable, it must be installed in the bottom universal media drive bay.
Removal and Replacement Procedures Front Bezel To remove the front bezel: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4.
Removal and Replacement Procedures 5. Using a flathead screwdriver, disengage the locking tabs on the bezel from underneath the chassis (1). 6. Remove the front bezel (2). Figure 2-23: Disengaging the locking tabs and removing the front bezel Reverse the steps to replace the front bezel.
Removal and Replacement Procedures Power Button/LED assembly To remove the power button/LED assembly: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Unplug the cable from the diagnostic display board (1). 5. Push on the locking tab until the opposite side comes out (2).
Removal and Replacement Procedures PCI-X Expansion Boards The server supports the installation of both PCI and PCI-X expansion boards.
Removal and Replacement Procedures Performance Balancing Balancing is the paired arrangement of expansion boards for optimal performance based on the bus architecture of the expansion slots. Properly balancing the boards across buses can improve performance. To balance expansion boards: • Populate slots 1 and 2 with 133-MHz PCI-X boards. • Populate slots 3-8 with 100-MHz PCI-X boards. • Populate slots across different buses before populating two slots on the same bus.
Removal and Replacement Procedures 8. Remove the board from the slot (3). Figure 2-26: Removing an expansion board Reverse the steps to replace an expansion board.
Removal and Replacement Procedures SCSI Cables To remove the SCSI cables: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Disconnect the SCSI cables from the SCSI backplane. 5. Disconnect the SCSI cables from the PCI-X expansion boards.
Removal and Replacement Procedures Front Fan Cage To remove the front fan cage: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory boards. Refer to “Processor Memory Boards” earlier in this chapter. 5. Remove the PCI-X expansion boards.
Removal and Replacement Procedures Pass-Through Board To remove the pass-through board: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory boards. Refer to “Processor Memory Boards” earlier in this chapter. 5. Remove the PCI-X expansion boards.
Removal and Replacement Procedures Diagnostic Display Board and Lightpipe To remove the diagnostic display board and lightpipe: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory boards. Refer to “Processor Memory Boards” earlier in this chapter.
Removal and Replacement Procedures Rear Fan Cage To remove the rear fan cage: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 3. Remove the access panel. Refer to “Removing the Access Panel” earlier in this chapter. 4. Remove the processor memory boards. Refer to “Processor Memory Boards” earlier in this chapter.
Removal and Replacement Procedures Battery-Backed Write Cache Assembly IMPORTANT: The Battery-Backed Write Cache Enabler (BBWCE) and the 5i Plus BBWC Module must be removed together with the cable connected to save the data in the cache. Unplugging the cable deletes all the data in the cache. To remove the BBWC assembly: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Extend the server from the rack.
Removal and Replacement Procedures 8. Remove the 5i Plus BBWC Module. CAUTION: Do not disconnect the cable connecting the BBWCE and the 5i Plus BBWC Module. Unplugging the cable deletes all the data in the cache. 9. Remove the BBWCE. Figure 2-32: Removing the BBWC enabler and 5i Plus BBWC Module Reverse the steps to replace the BBWC Enabler and 5i Plus BBWC Module.
Removal and Replacement Procedures SCSI Backplane To remove the SCSI backplane: 1. Extend the server from the rack. Refer to “Extending the Server from the Rack” earlier in this chapter. 2. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 3. Unseat all the hot-plug SCSI hard drives and pull them away from the SCSI backplane. Refer to “Hot-Plug SCSI Hard Drives” earlier in this chapter.
Removal and Replacement Procedures System Battery WARNING: This server contains an internal lithium manganese dioxide or vanadium pentoxide battery. A risk of fire and burns exists if the battery is not handled properly. To reduce the risk of personal injury: • Do not attempt to recharge the battery. • Do not expose to temperatures higher than 60°C (140°F). • Do not disassemble, crush, puncture, short external contacts, or dispose of in fire or water.
Removal and Replacement Procedures 5. Remove the existing battery. Figure 2-35: Removing the battery from the system board 6. Install the new battery. 7. Install the access panel. 8. Restore the server to its operating position in the rack. 9. Run the RBSU to reconfigure the system if the settings were lost.
Removal and Replacement Procedures System Board To remove the system board: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Remove all power supplies. Refer to “Hot-Plug Power Supply” earlier in this chapter. 3. Remove all the hot-plug SCSI hard drives. Refer to “Hot-Plug SCSI Hard Drives” earlier in this chapter. NOTE: Label the individual hard drives before pulling them out. 4. Extend the server from the rack.
Removal and Replacement Procedures 14. Remove the lower foam air baffle. 15. Unscrew the system board thumbscrews. It may be necessary to use the Torx tool located on the back of the server. 16. Slide the system board toward the front of the unit. Figure 2-36: Unscrewing the thumbscrews and sliding the system board IMPORTANT: When handling the system board, place your hands only on the two handles.
Removal and Replacement Procedures 17. Grasping the two handles, lift the side with the processor memory board slots first, tilting the board to a 45° angle (1). 18. Lift the entire system board out of the chassis (2). 19. Check the switch settings on the old board against the settings in chapter 4 to be sure that they are in the correct positions. Figure 2-37: Removing the system board from the chassis Reverse the steps to replace the system board.
Removal and Replacement Procedures AC Filter Cable Assembly The AC filter cable assembly consists of two cables leading from separate power supply input connectors near the front of the chassis and ending in an AC filter and inlets at the rear of the chassis. To remove the AC filter cable: 1. Power down the server. Refer to “Powering Down the Server” earlier in this chapter. 2. Remove all power supplies. Refer to “Hot-Plug Power Supply” earlier in this chapter. 3. Remove all the hot-plug SCSI hard drives.
Removal and Replacement Procedures 15. Cut the four tie-wraps securing the AC filter cable to the chassis. Figure 2-38: Cutting the cable tie-wraps 16. Remove the seven cable clips: a. Lift to disengage the cable clip (1). b. Slide the clip forward, and lift it off the chassis (2).
Removal and Replacement Procedures 17. Use the Torx T-15 tool or screwdriver to remove the eight screws that secure the power supply connectors to the chassis bottom (1). 18. Remove the cords from the connectors (2). 19. Remove the connectors from the server (3). Figure 2-40: Removing the power supply connectors 20. Use the Torx T-15 tool or a screwdriver to remove the four screws that secure the AC filter cable assembly to the rear chassis wall (1). 21.
Removal and Replacement Procedures Re-entering the Server Serial Number After replacing the system board or clearing the NVRAM, the server serial number must be re-entered. 1. To access RBSU, press the F9 key when prompted during POST. 2. Select the System Options menu. 3. Select Serial Number. The following warning is displayed: WARNING! WARNING! WARNING! The serial number is loaded into the system during the manufacturing process and should NOT be modified.
3 Diagnostic Tools Table 3-1: Diagnostic Tools Tool Description How to run the tool Array Diagnostics Utility (ADU) ADU is designed to run on all ProLiant systems that support HP array controllers. ADU collects information about the array controllers in the system and generates a list of detected problems. For a list of HP servers that support ADU, follow the support link on product website at www.hp.com. ASR-2 automatically restarts the server after a catastrophic operating system failure.
Diagnostic Tools Table 3-1: Diagnostic Tools Tool Description How to run the tool Integrated Lights-Out (iLO) iLO is a remote management port that can be accessed through a dedicated Lights-Out port on the rear panel of the server. Accessed through a standard browser from anywhere, the iLO capabilities will not require any unique software applications to use. For more information on iLO, refer to the Integrated Lights-Out User Guide on the Documentation CD.
4 Server Component Identification This chapter explains the location and function of system connectors, internal and external LEDs, and system switches. Connectors Use this section to identify system connectors for service procedures.
Server Component Identification Rear Panel Figure 4-1: Rear panel components Table 4-1: Rear Panel Components Item 4-2 Description 1 Integrated Lights-Out Manager connector 2 USB connector 1 3 USB connector 2 4 Rear Unit Identification button and LED 5 Keyboard connector 6 Mouse connector 7 Video connector 8 Serial connector 9 NIC 2 10 NIC 1 11 AC inlet 1(primary) 12 AC inlet 2 (optional) HP ProLiant DL585 Server Maintenance and Service Guide
Server Component Identification SCSI Backplane Board Figure 4-2: SCSI backplane board connectors Table 4-2: SCSI Backplane Board Connectors Item Description 1 SCSI channel A 2 SCSI simplex/duplex switch (default = duplex) 3 SCSI channel B HP ProLiant DL585 Server Maintenance and Service Guide 4-3
Server Component Identification Processor Memory Board DIMM Slots Figure 4-3: DIMM slots Table 4-3: Processor Memory Board Memory Banks 4-4 Slot Bank 1-2 Bank 1 3-4 Bank 2 5-6 Bank 3 7-8 Bank 4 HP ProLiant DL585 Server Maintenance and Service Guide
Server Component Identification Processor and Processor Power Module Figure 4-4: Processor and processor power module Table 4-4: Processor and Processor Power Module Item Description 1 Processor Power Module 2 Processor and heatsink HP ProLiant DL585 Server Maintenance and Service Guide 4-5
Server Component Identification LEDs The server contains several sets of LEDs that indicate the status and settings of hardware components.
Server Component Identification QuickFind Diagnostic Display Figure 4-6: QuickFind Diagnostic Display LEDs Table 4-6: QuickFind Diagnostic Display LEDs Item Description Status Action 1 Fan Off = normal Be sure fan is installed and seated properly. On = attention required If fan is installed and seated properly, replace the fan. Off = normal Processor pre-failure notification. Inspect the IML logs and/or POST messages. Processor may need to be replaced.
Server Component Identification Table 4-6: QuickFind Diagnostic Display LEDs continued Item Description Status Action 5 Temperature Off = normal One or more components have experienced an elevated temperature or a thermal shutdown. On = attention required A thermal shutdown is indicated by the Therm Trip LED. • Be sure the processor heatsink is properly attached. • Be sure there are no 1.7-inch DIMMs installed that prevent airflow across the processor heatsink).
Server Component Identification 12 Bus error Off = normal A bus error occurred. On = attention required The error may have been caused by one of the adapter cards on this bus (see amber LEDs for which slots may have caused the error). If problem persists, remove/replace one or both of the cards.
Server Component Identification Table 4-8: Hot-Plug SCSI Hard Drive LED Combinations 4-10 Activity LED Online LED Fault LED Status On Off Off Do not remove the drive. Removing a drive during this process will cause data loss. The drive is being accessed and is not configured as part of an array. On Flashing Off Do not remove the drive. Removing a drive during this process will cause data loss. The drive is rebuilding or undergoing capacity expansion.
Server Component Identification Hot-Plug Fan Figure 4-8: Hot-plug fan LED The hot-plug fan LED indicates the following conditions: Table 4-9: Hot-Plug Fan LED LED Status Description Off Power is not applied to the fan Green Power is applied to the fan and the fan is functional Amber Fan failure HP ProLiant DL585 Server Maintenance and Service Guide 4-11
Server Component Identification Hot-Plug Power Supplies Figure 4-9: Hot-plug power supply LEDs Table 4-10: Hot-Plug Power Supply LEDs (1) Power LED Status (2) Fault LED Status Description Green Amber Off Off No AC power Off On No AC power to power supply -OrPower supply failure Blinking Off AC power present System in standby mode 4-12 On Off Power supply on and working properly On Blinking Power supply current limit exceeded HP ProLiant DL585 Server Maintenance and Service Guide
Server Component Identification NIC LEDs Figure 4-10: NIC LEDs Table 4-11: NIC LEDs Item Description LED Color Status 1 Activity LED Green On or flashing = network activity Off = no network activity 2 Link LED Green On = linked to the network Off = not linked to the network HP ProLiant DL585 Server Maintenance and Service Guide 4-13
Server Component Identification Battery-Backed Write Cache Enabler Figure 4-11: Battery-Backed Write Cache Enabler LEDs NOTE: The battery will take 24 hours to charge for the first time. Table 4-12: Battery-Backed Write Cache Enabler LEDs Server Status LED Color LED Status Battery Module Status Server is on and has normal run time.
Server Component Identification Internal Diagnostic Display Figure 4-12: Internal diagnostic display LEDs and switches Table 4-13: Internal diagnostic display switches Item Switch 1 Switch 2 Description Port 84 OFF OFF For more information on Port 84, refer to the Port 84 diagnostic codes that occur during the boot process. Reserved OFF ON N/A iLO ON OFF For more information on iLO, refer to the Integrated LightsOut User Guide.
Server Component Identification System Board Switches Some server operations, including adding or removing a component or changing a security feature, require that you reconfigure a system switch. If the system configuration is incorrect, the server may not work properly and you may receive error messages on the screen. Setting and checking the system board switches is an important part of the overall troubleshooting process.
Server Component Identification System Maintenance Switch (SW3) Figure 4-13: System maintenance switch (SW3) Table 4-14: System Maintenance Switch (SW3) Position Default 1 2 Function Description Settings Enables or disables configuration changes in nonvolatile RAM (NVRAM) Open (off) Reserved Off Lock configuration Configuration can be changed. Closed (on) Configuration is locked and cannot be changed.
Server Component Identification Table 4-14: System Maintenance Switch (SW3) continued Position Default Function Description Settings 5 Off Password disable Enables or disables password protection. Open (off) Power-on password is enabled. Closed (on) Power-on password is disabled. 6 Off Configuration validation Invalidates nonvolatile RAM (NVRAM) configuration information Open (off) NVRAM is valid. Closed (on) NVRAM is invalid, and configuration is lost.
Server Component Identification System ID switch (SW4) Figure 4-14: System ID switch (SW4) Table 4-15: System ID Switch (SW4) Position Description 1 Reserved 2 Reserved 3 Reserved HP ProLiant DL585 Server Maintenance and Service Guide 4-19
Server Component Identification iLO/Redundant ROM Override Switch (SW5) The iLO/ Redundant ROM Override switch (SW5) is a two-position switch that is used for miscellaneous purposes. The iLO security override switch allows the administrator full access to the iLO processor. This access is necessary if the administrator lost their password or if the iLO boot-block needs to be flashed.
Server Component Identification Non-Maskable Interrupt (NMI) Switch Crash dump analysis is an essential part of eliminating reliability problems such as hangs or crashes in operating systems, device drivers, and applications. Crashes can freeze a system, requiring you to do a hard reset. Resetting the system erases any information that supports root cause analysis. Systems running supported Microsoft operating systems experience a blue screen trap when the operating system crashes.
Server Component Identification Rear Unit Identification LED Switch Figure 4-17: Rear Unit Identification LED switch Table 4-17: Rear Unit ID Switch (1) 4-22 LED Status Description On (blue) The switch is activated Off The switch is deactivated Flashing The system is being remotely accessed HP ProLiant DL585 Server Maintenance and Service Guide
5 Troubleshooting This chapter provides specific troubleshooting information for the server. Use it to find details about server startup and operation errors. For information on LEDs and switches specific to the server, refer to Chapter 4, “Server Component Identification.” For a more detailed discussion of troubleshooting techniques, diagnostic tools, error messages, and preventative maintenance, refer to the HP Servers Troubleshooting Guide included on the Documentation CD that ships with the server.
Troubleshooting If the Server Does Not Start This section provides systematic instructions on what to try and where to go for help for the most common problems encountered during initial POST. The server must first complete this test each time you power up, before it can load the operating system and start running software applications. WARNING: A risk of personal injury exists from hazardous energy levels.
Troubleshooting c. Copyright information d. Processor initialization e. Multi-initiator configuration IMPORTANT: Select the multi-initiator configuration utility (CRTL-A) only to support HP storage and clustering options. f. PXE initialization g. Option ROMs h. SCSI devices 10. The operating system loads to complete the boot process. If the server completes POST and attempts to load the operating system, go to the “Problems After Initial Boot” section in this chapter.
Troubleshooting Diagnostic Steps If the server does not power up, or powers up but does not complete POST, answer the questions in Table 5-1 to determine appropriate actions based on the symptoms observed. The flow of questions reflects the usual flow of events during a power-on sequence. A flow chart following this table illustrates the recommended diagnostic steps and decision options. According to the answers you give, you will be directed to an appropriate secondary table in this section.
Troubleshooting Table 5-2: Is the System Power LED Off? continued Answer Possible Reasons Possible Solutions No If the system power button LED is amber, press the Power On/Standby button. Refer to Table 5-3. Note: For LED locations and functions, refer to Chapter 4, “Server Component Identification.” Table 5-3: Is the System Power LED Green? Answer Possible Reasons Possible Solutions No, it is off Power On/Standby button has not been pressed firmly. Firmly press the power button.
Troubleshooting Table 5-5: Is the Internal Health LED Green? Answer Possible Reasons Possible Solutions For component failure identification (amber LED), check the hood label. No, it is amber A processor is in pre-failure condition. Use internal component failure LEDs to identify: A DIMM is in pre-failure condition. • Missing components One memory bank is valid, but another bank is missing a DIMM.
Troubleshooting Table 5-5: Is the Internal Health LED Green? continued Answer Possible Reasons Possible Solutions Interlock problem Reseat all boards and cables. Processor memory board configuration error Inspect the diagnostic LED panel to determine if one of the processor memory boards has both its PPM and BRD LEDs illuminated. Ensure that processor and PPM are properly installed on the processor memory board.
Troubleshooting Table 5-6: Is the Monitor Displaying Information? Answer Possible Reasons Possible Solutions No The monitor may not have power. Be sure that the monitor AC power cord is plugged in and that the monitor power button has been pressed. Video may not be connected properly. Non-volatile RAM (NVRAM) may be corrupted. The system ROM and redundant ROM may be corrupted. A processor memory board or HyperTransport link may have failed.
Troubleshooting Problems After Initial Boot Once the server has passed POST, you may still encounter errors, such as an inability to load the operating system. Use Table 5-7 to troubleshoot server installation problems that occur after the initial boot. Refer to the HP Servers Troubleshooting Guide for more information. Table 5-7: Problems After Initial Boot Problem Possible Reasons Possible Solutions System cannot load ProLiant Essentials Foundation Pack.
Troubleshooting Table 5-7: Problems After Initial Boot continued Problem Possible Reasons Possible Solutions Server cannot load operating system. Required operating system step was missed. Follow these steps: Installation problem occurred. 1. Note at which phase the operating system failed. 2. Remove any loaded operating system. 3. Refer to the operating system documentation. 4. Install the operating system again.
Troubleshooting Troubleshooting a Failed Processor The AMD HyperTransport link architecture links multiple processors. When a processor failure occurs, the root cause of the problem can be either a failed HyperTransport link or a failed processor. A failed processor or failed HyperTransport link causes the system to hang at POST without video. Perform the steps outlined in the following sections to isolate a failed component in a 2 or 4-processor memory board configuration.
Troubleshooting Four-Processor Memory Board Configuration In a 4P configuration, all processor memory board slots are populated. 1. If the system fails to boot, remove the processor memory boards in slot 3 and slot 4 to attempt booting with 2P. 2. If this works, one or both of the processor memory boards removed from slots 3 and 4 could be bad. 3. Go through the steps in the 2P description above to isolate the failed components. 4.
Troubleshooting Other Information Resources Refer to the following additional information for help. Table 5-8: Troubleshooting Resources Resource Description HP Servers Troubleshooting Guide This is a resource for obtaining troubleshooting information that is beyond the scope of this document. It includes general hardware and software troubleshooting information for all ProLiant servers, a complete list of error messages along with explanations of probable causes, and a list of remedial measures.
6 Specifications Table 6-1: Server Specifications Feature Units Dimensions Height 17.5 cm (6.88 in) Depth 69.2 cm (27.25 in) Width 46.3 cm (19.0 in) Weight 30.8 kg, minimum (68 lb, minimum) 44.
Index A D AC filter cable removing 2-45 replacing 2-45 AC power supply See power supply access panel opening 2-8 removing 2-8 ADU (Array Diagnostic Utility), described 3-1 air baffle 2-12, 2-15 airflow, caution 2-22 Array Diagnostic Utility See ADU Automatic Server Recovery-2 (ASR-2), described 3-1 data loss caution 5-9 diagnostic accessing 3-1 described 3-1 locating 3-1 Diagnostic Display 4-7 diagnostic tools, ROM Based Setup Utility 3-2 Diagnostics, described 3-1 diskette drive, removing 2-24 document
Index G grounding x grounding plug ix grounding, procedures 2-1 guidelines expansion board installation 2-29 occupational safety and health 2-9 guides, Factory-Installed Operating System Software Installation Guide 5-10 H hard drive blanks, removing 2-18 hard drives, hot-plug LEDs, location 4-9 LEDs, status 4-9 help resources xi hot-plug fans installing 2-23 removing 2-23 hot-plug hard drive blanks See hard drive blanks hot-plug hard drive LEDs drive activity 4-9 fault status 4-9 online status 4-9 hot-plu
Index performance balancing 2-30 slot population 2-30 POST error messages 5-8 power button/LED assembly, removing and replacing 2-28 power supply blank, removing and replacing 2-21 power supply, removing and replacing 2-22 powering up, normal sequence 5-2 preparation procedures overview 2-3 removing server from the rack 2-9 processor memory boards air baffle 2-12, 2-15 cautions 2-10 installing 2-12, 2-14 locating 2-11 processor power modules (PPMs), when to replace 5-8 processors See processor memory board
Index tools required for servicing 2-1 Torx T-15, locating and removing 2-5 Torx T-15 tool locating 2-5 removing 2-5 troubleshooting BIOS problems 5-9 CD-ROM problems 5-9 CMOS problems 5-9 configure-to-order system problems 5-10 factory-installed software problems 5-10 how to diagnose problems 5-4, 5-10 installation problems 5-9, 5-10 memory problems 5-9 new hardware added 5-10 NVRAM problems 5-8 problems after initial boot 5-9 ProLiant Essentials problems 5-9, 5-10 resources 5-13 ROM problems 5-9 startup