SPARC Enterprise T5440 Server Service Manual Manual Code C120-E512-03EN, Part No.
Copyright © 2008, 2011 , Oracle and/or its affiliates. All rights reserved. FUJITSU LIMITED provided technical input and review on portions of this material. Oracle and/or its affiliates and Fujitsu Limited each own or control intellectual property rights relating to products and technology described in this document, and such products, technology and this document are protected by copyright laws, patents, and other intellectual property laws and international treaties.
Copyright © 2008, 2011, Oracle et/ou ses sociétés affiliées. Tous droits réservés. FUJITSU LIMITED a fourni et vérifié des données techniques de certaines parties de ce composant. Oracle et/ou ses sociétés affiliées et Fujitsu Limited détiennent et contrôlent chacune des droits de propriété intellectuelle relatifs aux produits et technologies décrits dans ce document.
Contents Preface xiii Identifying Server Components 1 Infrastructure Boards and Cables Front Panel Diagram Front Panel LEDs 6 8 Ethernet Port LEDs Managing Faults 3 5 Rear Panel Diagram Rear Panel LEDs 1 9 11 Understanding Fault Handling Options Server Diagnostics Overview Diagnostic Flowchart 11 12 13 Options for Accessing the Service Processor ILOM Overview 17 18 ALOM CMT Compatibility Shell Overview Predictive Self-Healing Overview Oracle VTS Overview 20 20 21 POST Fault Managem
▼ Switch From the System Console to the Service Processor (ILOM or ALOM CMT Compatibility Shell) 26 ▼ Switch From ILOM to the System Console ▼ Switch From the ALOM CMT Compatibility Shell to the System Console 26 Displaying FRU Information With ILOM 26 27 ▼ Display System Components (ILOM show components Command) 27 ▼ Display Individual Component Information (ILOM show Command) 28 Controlling How POST Runs POST Parameters 29 30 ▼ Change POST Parameters ▼ Run POST in Maximum Mode Detectin
Identifying Faults Detected by PSH ▼ 48 Detect Faults Identified by the Oracle Solaris PSH Facility (ILOM fmdump Command) 49 Clearing Faults 52 ▼ Clear Faults Detected During POST ▼ Clear Faults Detected by PSH ▼ Clear Faults Detected in the External I/O Expansion Unit Disabling Faulty Components 52 54 55 55 Disabling Faulty Components Using Automatic System Recovery ▼ Disable System Components ▼ Re-Enable System Components 57 57 ILOM-to-ALOM CMT Command Reference Preparing to Service th
Extending the Server to the Maintenance Position Components Serviced in the Maintenance Position 69 ▼ 70 Extend the Server to the Maintenance Position ▼ Remove the Server From the Rack ▼ Perform Electrostatic Discharge – Antistatic Prevention Measures ▼ Remove the Top Cover 71 73 Servicing Customer-Replaceable Units 75 Hot-Pluggable and Hot-Swappable Devices Servicing Hard Drives 76 ▼ Remove a Hard Drive (Hot-Plug) ▼ Install a Hard Drive (Hot-Plug) ▼ Remove a Hard Drive ▼ Install a
▼ Remove a Power Supply ▼ Install a Power Supply 92 93 Power Supply Device Identifiers Power Supply LED Servicing PCIe Cards 94 95 96 ▼ Remove a PCIe Card ▼ Install a PCIe Card ▼ Add a PCIe Card 96 97 98 PCIe Device Identifiers 99 PCIe Slot Configuration Guidelines Servicing CMP/Memory Modules 100 102 CMP/Memory Modules Overview 102 ▼ Remove a CMP/Memory Module ▼ Install a CMP/Memory Module ▼ Add a CMP/Memory Module 104 105 105 CMP and Memory Module Device Identifiers 107 Su
Servicing the Front Bezel 119 ▼ Remove the Front Bezel ▼ Install the Front Bezel 120 121 Servicing the DVD-ROM Drive 122 ▼ Remove the DVD-ROM Drive ▼ Install the DVD-ROM Drive Servicing the Service Processor Remove the Service Processor ▼ Install the Service Processor 124 126 127 ▼ Remove the IDPROM ▼ Install the IDPROM Servicing the Battery 123 124 ▼ Servicing the IDPROM 122 127 128 129 ▼ Remove the Battery ▼ Install the Battery 129 130 Servicing the Power Distribution B
Servicing the Flex Cable Assembly 144 ▼ Remove the Flex Cable Assembly ▼ Install the Flex Cable Assembly Servicing the Front Control Panel 146 148 ▼ Remove the Front Control Panel ▼ Install the Front Control Panel Servicing the Front I/O Board 145 148 149 150 ▼ Remove the Front I/O Board ▼ Install the Front I/O Board Returning the Server to Operation 150 151 153 ▼ Install the Top Cover ▼ Install the Server Into the Rack ▼ Slide the Server Into the Rack ▼ Connect the Power Cord
I/O Fabric in 4P Configuration Identifying Connector Pinouts 169 171 Serial Management Port Connector Pinouts 172 Network Management Port Connector Pinouts Serial Port Connector Pinouts USB Connector Pinouts 174 175 Gigabit Ethernet Connector Pinouts Server Components 177 Customer-Replaceable Units Field-Replaceable Units Index xii 178 180 183 SPARC Enterprise T5440 Server Service Manual • June 2011 176 173
Preface This manual provides detailed procedures that describe the removal and replacement of replaceable parts in the SPARC Enterprise T5440 Server. This manual also includes information about the use and maintenance of the server. This document is written for technicians, system administrators, authorized service providers (ASPs), and users who have advanced experience troubleshooting and replacing hardware.
■ “Identifying Server Components” on page 1 Provides an overview of the server, including major boards and components, as well as front and rear panel features. ■ “Managing Faults” on page 11 Describes the diagnostics that are available for monitoring and troubleshooting the server. ■ “Preparing to Service the System” on page 63 Describes the steps necessary to prepare the server for service.
(http://primeserver.fujitsu.
Title Description Manual Code Integrated Lights Out Manager 3.0 SNMP and IPMI Procedure Guide Information and procedures for accessing ILOM 3.0 functions using SNMP or IPMI management hosts C120-E579 Integrated Lights Out Manager 3.x Feature Updates and Release Notes Enhancements that have been made to ILOM firmware since the ILOM 3.0 release C120-E600 Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server How to use the ILOM 3.
Text Conventions Typeface* Meaning Examples AaBbCc123 The names of commands, files, and directories; on-screen computer output Edit your .login file. Use ls -a to list all files. % You have mail. AaBbCc123 What you type, when contrasted with on-screen computer output % su Password: AaBbCc123 Book titles, new words or terms, words to be emphasized. Replace command-line variables with real names or values. Read Chapter 6 in the User’s Guide. These are called class options.
Conventions for Alert Messages This manual uses the following conventions to show alert messages, which are intended to prevent injury to the user or bystanders as well as property damage, and important messages that are useful to the user. Caution – This indicates a hazardous situation that could result in death or serious personal injury (potential hazard) if the user does not perform the procedure correctly.
Also, important alert messages are shown in “Important Alert Messages” on page xix. Notes on Safety Important Alert Messages This manual provides the following important alert signals: Caution – This indicates a hazardous situation could result in minor or moderate personal injury if the user does not perform the procedure correctly. This signal also indicates that damage to the product or other property may occur if the user does not perform the procedure correctly.
Task Warning Maintenance Electric shock Never attempt to run the server with the covers removed. Hazardous voltage present. Because 3.3v standby power is always present in the system, you must unplug the power cords before accessing any cold-serviceable components. Caution – This indicates that surfaces are hot and might cause personal injury if touched. Avoid contact. Task Warning Maintenance Extremely hot FB-DIMMs may be hot. Use caution when servicing FB-DIMMs.
Caution – The following tasks regarding this product and the optional products provided from Fujitsu should only be performed by a certified service engineer. Users must not perform these tasks. Incorrect operation of these tasks may cause malfunction. ■ Unpacking optional adapters and such packages delivered to the users ■ Plugging or unplugging of external interface cards Remodeling/Rebuilding Caution – Do not make mechanical or electrical modifications to the equipment.
Documentation Feedback If you have any comments or requests regarding this document, or if you find any unclear statements in the document, please state your points specifically on the form at the following URL. (http://www.fujitsu.com/global/contact/computing/sparce_index.
Identifying Server Components These topics provide an overview of the server, including major boards and components, as well as front and rear panel features. Description Links Review the infrastructure boards and cables in the server. “Infrastructure Boards and Cables” on page 1 Review the front panel features. “Front Panel Diagram” on page 3 “Front Panel LEDs” on page 5 Review the rear panel features.
■ Memory module – A memory module containing slots for an additional 12 FB-DIMMs is associated with each CMP module. ■ Service processor – The service processor (ILOM) board controls the server power and monitors server power and environmental events. The service processor draws power from the server’s 3.3V standby supply rail, which is available whenever the system is receiving main input power, even when the system is turned off.
Front Panel Diagram The server front panel contains a recessed system power button, system status and fault LEDs, Locator button and LED. The front panel also provides access to internal hard drives, the DVD-ROM drive (if equipped), and the two front USB ports. The following illustration shows front panel features on the server front panel. For a detailed description of front panel controls and LEDs, see “Front Panel LEDs” on page 5.
4 ■ “Rear Panel Diagram” on page 6 ■ “Servicing the Front Bezel” on page 119 SPARC Enterprise T5440 Server Service Manual • June 2011
Front Panel LEDs LED or Button Icon Description Locator LED and button (white) The Locator LED enables you to find a particular system. The LED is activated using one of the following methods: • The ALOM CMT command setlocator on. • The ILOM command set /SYS/LOCATE value=Fast_Blink • Manually press the Locator button to toggle the Locator LED on or off. This LED provides the following indications: • Off – Normal operating state.
LED or Button Icon Description Fan Fault LED (amber) TOP FAN Provides the following operational fan indications: • Off – Indicates a steady state, no service action is required. • Steady on – Indicates that a fan failure event has been acknowledged and a service action is required on at least one of the fan modules. Power Supply Fault LED (amber) REAR PS Provides the following operational PSU indications: • Off – Indicates a steady state, no service action is required.
FIGURE: Rear Panel Features Figure Legend 1 Power supplies 2 Serial port 3 Serial management port 4 System status LEDs 5 USB ports 6 Network management port 7 Gigabit ethernet ports Related Information ■ “Front Panel Diagram” on page 3 ■ “Rear Panel LEDs” on page 8 ■ “Ethernet Port LEDs” on page 9 ■ “Detecting Faults Using LEDs” on page 34 Identifying Server Components 7
Rear Panel LEDs LED Icon Description Locator LED and button (white) The Locator LED enables you to find a particular system. The LED is activated using one of the following methods: • The ALOM CMT command setlocator on. • The ILOM command set /SYS/LOCATE value=Fast_Blink • Manually press the Locator button to toggle the Locator LED on or off. This LED provides the following indications: • Off – Normal operating state.
Ethernet Port LEDs The service processor network management port and the four 10/100/1000 Mbps Ethernet ports each have two LEDs. LED Color Description Left LED Amber or green Speed indicator: • Amber on – The link is operating as a Gigabit connection (1000-Mbps).* • Green on – The link is operating as a 100-Mbps connection. • Off – The link is operating as a 10-Mbps connection. Right LED Green Link/Activity indicator: • Steady on – A link is established. • Blinking – There is activity on this port.
10 SPARC Enterprise T5440 Server Service Manual • June 2011
Managing Faults These topics describe the diagnostics tools that are available for monitoring and troubleshooting the server. These topics are intended for technicians, service personnel, and system administrators who service and repair computer systems.
■ “Memory Fault Handling Overview” on page 24 Server Diagnostics Overview You can use a variety of diagnostic tools, commands, and indicators to monitor and troubleshoot a server: ■ LEDs – Provide a quick visual notification of the status of the server and of some of the FRUs. See “Detecting Faults Using LEDs” on page 34. ■ ILOM firmware – This system firmware runs on the service processor.
■ Verify that the server was installed properly. ■ Visually inspect cables and power. ■ (Optional) Perform a reset of the server. Related Information ■ “Diagnostic Flowchart” on page 13 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide Diagnostic Flowchart The following diagnostics are available to troubleshoot faulty hardware. See “Change POST Parameters” on page 31 for more information about each diagnostic in this chapter.
FIGURE: 14 Diagnostic Flowchart SPARC Enterprise T5440 Server Service Manual • June 2011
TABLE: Action No. Diagnostic Flowchart Actions Diagnostic Action Resulting Action For more information “Detecting Faults” on page 34 1. The Power OK LED is located on the front and rear Check Power OK of the chassis. and AC Present LEDs on the server. The AC Present LED is located on the rear of the server on each power supply. If these LEDs are not on, check the power source and power connections to the server. 2. Run the ILOM show faulty command to check for faults.
TABLE: Action No. 6. 7. Diagnostic Flowchart Actions (Continued) Diagnostic Action Resulting Action For more information Determine if the fault is an environmental or configuration fault. Determine if the fault is an environmental fault or a configuration fault. If the fault listed by the show faulty command displays a temperature or voltage fault, then the fault is an environmental fault.
TABLE: Action No. 8. 9. 10. Diagnostic Flowchart Actions (Continued) Diagnostic Action Resulting Action For more information Determine if the fault was detected by PSH. If the fault displayed included a uuid and sunw-msg-id property, the fault was detected by the Predictive Self-Healing software. If the fault is a PSH-detected fault, refer to the PSH Knowledge Article web site for additional information. The Knowledge Article for the fault is located at the following link: (http://www.sun.
The code examples in this document depict use of the ILOM shell. Note – Multiple service processor accounts can be active concurrently. A user can be logged in under one account using the ILOM shell, and another account using the ALOM CMT shell. Related Information ■ “Diagnostic Flowchart” on page 13 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide ■ Integrated Lights Out Manager 3.
FIGURE: ILOM Fault Management The service processor can detect when a fault is no longer present and clears the fault in several ways: ■ Fault recovery – The system automatically detects that the fault condition is no longer present. The service processor extinguishes the Service Required LED and updates the FRU’s PROM, indicating that the fault is no longer present. ■ Fault repair – The fault has been repaired by human intervention.
Related Information ■ “Diagnostic Flowchart” on page 13 ■ “Detecting Faults Using LEDs” on page 34 ■ “Detecting Faults (Oracle Solaris OS Files and Commands)” on page 39 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide ■ Integrated Lights Out Manager 3.0 Supplement for the SPARC Enterprise T5440 Server ALOM CMT Compatibility Shell Overview The default shell for the service processor is the ILOM shell.
The Oracle Solaris OS uses the Fault Manager daemon, fmd (1M), which starts at boot time and runs in the background to monitor the system. If a component generates an error, the daemon handles the error by correlating the error with data from previous errors and other related information to diagnose the problem. Once diagnosed, the Fault Manager daemon assigns the problem a Universal Unique Identifier (UUID) that distinguishes the problem across any set of systems.
■ “Oracle VTS Software Packages” on page 45 ■ “Useful Oracle VTS Tests” on page 46 ■ SPARC Enterprise T5440 Server Administration Guide POST Fault Management Overview Power-on self-test (POST) is a group of PROM-based tests that run when the server is powered on or reset. POST checks the basic integrity of the critical hardware components in the server (CMP, memory, and I/O subsystem). POST tests critical hardware components to verify functionality before the system boots and accesses software.
POST Fault Management Flowchart FIGURE: Flowchart of Variables for POST Configuration Related Information ■ “Diagnostic Flowchart” on page 13 Managing Faults 23
■ “Detecting Faults Using POST” on page 46 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide Memory Fault Handling Overview A variety of features plays a role in how the memory subsystem is configured and how memory faults are handled. Understanding the underlying features helps you identify and repair memory problems. This section describes how the server deals with memory faults.
Note – You can use the FB-DIMM DIAG buttons on the CMP module and memory module to identify faulty FB-DIMMs. See “FB-DIMM Fault Button Locations” on page 117. Once you identify which FB-DIMMs you want to replace, see “Servicing FB-DIMMs” on page 108 for FB-DIMM removal and replacement instructions. You must perform the instructions in that section to clear the faults and enable the replaced FB-DIMMs.
■ “Switch From the System Console to the Service Processor (ILOM or ALOM CMT Compatibility Shell)” on page 26 ■ “Switch From ILOM to the System Console” on page 26 ■ “Switch From the ALOM CMT Compatibility Shell to the System Console” on page 26 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide ▼ Switch From the System Console to the Service Processor (ILOM or ALOM CMT Compatibility Shell) ● To switch from the system console to the s
Displaying FRU Information With ILOM ■ “Display System Components (ILOM show components Command)” on page 27 ■ “Display Individual Component Information (ILOM show Command)” on page 28 ▼ Display System Components (ILOM show components Command) The show components command displays the system components (asrkeys) and reports their status. ● At the -> prompt, type the show components command. The examples below show two possibilities.
EXAMPLE: Output of the show components Command With No Disabled Components -> show components Target | Property | Value --------------------+------------------------+------------------------------/SYS/MB/PCIE0 | component_state | Enabled /SYS/MB/PCIE3/ | component_state | Enabled /SYS/MB/PCIE1/ | component_state | Enabled /SYS/MB/PCIE4/ | component_state | Enabled /SYS/MB/PCIE2/ | component_state | Enabled /SYS/MB/PCIE5/ | component_state | Enabled /SYS/MB/NET0 | component_state | Enabled /SYS/MB/NET1 | c
EXAMPLE: show Command Output -> show /SYS/MB/CPU0/CMP0/BR1/CH0/D0 /SYS/MB/CPU0/CMP0/BR1/CH0/D0 Targets: R0 R1 SEEPROM SERVICE PRSNT T_AMB Properties: type = DIMM component_state = Enabled fru_name = 1024MB DDR2 SDRAM FB-DIMM 333 (PC2 5300) fru_description = FBDIMM 1024 Mbyte fru_manufacturer = Micron Technology fru_version = FFFFFF fru_part_number = 18HF12872FD667D6D4 fru_serial_number = d81813ce fault_state = OK clear_fault_action = (none) Commands: cd show Controlling How POST Runs This topic contains
POST Parameters The server can be configured for normal, extensive, or no POST execution. You can also control the level of tests that run, the amount of POST output that is displayed, and which reset events trigger POST by using ILOM command variables. The keyswitch_state parameter, when set to diag, overrides all the other ILOM POST variables. The following table lists the ILOM variables used to configure POST. “POST Fault Management Flowchart” on page 23 shows how the variables work together.
Parameter Values Description normal POST output displays all test and informational messages. max POST displays all test, informational, and some debugging messages.
▼ Run POST in Maximum Mode This procedure describes how to run POST when you want maximum testing, as in the case when you are troubleshooting a server, or verifying a hardware upgrade or repair. 1. Access the ILOM prompt. See “Connecting to the Service Processor” on page 25. 2. Set the virtual keyswitch to diag so that POST will run in service mode. -> set /SYS/keyswitch_state=Diag Set ‘keyswitch_state’ to ‘Diag’ 3. Reset the system so that POST runs. There are several ways to initiate a reset.
EXAMPLE: show Command Output -> stop /SYS Are you sure you want to stop /SYS (y/n)? y Stopping /SYS -> start /SYS Are you sure you want to start /SYS (y/n)? y Starting /SYS EXAMPLE: show Command Output -> start /SP/console ... 2007-12-19 22:01:17.810 0:0:0>INFO: STATUS: Running RGMII 1G BCM5466R PHY level Loopback Test 2007-12-19 22:01:22.534 0:0:0>End : Neptune 1G Loopback Test Port 2 2007-12-19 22:01:22.553 0:0:0> 2007-12-19 22:01:22.
Detecting Faults This section describes the different methods you can use to identify system faults in the server. Task Topic Use front panel and back panel LEDs to identify system faults. “Detecting Faults Using LEDs” on page 34 Use the ILOM show faulty command to detect faults. “Detecting Faults (ILOM show faulty Command)” on page 36 Use Oracle Solaris OS files and commands to detect faults.
The following table describes which fault LEDs are lit under given error conditions. Use the ILOM show faulty command to obtain more information about the nature of a given fault. See “Detect Faults (ILOM show faulty Command)” on page 37.
Related Information ■ “Diagnostic Flowchart” on page 13 ■ “Detecting Faults Using LEDs” on page 34 ■ “ILOM-to-ALOM CMT Command Reference” on page 58 ■ SPARC Enterprise T5440 Server Installation and Setup Guide ■ SPARC Enterprise T5440 Server Administration Guide ■ Integrated Lights Out Manager 3.
▼ Detect Faults (ILOM show faulty Command) ● At the -> prompt, type the show faulty command.
■ Example showing a fault that was detected by the PSH technology. These kinds of faults are distinguished from other kinds of faults by the presence of a sunw-msg-id and by a UUID.
faults/0 /SP/faultmgmt/0/ faults/0 | | sp_detected_fault | | | Ext FRU /SYS/IOX@X0TC/IOB1/LINK | SIGCON=0 I2C no device response Detecting Faults (Oracle Solaris OS Files and Commands) With the Oracle Solaris OS running on the server, you have the full complement of Oracle Solaris OS files and commands available for collecting information and for troubleshooting.
▼ View System Message Log Files The error logging daemon, syslogd, automatically records various system warnings, errors, and faults in message files. These messages can alert you to system problems such as a device that is about to fail. The /var/adm directory contains several message files. The most recent messages are in the /var/adm/messages file. After a period of time (usually every week), a new messages file is automatically created.
▼ View ILOM Event Log ● Type the following command: -> show /SP/logs/event/list Note – The ILOM event log can also be viewed through the ILOM BUI or the ALOM CMT CLI. If a “major” or “critical” event is found that was not expected and not included under ILOM show faulty than it may indicate a system fault. The following is an example of unexpected major events in the log.
About Oracle VTS Software The Oracle VTS software features a Java-based browser environment, an ASCII-based screen interface, and a command-line interface. For more information about how to use the Oracle VTS software, see the Oracle VTS 7.0 User’s Guide. The Oracle Solaris OS must be running in order to use the Oracle VTS software. You also must ensure that the Oracle VTS validation test software is installed on your system.
■ If the Oracle VTS software is not installed, you see an error message for each missing package, as in EXAMPLE: show Command Output on page 43. See “Oracle VTS Overview” on page 21 for a list of required Oracle VTS software packages. 2. If the Oracle VTS software is not installed, you can obtain the installation packages from the following places: ■ Oracle Solaris Operating System DVDs ■ Download from the web. Refer to the Preface for information on how to access the web site.
4. (Optional) Select the test categories you want to run. Certain test categories are enabled by default. You can choose to accept these. Note – Oracle VTS Tests on page 46 lists test categories that are especially useful to run on this server. 5. (Optional) Customize individual tests. Click on the name of the test to select and customize individual tests. Tip – Use the System Excerciser – High Stress Mode to test system operations. Use the Component Stress – High setting for the highest stress possible.
■ Oracle VTS Test Kernel (Vtsk) Error – Error messages pertaining to the Oracle VTS software itself. Look here if the Oracle VTS software appears to be acting strangely, especially when it starts up. ■ Information – Detailed versions of all the status and error messages that appear in the test messages area. ■ Oracle Solaris OS Messages (/var/adm/messages) – A file containing messages generated by the operating system and various applications. ■ Test Messages (/var/Oracle VTS/logs/Oracle VTS.
Useful Oracle VTS Tests Oracle VTS Tests FRUs Exercised by Tests Memory Test FB-DIMMs Processor Test CMP, motherboard Disk Test Disks, cables, disk backplane, DVD drive Network Test Network interface, network cable, CMP, motherboard Interconnect Test Board ASICs and interconnects IO Ports Test I/O (serial port interface), USB subsystem Environmental Test Motherboard and service processor Related Information ■ “Diagnostic Flowchart” on page 13 ■ “Oracle VTS Software Packages” on page 45 ■
In the following example, POST reports a memory error at FB-DIMM location /SYS/MB/CPU0/CMP0/BR1/CH0/D0. The error was detected by POST running on core 7, strand 2. EXAMPLE: show Command Output 7:2> 7:2>ERROR: TEST = Data Bitwalk 7:2>H/W under test = /SYS/MB/CPU0/CMP0/BR1/CH0/D0 7:2>Repair Instructions: Replace items in order listed by 'H/W under test' above. 7:2>MSG = Pin 149 failed on /SYS/MB/CPU0/CMP0/BR1/CH0/D0 (J792) 7:2>END_ERROR 7:2>Decode of Dram Error Log Reg Channel 2 bits 60000000.
■ “POST Fault Management Flowchart” on page 23 ■ SPARC Enterprise T5440 Server Administration Guide Identifying Faults Detected by PSH When a PSH fault is detected, a Oracle Solaris console message is displayed, similar to the following example. EXAMPLE: Console Message Showing Fault Detected by PSH SUNW-MSG-ID: SUN4V-8000-DX, TYPE: Fault, VER: 1, SEVERITY: Minor EVENT-TIME: Wed Sep 14 10:09:46 EDT 2005 PLATFORM: SUNW,system_name, CSN: -, HOSTNAME: wgs48-37 SOURCE: cpumem-diagnosis, REV: 1.
EXAMPLE: ALOM CMT Alert of PSH Diagnosed Fault SC Alert: Host detected fault, MSGID: SUN4V-8000-DX The ILOM show faulty command provides summary information about the fault. See “Detect Faults (ILOM show faulty Command)” on page 37 for more information about the show faulty command. Note – The Service Required LED is also turned on for PSH diagnosed faults.
■ Faulted FRU. The information provided in the example includes the part number of the FRU (part=541215101) and the serial number of the FRU (serial=101083). The Location field provides the name of the FRU. In EXAMPLE: Output from the fmdump -v Command on page 50 the FRU name is MB, meaning the motherboard. Note – fmdump displays the PSH event log. Entries remain in the log after the fault has been repaired. 2. Use the message ID to obtain more information about this type of fault. a.
Automated Response The fault manager will attempt to remove the affected CPU from service. Impact System performance may be affected. Suggested Action for System Administrator Schedule a repair procedure to replace the affected CPU, the identity of which can be determined using fmdump -v -u . Details The Message ID: SUN4V-8000-JA indicates diagnosis has determined that a CPU is faulty. The Oracle Solaris fault manager arranged an automated attempt to disable this CPU....
Clearing Faults This section describes how to clear faults. Note – Some system faults are cleared automatically. Description Topic Clear faults detected during POST. “Clear Faults Detected During POST” on page 52 Clear faults detected by PSH.
1. After replacing a faulty FRU, at the ILOM prompt use the show faulty command to identify POST-detected faults. Faults detected by POST are distinguished from other kinds of faults by the text: Forced fail. No UUID number is reported. Refer to EXAMPLE: Fault Detected by POST on page 54. If no fault is reported, you do not need to do anything else. Do not perform the subsequent steps. 2. Use the component_state property of the component to clear the fault and remove the component from the ASR blacklist.
EXAMPLE: Fault Detected by POST -> show faulty Target | Property | Value ----------------------+------------------------+---------------------------/SP/faultmgmt/0 | fru | /SYS/MB/CPU0/CMP0/BR1/CH0/D0 /SP/faultmgmt/0 | timestamp | Dec 21 16:40:56 /SP/faultmgmt/0/ | timestamp | Dec 21 16:40:56 faults/0 | | /SP/faultmgmt/0/ | sp_detected_fault | /SYS/MB/CPU0/CMP0/BR1/CH0/D0 faults/0 | | Forced fail(POST) ▼ Clear Faults Detected by PSH When the Oracle Solaris PSH facility detects faults, the faults are logg
▼ Clear Faults Detected in the External I/O Expansion Unit For service processor detected faults in the External I/O Expansion Unit, the fault must be manually cleared from ILOM show faulty after the problem has been repaired. Note – After the problem has been repaired, the fault will also be cleared from the ILOM show faulty command by resetting the service processor.
Disabling Faulty Components Using Automatic System Recovery You can use the Automatic System Recovery (ASR) feature to configure the server to automatically disable failed components until they can be replaced. The following components are managed by the ASR feature: ■ UltraSPARC T2 Plus processor strands ■ Memory FB-DIMMs ■ I/O subsystem The database that contains the list of disabled components is referred to as the ASR blacklist (asr-db).
Note – A reset or power cycle is required after disabling or enabling a component. If the status of a component is changed, there is no effect to the system until the next reset or power cycle. Related Information ■ “Diagnostic Flowchart” on page 13 ■ “Detecting Faults” on page 34 ■ SPARC Enterprise T5440 Server Administration Guide ▼ Disable System Components The component_state property disables a component by adding it to the ASR blacklist. 1.
1. At the -> prompt, set the component_state property to Enabled. -> set /SYS/MB/CPU0/CMP0/BR1/CH0/D0 component_state=Enabled 2. Reset the server so that the ASR command takes effect. -> stop /SYS Are you sure you want to stop /SYS (y/n)? y Stopping /SYS -> start /SYS Are you sure you want to start /SYS (y/n)? y Starting /SYS Note – In the ILOM shell there is no notification when the system is actually powered off. Powering off takes about a minute.
■ Integrated Lights Out Manager 3.0 Supplement for the SPARC Enterprise T5440 Server ILOM Command ALOM CMT Command Description help [command] help [command] Displays a list of all available commands with syntax and descriptions. Specifying a command name as an option displays help for that command. set /HOST/send_break_action true break [-y][-c][-D] • -y skips the confirmation question. • -c executes a console command after the break command completes.
ILOM Command ALOM CMT Command bootmode value set /HOST/bootmode/value[normal|re [normal|reset_nvram| bootscript=string] set_nvram|bootscript=string] Description Enables control of the firmware during system initialization with the following options: • normal is the default boot mode. • reset_nvram resets OpenBoot PROM parameters to their default values. • bootscript=string enables the passing of a string to the boot command.
ILOM Command ALOM CMT Command Description set /SYS/keyswitch_state value normal | stby | diag | locked setkeyswitch [-y] value normal | stby | diag | locked • -y enables you to skip the confirmation question when setting the keyswitch to stby. Sets the virtual keyswitch. set /SUS/LOCATE value=value [Fast_blink | Off] setlocator value [on | off] Turns the Locator LED on the server on or off. (No ILOM equivalent.) showenvironment Displays the environmental status of the host server.
ILOM Command ALOM CMT Command Description show /SYS/LOCATE showlocator Displays the current state of the Locator LED as either on or off. show /SP/logs/event/list showlogs [-b lines | -e lines | -v] [-g lines] [-p logtype[r|p]]] Displays the history of all events logged in the service processor event buffers (in RAM or the persistent buffers).
Preparing to Service the System These topics describe how to prepare the server for servicing.
Observing Important Safety Precautions For your protection, observe the following safety precautions when setting up your equipment: ■ Follow all cautions and instructions marked on the equipment and described in the documentation shipped with your system. ■ Follow all cautions and instructions marked on the equipment and described in the SPARC Enterprise T5440 Server Safety and Compliance Guide.
Electrostatic Discharge Safety Measures This topic includes the following: ■ “Handling Electronic Components” on page 65 ■ “Antistatic Wrist Strap” on page 65 ■ “Antistatic Mat” on page 65 Handling Electronic Components Electrostatic discharge (ESD) sensitive devices, such as the motherboard, PCI cards, hard drives, and memory modules require special handling. Caution – Circuit boards and hard drives contain electronic components that are extremely sensitive to static electricity.
Required Tools ■ Antistatic wrist strap ■ Antistatic mat ■ No. 1 Phillips screwdriver ■ No. 2 Phillips screwdriver ■ 7 mm hex driver ■ No. 1 flat-blade screwdriver (battery removal) ■ Pen or pencil (power on server) ▼ Obtain the Chassis Serial Number To obtain support for your system, you need your chassis serial number. ● The chassis serial number is located on a sticker that is on the front of the server and another sticker on the side of the server.
Properties: type = Host System keyswitch_state = Normal product_name = T5440 product_serial_number = 0723BBC006 fault_state = OK clear_fault_action = (none) power_state = On Commands: cd reset set show start stop Powering Off the System Note – Additional information about powering off the system is located in the SPARC Enterprise T5440 Server Administration Guide.
svc.stard: The system is down. syncing file systems...done Program terminated r)eboot o)k prompt, h)alt? 2. Switch from the system console prompt to the service processor console prompt. Type: ok #. -> 3. From the ILOM -> prompt, type: -> stop /SYS Are you sure you want to stop /SYS (y/n)? y Stopping /SYS -> Note – To perform an immediate shutdown, use the stop -force -script /SYS command. Ensure that all data is saved before entering this command.
Caution – Because 3.3v standby power is always present in the system, you must unplug the power cords before accessing any cold-serviceable components.
▼ Extend the Server to the Maintenance Position 1. (Optional) Use the set /SYS/LOCATE command from the -> prompt to locate the system that requires maintenance. -> set /SYS/LOCATE value=Fast_Blink Once you have located the server, press the Locator LED and button to turn it off. 2. Verify that no cables will be damaged or will interfere when the server is extended.
4. While squeezing the slide rail locks, slowly pull the server forward until it is locked in the service position. ▼ Remove the Server From the Rack The server must be removed from the rack to remove or install the following components: ■ Motherboard Caution – Two people must dismount and carry the chassis. 1. Disconnect all the cables and power cords from the server. 2. Extend the server to the maintenance position. See “Extending the Server to the Maintenance Position” on page 69. 3.
FIGURE: Removing the Server From the Rack Figure Legend 1 Disconnect system cables and CMA. 2 Press inner rail release buttons to remove the server from the rack. Caution – Use two people to dismount and carry the chassis. FIGURE: Lift Warning 4. From the front of the server, press inner rail release buttons and pull the server forward until it is free of the rack rails. 5. Set the server on a sturdy work surface.
▼ Perform Electrostatic Discharge – Antistatic Prevention Measures 1. Prepare an antistatic surface to set parts on during the removal, installation, or replacement process. Place ESD-sensitive components such as the printed circuit boards on an antistatic mat. The following items can be used as an antistatic mat: ■ Antistatic bag used to wrap a replacement part ■ ESD mat ■ A disposable ESD mat (shipped with some replacement parts or optional system components) 2. Attach an antistatic wrist strap.
Caution – If the top cover is removed before the server is powered off, the server will immediately disable the front panel Power button and shut down. After such an event, you must replace the top cover and use the poweron command to power on the server. See “Power On the Server” on page 157.
Servicing Customer-Replaceable Units These topics describe how to service customer-replaceable units (CRUs) in the server. Topic Links Read and learn about components which can be serviced while the system is in operation. “Hot-Pluggable and Hot-Swappable Devices” on page 75 Remove, install and add hard drives. “Servicing Hard Drives” on page 76 Remove and install fan trays. “Servicing Fan Trays” on page 84 Remove and install power supplies.
■ Hard drives Hot-swappable devices are those devices that can be removed and installed while the server is running without affecting the rest of the server’s capabilities. The following devices are hot-swappable: ■ Fan trays ■ Power supplies Note – The chassis-mounted hard drives can be hot-swappable, depending on how they are configured.
Caution – You must use hard drives designed for this server, which have a vented front panel to allow adequate airflow to internal system components. Installing inappropriate hard drives could result in an overtemperature condition. The following situations inhibit your ability to hot-plug a drive: ■ If the hard drive contains the operating system, and the operating system is not mirrored on another drive. ■ If the hard drive cannot be logically isolated from the online operations of the server.
1. At the Solaris prompt, issue the cfgadm -al command to list all drives in the device tree, including drives that are not configured. Type: # cfgadm -al This command should identify the Ap_id for the hard drive you wish to remove, as in EXAMPLE: Sample Ap_id Output on page 79. 2. Issue the cfgadm -c unconfigure command to unconfigure the disk. For example, type: # cfgadm -c unconfigure c0::dsk/d1t1d1 where c0:dsk/c0t1d1 is the disk that you are trying to unconfigure. 3.
EXAMPLE: Sample Ap_id Output Ap_id c0 c0::dsk/d1t0d0 c0::dsk/d1t1d0 usb0/1 usb0/2 usb0/3 usb1/1 usb1/2 usb1/3 usb2/1 usb2/2 usb2/3 usb2/4 usb2/5 usb2/6 usb2/7 usb2/8 ---------------------------- Type scsi-bus disk disk unknown unknown unknown unknown unknown unknown unknown unknown unknown unknown unknown unknown unknown unknown Receptacle connected connected connected empty empty empty empty empty empty empty empty empty empty empty empty empty empty Occupant configured configured configured unconfigu
4. Close the latch to lock the drive in place. 5. At the Solaris prompt, type the cfgadm -al command to list all drives in the device tree, including any drives that are not configured. Type: # cfgadm -al This command should help you identify the Ap_id for the hard drive you installed. For an output example refer to EXAMPLE: Sample Ap_id Output on page 81. 6. Type the cfgadm -c configure command to configure the disk.
9. Type the iostat -E command. Type: # iostat -E The iostat -E command displays information about your system’s installed devices such as manufacturer, model number, serial number, size, and system error statistics.
1. Note the location of each hard drive. Note – You must install each hard drive in the same bay from which it was removed. 2. Press the hard drive latch release button. 3. Slide the hard drive out of its bay. ▼ Install a Hard Drive If you are installing a hard drive after servicing another component in the system, do the following: 1. Align the replacement drive to the drive slot. Hard drives are physically addressed according to the slot in which they are installed.
3. Close the latch to lock the drive in place. 4. If you performed any additional service procedures, see “Power On the Server” on page 157. Hard Drive Device Identifiers The following table lists physical drive locations and their corresponding default path names in OpenBoot PROM and Solaris for the server.
Hard Drive LEDs No. LED Color Notes 1 Ready -to-Remove Blue This LED is lit to indicate that a hard drive can be removed safely during a hot-plug operation. 2 Service Required Amber This LED is lit when the system is running and the hard drive is faulty. 3 OK/Activity Green This LED lights when data is being read from or written to the hard drive. The front and rear panel Service Required LEDs are also lit if the system detects a hard drive fault.
■ “Fan Tray Fault LED” on page 89 About Fan Trays Four fan trays are located toward the front of the server, arranged in two N+1 redundant pairs. Each fan tray contains a fan mounted in an integrated, hot-swappable CRU. If a fan tray fails, replace it as soon as possible to maintain server availability. Caution – Hazardous moving parts. Unless the power to the server is completely shut down, the only service permitted in the fan compartment is the replacement of the fan trays by trained personnel.
▼ Install a Fan Tray (Hot-Swap) 1. Slide the fan tray into its bay until it locks into place. Ensure that the fan tray is oriented correctly. Airflow in the system is from front to back. 2. Verify proper fan tray operation. See “Fan Tray Fault LED” on page 89. Next Steps If you are replacing a faulty fan tray due to an overtemperature condition, monitor the system to ensure proper cooling.
▼ Remove a Fan Tray If you are removing the fan trays as a prerequisite for another service procedure, follow the steps in this procedure. Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
▼ Install a Fan Tray 1. Slide each fan tray into its bay until it locks into place. Ensure that the fan tray is oriented correctly. Airflow in the system is from front to back. 2. Verify proper fan tray operation. See “Fan Tray Fault LED” on page 89. Next Steps If you are replacing the fan trays after performing another service procedure, complete these steps.
Fan Tray Fault LED Each fan tray contains a Fault LED that is located on the top panel of the server. The LED is visible when you slide the server partially out of the rack. LED Color Notes Fault Amber This LED is lit when the fan tray is faulty. The front panel Fan Fault LED, and the front and rear panel Service Required LEDs are also lit if the system detects a fan tray fault. In addition, the system Overtemp LED might be lit if a fan fault causes an increase in system operating temperature.
About Power Supplies The server is equipped with redundant hot-swappable power supplies. Redundant power supplies enable you to remove and replace a power supply without shutting the server down, provided that at least two other power supplies are online and working. Note – If a power supply fails and you do not have a replacement available, leave the failed power supply installed to ensure proper airflow in the server.
4. Grasp the power supply handle and press the release latch. 5. Pull the power supply out of the chassis. ▼ Install a Power Supply (Hot-Swap) 1. Align the replacement power supply with the empty power supply bay. 2. Slide the power supply into the bay until it is fully seated.
3. Reconnect the power cord to the power supply. Verify that the power supply LED is green or blinking green. 4. Verify that the system Power Supply Fault LED, and the front and rear Service Required LEDs are not lit. Note – See “Front Panel LEDs” on page 5 and “Rear Panel LEDs” on page 8 for more information about identifying and interpreting system LEDs. 5. At the ILOM -> prompt, use the show faulty command to verify the status of the power supplies.
If you are removing the power supplies as a prerequisite for another service procedure, follow these steps. Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
2. Slide the power supply into the bay until it is fully seated.
Note – Power supply names in ILOM messages are displayed with the full FRU name, such as /SYS/PS0. Related Information ■ “Managing Faults” on page 11 ■ “Hot-Pluggable and Hot-Swappable Devices” on page 75 “Power Supply LED” on page 95 Power Supply LED Each power supply contains a dual-color LED that is visible when looking at the back panel of the system. The following table includes a description of power supply LED modes and their function, listed from top to bottom.
See “Power Supply LED” on page 95 for specific information about power supply status LEDs.
3. Remove the PCIe card the system. 4. Place the PCIe card on an antistatic mat. 5. If you are not replacing the PCIe card, install a PCIe filler panel in its place. 6. Close the PCIe card latch. ▼ Install a PCIe Card 1. Identify the correct slot for installation. 2. Open the PCIe card latch.
3. Insert the PCIe card into its slot. 4. Close the PCIe card latch. Next Steps ■ “Install the Top Cover” on page 154 ■ “Slide the Server Into the Rack” on page 155 ■ “Power On the Server” on page 157 ▼ Add a PCIe Card Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
2. Open the PCIe card latch. 3. Remove the PCIe filler panel. 4. Insert the PCIe card into its slot. 5. Close the PCIe card latch. Next Steps ■ “Install the Top Cover” on page 154 ■ “Slide the Server Into the Rack” on page 155 ■ “Power On the Server” on page 157 PCIe Device Identifiers Device identifiers are case-sensitive.
Device Device Identifier Notes PCIe5 (XAUI1) /SYS/MB/PCIE5 or /SYS/MB/XAUI1 x8 slot; shared with XAUI slot PCIe6 /SYS/MB/PCIE6 x16 slot operating at x8 PCIe7 /SYS/MB/PCIE7 x8 slot Note – PCIe names in ILOM messages are displayed with the full FRU name, such as /SYS/MB/PCIE0. Note – In the Solaris OS, PCIe slot addresses are associated with CMP modules. The PCIe slot address in the Solaris OS might change if you add or remove CMP modules, or if a CMP module is brought offline.
Use the following guidelines to spread the load evenly across CMP/memory modules. If a slot is already populated with a device, install a new device in the next available slot, in the order indicated. Number of CMP/Memory Modules PCIe/XAUI Card Type Installation Order Notes 10 GBit Ethernet (XAUI) 1, 2, 3 or 4 card Slot 4, 5 Install XAUI cards first. External I/O Expansion Unit PCIe Link card 2 Slot 0, 4, 1, 5 Maximum of 4 cards; install in order shown.
Servicing CMP/Memory Modules This topic includes the following: ■ “CMP/Memory Modules Overview” on page 102 ■ “Remove a CMP/Memory Module” on page 104 ■ “Install a CMP/Memory Module” on page 105 ■ “Add a CMP/Memory Module” on page 105 ■ “CMP and Memory Module Device Identifiers” on page 107 ■ “Supported CMP/Memory Module Configurations” on page 107 CMP/Memory Modules Overview Up to four CMP/memory modules can be installed in the system. Each CMP module is paired with a memory module.
Related Information ■ “CMP and Memory Module Device Identifiers” on page 107 ■ “Supported CMP/Memory Module Configurations” on page 107 ■ “I/O Connections to CMP/Memory Modules” on page 160 ■ “Reconfiguring I/O Device Nodes” on page 162 ■ “Servicing FB-DIMMs” on page 108 ■ “System Bus Topology” on page 167 ■ “I/O Fabric in 2P Configuration” on page 168 ■ “I/O Fabric in 4P Configuration” on page 169 Servicing Customer-Replaceable Units 103
▼ Remove a CMP/Memory Module Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67. ■ “Extend the Server to the Maintenance Position” on page 70 ■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 73 ■ “Remove the Top Cover” on page 73 Do the following : 1. Identify the module you want to remove. 2.
▼ Install a CMP/Memory Module Note – If you are replacing a faulty CMP or memory module, you must transfer the FB-DIMMs on the faulty module to the replacement module. Replacement CMP/memory modules do not include FB-DIMMs. For more information about installing FB-DIMMs, see “Servicing FB-DIMMs” on page 108 1. Identify the correct slot for installation. 2. Slide the module down into its slot. 3. Rotate the ejector levers down to secure the module into place.
■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67. ■ “Extend the Server to the Maintenance Position” on page 70 ■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 73 ■ “Remove the Top Cover” on page 73 Do the following : 1. Identify the correct slot for installation. 2. Remove the air baffle. Squeeze the air baffle latches toward each other and lift the air baffle straight up and out of the chassis. 3.
CMP and Memory Module Device Identifiers The following table describes device, device identifiers, and supported configurations for CMP and memory modules. Device identifiers are case-sensitive.
Configuration Four CMP/memory pairs (full configurations) CMP3 MEM3 CMP1 MEM1 CMP2 MEM2 CMP0 MEM0 X X X X Related Information ■ “CMP and Memory Module Device Identifiers” on page 107 ■ “Performing Node Reconfiguration” on page 159 Servicing FB-DIMMs ■ “Remove FB-DIMMs” on page 108 ■ “Install FB-DIMMs” on page 109 ■ “Verify FB-DIMM Replacement” on page 110 ■ “Add FB-DIMMs” on page 113 ■ “FB-DIMM Configuration” on page 113 ■ “FB-DIMM Device Identifiers” on page 116 ■ “FB-DIMM Fault
a. Press the FB-DIMM fault button. See “FB-DIMM Fault Button Locations” on page 117. b. Note which FB-DIMM fault LED is illuminated. 2. Push down on the ejector tabs on each side of the FB-DIMM until the FB-DIMM is released. Caution – FB-DIMMs might be hot. Use caution when servicing FB-DIMMs. 3. Grasp the top corners of the faulty FB-DIMM and remove it from the CMP/memory module. 4. Place the FB-DIMM on an antistatic mat. 5. Repeat Step 2 through Step 4 to remove any additional FB-DIMMs.
■ “Power On the Server” on page 157 ▼ Verify FB-DIMM Replacement 1. Access the ILOM -> prompt. Refer to the Integrated Lights Out Manager 3.0 Supplement for the SPARC Enterprise T5440 Server for instructions. 2. Run the show faulty command to determine how to clear the fault. The method you use to clear a fault depends on how the fault is identified by the showfaults command. Examples: ■ If the fault is a host-detected fault (displays a UUID), continue to Step 3.
Example: -> set /SYS/MB/CPU0/CMP0/BR0/CH0/D0 component_state=Enabled 3. Perform the following steps to verify the repair: a. Set the virtual keyswitch to diag so that POST will run in Service mode. -> set /SYS/keyswitch_state=Diag Set ‘keyswitch_state’ to ‘Diag’ b. Power cycle the system. -> stop /SYS Are you sure you want to stop /SYS (y/n)? y Stopping /SYS -> start /SYS Are you sure you want to start /SYS (y/n)? y Starting /SYS Note – The server takes about one minute to power off.
Note – Depending on the configuration of ILOM POST variables and whether POST-detected faults or not, the system might boot, or the system might remain at the ok prompt. If the system is at the ok prompt, type boot. d. Return the virtual keyswitch to Normal mode. -> set /SYS keyswitch_state=Normal Set ‘ketswitch_state’ to ‘Normal’ e. Switch to the system console and issue the Solaris OS fmadm faulty command. # fmadm faulty No memory faults should be displayed.
▼ Add FB-DIMMs If you are upgrading the system with additional FB-DIMMs, use this procedure. Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Read the sections, “FB-DIMM Configuration” on page 113 and “FB-DIMM Device Identifiers” on page 116. ■ Power off the server using one of the methods described in the section, “Powering Off the System” on page 67.
■ “Memory Bank Configurations” on page 114 Supported FB-DIMM Configurations Use these FB-DIMM configuration rules to help you plan the memory configuration of your server: ■ Up to 16 FB-DIMMs can be installed in each CMP/memory module pair. ■ Each bank consists of four FB-DIMMs. ■ Each bank must be populated completely, never partially. ■ For each CPU/Memory module pair, all FB-DIMMs must be of the same capacity, either 2 GB, 4 GB or 8 GB per FB-DIMM. ■ Memory bank 0 must always be populated.
Bank 1, if filled, must contain FB-DIMMs of the same capacity as Bank 0 Banks 2 and 3 must be either completely empty or completely filled. If filled, they must have the FB-DIMMs of the same capacity as Banks 0 and 1 CPU Module (Includes Memory Bank 0) Memory Module (Includes Memory Banks 1, 2 and 3.
Note – See “FB-DIMM Device Identifiers” on page 116 for a list of FB-DIMM device identifiers and the corresponding slots on the CMP/memory modules. Related Information ■ “Managing Faults” on page 11 ■ “FB-DIMM Device Identifiers” on page 116 ■ “FB-DIMM Fault Button Locations” on page 117 ■ “Performing Node Reconfiguration” on page 159 FB-DIMM Device Identifiers These are the device and device identifiers for FB-DIMMs on a CMP and memory module pair. Device identifiers are case-sensitive.
Location Connector Number FB-DIMM Device Identifiers /SYS/MB/MEMx/CMPx/BR0/CH1/D 1 /SYS/MB/MEMx/CMPx/BR0/CH0/D 1 /SYS/MB/MEMx/CMPx/BR1/CH1/D 1 /SYS/MB/MEMx/CMPx/BR1/CH0/D 1 Motherboard connector J746 J511 J1344 J927 FB-DIMM Group Bank 1 The FB-DIMM address follows the same convention as the CMP or memory module upon which it is mounted. For example, /SYS/MB/CPU0/CMP0/BR1/CH0/D0 is the device identifier for the FB-DIMM mounted at J792 on CMP module 0.
Related Information 118 ■ “Managing Faults” on page 11 ■ “FB-DIMM Configuration” on page 113 ■ “FB-DIMM Device Identifiers” on page 116 SPARC Enterprise T5440 Server Service Manual • June 2011
Servicing Field-Replaceable Units These topics describe how to service field-replaceable units (FRUs) in the server. Note – The procedures in this chapter must be performed by a qualified service technician. Topic Links Remove and install field-replaceable components.
Related Information ■ “Servicing the DVD-ROM Drive” on page 122 ▼ Remove the Front Bezel Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ If you are performing additional service procedures, power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
Note – Avoid bending the bezel by gradually pulling it from the middle and both ends simultaneously. ▼ Install the Front Bezel 1. Align the bezel with the chassis front panel. 2. Press the bezel onto the front panel. The bezel is oriented with four guide pins, and is secured with three snap-in posts. Next Steps ■ “Slide the Server Into the Rack” on page 155 ■ If you performed any additional service procedures, see “Power On the Server” on page 157.
Servicing the DVD-ROM Drive You must remove the front bezel before servicing the DVD-ROM drive. ■ “Remove the DVD-ROM Drive” on page 122 ■ “Install the DVD-ROM Drive” on page 123 Related Information ■ “Servicing the Front Bezel” on page 119 ▼ Remove the DVD-ROM Drive Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server using one of the methods described in the section, “Powering Off the System” on page 67.
4. Slide the DVD-ROM drive out of the chassis. ▼ Install the DVD-ROM Drive 1. Slide the DVD-ROM drive into its bay. 2. Connect the DVD-ROM drive to the flex cable assembly. 3. Install the flex cable retainer. Place the retainer into position and tighten the captive No. 2 Phillips screw.
■ “Power On the Server” on page 157 Servicing the Service Processor The service processor module contains the service processor firmware, IDPROM, and system battery. ■ “Remove the Service Processor” on page 124 ■ “Install the Service Processor” on page 126 Related Information ■ “Servicing the IDPROM” on page 127 ■ “Servicing the Battery” on page 129 ▼ Remove the Service Processor Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63.
3. Lift the service processor up and out of the system. 4. Place the service processor on an antistatic mat. Next Steps If you are replacing a faulty service processor, you must install the IDPROM onto the new service processor. Do the following: ■ Remove the IDPROM from the old service processor. See “Remove the IDPROM” on page 127.
■ Install the IDPROM onto the new service processor. See “Install the IDPROM” on page 128. ▼ Install the Service Processor 1. Ensure that the power cords are disconnected from the system. 2. Lower the service processor into position. Ensure that the service processor is oriented correctly over the motherboard connector and the two snap-on standoffs. 3. Press down evenly to plug the service processor into the motherboard. 4. Secure the service processor with the two captive No.2 Phillips screws.
■ “Slide the Server Into the Rack” on page 155 ■ “Connect the Power Cords to the Server” on page 157 ■ “Power On the Server” on page 157 Servicing the IDPROM The IDPROM stores system parameters, such as host ID and MAC address, ILOM configuration settings, and OpenBoot PROM configuration settings. If you are replacing a faulty service processor, you must move the IDPROM from the old service processor to the new one.
2. Place the IDPROM on an antistatic mat. ▼ Install the IDPROM Before you begin, complete these tasks: 128 ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
● Plug the IDPROM into its connector on the service processor. Ensure that the service processor is oriented correctly. A notch on the IDPROM corresponds to a similar notch on the connector. Servicing the Battery The battery provides the power necessary to maintain system configuration parameters during power outages, or while the system is being serviced, stored or relocated.
▼ Install the Battery 1. Place the battery into its holder on the service processor board. Ensure that the battery is oriented correctly. 2. Press the battery firmly until it snaps into place.
Note – You must remove all four power supplies from the system. ■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 73 ■ “Remove the Top Cover” on page 73 Do the following: 1. Remove the flex cable retainer. Loosen the captive No. 2 Phillips screw and lift the retainer up and out of the chassis. 2. Unplug the flex cable from the power distribution board. 3. Unplug the auxiliary power cable from the power distribution board. 4. Remove the No. 2 Phillips screw. 5.
6. Slide the power distribution board up and out of the chassis. ▼ Install the Power Distribution Board 1. Align the keyholes in the power distribution board with the corresponding mushroom standoffs in the chassis. 2. Lower the power distribution board into the chassis. 3. Install the No. 2 Phillips screw. 4. Install the two 7 mm nuts securing the bus bars to the power distribution board. 5. Plug in the flex cable connector. Ensure that the auxiliary power cable is routed under the flex cable connector.
■ “Power On the Server” on page 157 Servicing the Fan Tray Carriage You must remove the fan tray carriage in order to service the following components: ■ Hard drive backplane ■ Motherboard ■ Front control panel ■ Front I/O board This topic includes the following: ■ “Remove the Fan Tray Carriage” on page 133 ■ “Install the Fan Tray Carriage” on page 134 Related Information ■ “Servicing Fan Trays” on page 84 ■ “Servicing the Hard Drive Backplane” on page 135 ■ “Servicing the Motherboard” on
■ “Remove a CMP/Memory Module” on page 104 Note – You must remove all CPU modules and memory modules from the system. Do the following: 1. Remove the nine No. 1 Phillips screws securing the fan tray carriage to the top of the chassis. 2. Loosen the seven captive No. 2 Phillips securing the bottom of the fan tray carriage to the motherboard assembly. 3. Lift the fan tray carriage up and out of the system. ▼ Install the Fan Tray Carriage 1. Lower the fan tray carriage into the system.
2. Secure the seven captive No. 2 Phillips screws 3. Install the nine No. 1Phillips screws. Next Steps ■ “Install a Fan Tray” on page 88 Note – Install all four fan trays. ■ “Install the Top Cover” on page 154 ■ “Slide the Server Into the Rack” on page 155 ■ “Power On the Server” on page 157 Servicing the Hard Drive Backplane The hard drive backplane provides the power and data interconnect to the internal hard drives.
Related Information ■ “Servicing Hard Drives” on page 76 ■ “Servicing the Fan Tray Carriage” on page 133 ▼ Remove the Hard Drive Backplane Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
4. Lift the backplane up and out of the system. ▼ Install the Hard Drive Backplane 1. Lower the hard drive backplane into the system. Align the tab on the lower edge the backplane with the corresponding slot in the chassis floor.
2. Tighten the three captive No. 2 Phillips screws. 3. Plug the cable into its connector on the backplane. 4. Install the flex cable retainer. Place the retainer into position and tighten the captive No.2 Phillips screw. Next Steps ■ “Install the Fan Tray Carriage” on page 134 ■ “Install a Fan Tray” on page 88 ■ “Install a CMP/Memory Module” on page 105 ■ “Install the Top Cover” on page 154 ■ “Install a Hard Drive” on page 82 Note – You must install the hard drives in the correct slots.
Servicing the Motherboard Note – If you are replacing faulty motherboard, you must set diag_mode to normal or off before performing this procedure.
■ “Remove the Service Processor” on page 124 ■ “Remove a CMP/Memory Module” on page 104 Note – You must remove all CMP and memory modules. ■ “Remove a Fan Tray” on page 87 Note – You must remove all four fan trays. ■ “Remove the Fan Tray Carriage” on page 133 1. Remove the CMP/memory module bracket. The bracket is secured with six captive No. 2 Phillips screws. See “Motherboard Fastener Locations” on page 143.
2. Remove the flex cable retainer. Loosen the captive No.2 Phillips screw and lift the retainer up and out of the chassis. 3. Unplug the flex cable from J9801 on the motherboard. 4. Unplug the auxiliary power cable from J9803 on the motherboard. 5. Unplug the front I/O connector from J9901 on the motherboard. 6. Remove the six No. 2 Phillips screws that secure the bus bar assembly to the motherboard. 7. Slide the chassis midwall panel up.
▼ Install the Motherboard 1. Ensure that all 14 captive screws in the motherboard are retracted. 2. Lower the motherboard down into the chassis. Guide the flex cable connector through the midwall partition. 3. Secure the No. 2 captive Phillips screws. Ensure that all fasteners are secured. (See “Motherboard Fastener Locations” on page 143.) 4. Lower and secure the midwall partition. 5. Install the six No. 2 Phillips screws that secure the bus bar assembly to the motherboard. 6.
Next Steps ■ “Install the Fan Tray Carriage” on page 134 ■ “Install a Fan Tray” on page 88 Note – Install all four fan trays. ■ “Install a CMP/Memory Module” on page 105 Note – Install all CMP and memory modules.
Related Information ■ “Servicing the Motherboard” on page 139 Servicing the Flex Cable Assembly The flex cable assembly provides the power and data connection between the power supply backplane, hard drive backplane, and motherboard.
Related Information ■ “Safety Information” on page 63 ■ “Servicing Power Supplies” on page 89 ■ “Servicing the Power Distribution Board” on page 130 ■ “Servicing the Hard Drive Backplane” on page 135 ■ “Servicing the Motherboard” on page 139 ▼ Remove the Flex Cable Assembly Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63. ■ Power off the server, using one of the methods described in the section, “Powering Off the System” on page 67.
3. Unplug the flex cable-to-power supply backplane connection. 4. Unplug the flex cable-to-hard drive backplane connection. 5. Unplug the flex cable-to-DVD-ROM drive connection. 6. Unplug the flex cable-to-motherboard connection. 7. Lift the flex cable up and out of the system. ▼ Install the Flex Cable Assembly 1. Ensure the power cables are unplugged. 2. Plug in the motherboard connector. 3. Plug in the hard drive backplane connector. 4. Plug in the DVD-ROM drive connector. 5.
6. Install the flex cable retainer. Place the retainer into position and tighten the captive No. 2 Phillips screw. 7. Plug in the power cables.
Servicing the Front Control Panel The front control panel contains system status LEDs and the Power button. This topic includes the following: ■ “Remove the Front Control Panel” on page 148 ■ “Install the Front Control Panel” on page 149 Related Information ■ “Infrastructure Boards and Cables” on page 1 ■ “Front Panel Diagram” on page 3 ■ “Front Panel LEDs” on page 5 ▼ Remove the Front Control Panel Before you begin, complete these tasks: ■ Read the section, “Safety Information” on page 63.
4. Lift the front control panel up and out of the system. 5. Place the front control panel on an antistatic mat. ▼ Install the Front Control Panel 1. Lower the front control panel into the system.
2. Install the two No. 2 Phillips screws. 3. Plug the front control panel connector into the front I/O board. 4. Plug the front control panel connector into J9901 on the motherboard.
■ “Remove the Server From the Rack” on page 71 ■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 73 ■ “Remove the Top Cover” on page 73 ■ “Remove a Fan Tray” on page 87 ■ “Remove the Fan Tray Carriage” on page 133 1. Unplug the front control panel cable from J9901 on the motherboard. 2. Unplug the front control panel cable from the front I/O board. 3. Remove the two No. 2 Phillips screws. 4. Lift the front I/O board up and out of the system. 5.
Next Steps ■ “Install the Fan Tray Carriage” on page 134 152 ■ “Install a Fan Tray” on page 88 ■ “Install the Top Cover” on page 154 ■ “Install the Server Into the Rack” on page 154 ■ “Connect the Power Cords to the Server” on page 157 ■ “Power On the Server” on page 157 SPARC Enterprise T5440 Server Service Manual • June 2011
Returning the Server to Operation These topics describe how to return the server to operation after you have performed service procedures. Caution – Never attempt to run the server with the cover removed. Hazardous voltage is present. Caution – Equipment damage could occur if you run the server with the cover removed. The cover must be in place for proper air flow. Description Links Install the top cover after servicing internal components.
▼ Install the Top Cover If you removed the top cover, perform the steps in this procedure. Note – If removing the top cover caused an emergency shutdown, you must install the top cover and use the poweron command to restart the system. See “Power On the Server” on page 157. 1. Place the top cover on the chassis. Set the cover down so that it hangs over the rear of the server by about an inch (25.4 mm). 2. Slide the top cover forward until it seats. 3.
2. Lift the server up and insert the inner rails into the inner slide assemblies. Ensure that the inner rails are horizontal when the inner rails enter the inner slide assemblies. 3. Ensure that the inner rails are engaged with the ball-bearing retainers on both inner slide assemblies. Note – If necessary, support the server with the mechanical lift while aligning the inner rails parallel to the rack-mounted inner slide assemblies. ▼ Slide the Server Into the Rack 1.
FIGURE: Sliding the server into the rack. Figure Legend 1 Inner rail release button 2 Slide rail lock 2. While pushing on the release buttons, slowly push the server into the rack. Ensure that the cables do not get in the way. 3. If necessary, re-attach the CMA. a. Attach the CMA support strut to the inner glide. b. Attach the CMA to the inner glide. Slide the hinge plate into the end of the outer rail until the retaining pin snaps into place. 4. Reconnect the cables to the back of the server.
▼ Connect the Power Cords to the Server ● Reconnect both power cords to the power supplies. Note – As soon as the power cords are connected, standby power is applied. Depending on the configuration of the firmware, the system might boot. See the SPARC Enterprise T5440 Server Administration Guide for configuration and power-on information.
158 SPARC Enterprise T5440 Server Service Manual • June 2011
Performing Node Reconfiguration If a CMP/memory module pair develops a fault, the server can be reconfigured to run in a degraded state until the CMP/memory module is replaced. In addition, you can add CMP/memory module pairs to existing systems. However, adding or removing CMP/memory modules might affect internal hardware device addresses, as well as the device address of any external devices attached to the system such as external disk arrays and devices attached via an External I/O Expansion Unit.
■ “Servicing CMP/Memory Modules” on page 102 ■ “Servicing FB-DIMMs” on page 108 I/O Connections to CMP/Memory Modules Each PCIe slot and onboard I/O device is connected to one CMP module. Device address is dependent on system configuration. See CMP Number on page 168 and CMP Number on page 169 for more information. If a CMP module fails, the onboard devices and slots directly connected to it become unavailable. Recovery of the I/O services connected to the failed CMP requires I/O node reconfiguration.
Recovering From a Failed CMP/Memory Module This topic includes the following: ■ “Options for Recovering From a Failed CMP/Memory Module” on page 161 ■ “Reconfiguring I/O Device Nodes” on page 162 ■ “Reset the LDoms Guest Configuration” on page 166 Options for Recovering From a Failed CMP/Memory Module If your system experiences a complete CMP/memory module failure, do one of the following: 1. Replace the failed CMP/memory module. 2.
■ “Servicing FB-DIMMs” on page 108 ■ “I/O Connections to CMP/Memory Modules” on page 160 ■ “Reconfiguring I/O Device Nodes” on page 162 ■ “System Bus Topology” on page 167 ■ “I/O Fabric in 2P Configuration” on page 168 ■ “I/O Fabric in 4P Configuration” on page 169 Reconfiguring I/O Device Nodes This topic includes the following: ■ “Options for Reconfiguring I/O Device Nodes” on page 162 ■ “Reconfigure the I/O and PCIe Fabric” on page 163 ■ “Temporarily Disable All Memory Modules” on page 1
▼ Reconfigure the I/O and PCIe Fabric The reconfig.pl script reconfigures the PCIe fabric to reconnect the PCIe slots and onboard devices to the CMP nodes as efficiently as possible. The reconfig.pl script also reconfigures the Solaris device names to match the new connections between the CMP modules and the PCIe devices and slots. Use the reconfig.pl script to reattach each PCIe slot and onboard device to its nearest available CMP module. To use the reconfig.
9. Do one of the following: ■ If your Jumpstart server is exporting Solaris 10 8/07 or Solaris 10 5/08, type: # cp /reconfig.pl . ■ If your Jumpstart server is exporting Solaris 10 10/08, type: # cp /cdrom/Solaris_10/Tools/Boot/reconfig.pl . 10. Run the reconfig.pl script. Type: # /mnt/reconfig.pl 11. Halt the system. Type: # halt 12. Power off the system.
To remotely disable all FB-DIMMs in the system, do the following: 1. Halt the Solaris OS. 2. Power off the system. 3. Disable each FB-DIMM. sc> disablecomponent /SYS/MB/MEMx/CMPx/BR0/CH0/D1 sc> disablecomponent /SYS/MB/MEMx/CMPx/BR0/CH0/D2 ... sc> disablecomponent /SYS/MB/MEMx/CMPx/BR1/CH1/D3 where x is the memory module to be disabled. The following example shows how to disable all the FB-DIMMs on MEM1.
where x is the CMP/memory module to be enabled. The following example shows how to enable all the FB-DIMMs on MEM1.
System Bus Topology Related Information ■ “I/O Fabric in 2P Configuration” on page 168 ■ “I/O Fabric in 4P Configuration” on page 169 Performing Node Reconfiguration 167
I/O Fabric in 2P Configuration CMP Number Devices Controlled CMP0 Onboard disk drives Onboard USB ports Onboard DVD drive PCIe0 PCIe1 PCIe2 PCIe3 CMP1 Onboard Gbit or 10-Gbit network PCIe4 PCIe5 PCIe6 PCIe7 Related Information 168 ■ “System Bus Topology” on page 167 ■ “I/O Fabric in 4P Configuration” on page 169 SPARC Enterprise T5440 Server Service Manual • June 2011
I/O Fabric in 4P Configuration CMP Number Devices Controlled CMP0 Onboard disk drives Onboard USB ports Onboard DVD drive PCIe0 PCIe1 CMP1 Onboard Gbit or 10-Gbit network PCIe4 PCIe5 CMP2 PCIe2 PCIe3 CMP3 PCIe6 PCIe7 Related Information ■ “System Bus Topology” on page 167 ■ “I/O Fabric in 2P Configuration” on page 168 Performing Node Reconfiguration 169
170 SPARC Enterprise T5440 Server Service Manual • June 2011
Identifying Connector Pinouts This section provides reference information about the system back panel ports and pin assignments.
Serial Management Port Connector Pinouts The serial management connector (labeled SERIAL MGT) is an RJ-45 connector located on the back panel. This port is the default connection to the system console.
Network Management Port Connector Pinouts The network management connector (labeled NET MGT) is an RJ-45 connector located on the motherboard and can be accessed from the back panel. This port needs to be configured prior to use.
Serial Port Connector Pinouts The serial port connector (TTYA) is a DB-9 connector that can be accessed from the back panel.
USB Connector Pinouts Two Universal Serial Bus (USB) ports are located on the motherboard in a double-stacked layout and can be accessed from the back panel. Two additional USB ports are located on the front panel.
Gigabit Ethernet Connector Pinouts Four RJ-45 Gigabit Ethernet connectors (NET0, NET1, NET2, NET3) are located on the system motherboard and can be accessed from the back panel. The Ethernet interfaces operate at 10 Mbit/sec, 100 Mbit/sec, and 1000 Mbit/sec.
Server Components These topics provide illustrations depicting components of SPARC Enterprise T5440 server. Description Links A diagram and list of customer replaceable units (CRUs) “Customer-Replaceable Units” on page 178 A diagram and list of components that only field service personnel can replace.
Customer-Replaceable Units FIGURE: Customer-Replaceable Units (CRUs) Figure Legend 1 CMP modules 5 Front bezel 2 Memory modules 6 Hard drives 3 Fan trays 7 Power supplies 4 Removable media drive 8 178 SPARC Enterprise T5440 Server Service Manual • June 2011
Related Information ■ “Hot-Pluggable and Hot-Swappable Devices” on page 75 ■ “Servicing Hard Drives” on page 76 ■ “Servicing Fan Trays” on page 84 ■ “Servicing Power Supplies” on page 89 ■ “Servicing CMP/Memory Modules” on page 102 ■ “Servicing FB-DIMMs” on page 108 ■ “Servicing the Front Bezel” on page 119 ■ “Servicing the DVD-ROM Drive” on page 122 Server Components 179
Field-Replaceable Units FIGURE: Field-Replaceable Units (FRUs) Figure Legend 180 1 CMP/memory module bracket 4 Power supply backplane 2 Fan cage 5 Flex cable assembly 3 Hard drive backplane 6 Auxiliary power cable SPARC Enterprise T5440 Server Service Manual • June 2011
FIGURE: Field Replaceable Units (FRUs) (Motherboard and Auxiliary Boards) Figure Legend 1 IDPROM 4 Motherboard 2 Front Control Panel 5 Battery 3 Front I/O Board 6 Service Processor Related Information ■ “Servicing the Service Processor” on page 124 ■ “Servicing the IDPROM” on page 127 ■ “Servicing the Battery” on page 129 ■ “Servicing the Power Distribution Board” on page 130 Server Components 181
182 ■ “Servicing the Fan Tray Carriage” on page 133 ■ “Servicing the Hard Drive Backplane” on page 135 ■ “Servicing the Motherboard” on page 139 ■ “Servicing the Flex Cable Assembly” on page 144 ■ “Servicing the Front Control Panel” on page 148 ■ “Servicing the Front I/O Board” on page 150 SPARC Enterprise T5440 Server Service Manual • June 2011
Index Numerics 3.
consolehistory command, 59 D DC OK (power supply LED), 95 device identifiers CMP/memory modules, 107 fan tray, 88 FB-DIMMs, 116 hard drive, 83 PCIe card, 99 power supply, 94 diag_level parameter, 30, 62 diag_mode parameter, 30, 62 diag_trigger parameter, 30, 62 diag_verbosity parameter, 30, 62 diagnostics about, 12 flowchart, 14 low level, 22 running remotely, 18 using Oracle VTS, 21 disablecomponent command, 57 displaying FRU status, 28 dmesg command, 39 DVD-ROM drive installing, 123 removing, 122 E ejec
example POST error output, 46 fault handling, 24 installing, 109 managing faults in, 110 re-enabling to run system in degraded state, 165 removing, 108 troubleshooting, 24 verifying successful replacement, 110 flex cable assembly installing, 146 removing, 145 fmadm command, 54, 112 fmdump command, 49 front bezel installing, 121 removing, 120 front control panel installing, 149 removing, 148 front I/O board installing, 151 removing, 150 front panel diagram, 3 FRU event ID, 49 FRU ID PROMs, 18 FRU information
hard drive, 79, 82 hard drive backplane, 137 IDPROM, 128 motherboard, 142 PCIe card, 97 power distribution board, 132 power supply, 91, 93 service processor, 126 top cover, 154 Integrated Lights Out Manager and fault detection in External I/O Expansion Unit, 18 iostat -E command, 81 L latch power supply, 91, 93 slide rail, 70 LED AC Present (power supply LED), 15, 95 DC OK (power supply LED), 95 Fan Fault (system LED), 35 Fault (fan tray LED), 35 Fault (hard drive LED), 35 Fault (power supply LED), 35, 90,
installing, 97 removing, 96 PCIe fabric reconfiguration, 163 pinouts Gigabit Ethernet ports, 176 network management port, 173 serial management port, 172 serial port (DB-9), 174 USB ports, 175 power cords plugging into server, 157 unplugging before servicing the system, 65 power distribution board about, 2 installing, 132 removing, 130 power off, 68 Power OK (system LED), 15 power supply about, 90 AC Present LED, 15, 95 DC OK LED, 95 device identifiers, 94 Fault LED, 35, 90, 95 hot-swapping, 91, 93 installi
service processor, 124 reset command, 60 reset, system using ILOM, 32 using POST commands, 32 resetsc command, 60 S safety information, 63 safety symbols, 64 sanity check for hardware components, 22 SCC module and host ID, 2 and MAC addresses, 2 serial management port pinouts, 172 serial number, chassis, 66 serial port (DB-9) pinouts, 174 service processor installing, 126 removing, 124 Service Required (system LED), 35 about, 5 cleared by enablecomponent command, 53 interpreting to diagnose faults, 35 trig
Power OK LED state, 15 using LEDs, 34 using Oracle VTS, 15 using POST, 15, 17 using the show faulty command, 15 U UltraSPARC T2+ multicore processor, 21 Universal Unique Identifier (UUID), 21, 49 USB ports front, 3 pinouts, 175 V virtual keyswitch, 31, 111 X XAUI card about, 1 configuration guidelines, See PCIe configuration guidelines installing, See PCIe card, installing Index 189
190 SPARC Enterprise T5440 Server Service Manual • June 2011