VAX 4000 Model 200 Troubleshooting and Diagnostics Order Number EK–437AB–TS–002 Digital Equipment Corporation Maynard, Massachusetts
First Printing, December 1990 Revised, June 1991 The information in this document is subject to change without notice and should not be construed as a commitment by Digital Equipment Corporation. Digital Equipment Corporation assumes no responsibility for any errors that may appear in this document. The software, if any, described in this document is furnished under a license and may be used or copied only in accordance with the terms of such license.
Contents vii Preface Chapter 1 Troubleshooting During Power-Up 1.1 Autobooting the System Software . . . . . . . . . . . . . . . . . . . . 1.2 Troubleshooting Power-Up Problems . . . . . . . . . . . . . . . . . . 1.2.1 Problems During Self-Tests . . . . . . . . . . . . . . . . . . . . . . . 1.2.2 General Problems During Boot Sequence . . . . . . . . . . . . . 1.2.3 Problems Booting from an RF-Series Integrated Storage Element (ISE) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.
3.3 Starting MDM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3.1 Tape Drive Instructions . . . . . . . . . . . . . . . . . . . . . . . . . 3.3.1.1 Booting MDM Manually . . . . . . . . . . . . . . . . . . . . . . . 3.3.1.2 Booting MDM Automatically . . . . . . . . . . . . . . . . . . . 3.3.2 RRD-Series Disc Instructions . . . . . . . . . . . . . . . . . . . . . 3.4 MDM Introductory Display . . . . . . . . . . . . . . . . . . . . . . . . 3.5 Main Menu Options . . . . . . . . . . . .
A–6 A–7 TK50 and TK70 Tape Drives . . . . . . . . . . . . . . . . . . . . . . . . . A–13 TF85 and TLZ04 Tape Drives . . . . . . . . . . . . . . . . . . . . . . . .
Preface Troubleshooting is the process of isolating and diagnosing problems. When your system does not operate as described in your Operation manual, use the information in this manual to isolate and diagnose the problem. This manual contains three chapters and one appendix: • Chapter 1 lists problems you may experience at power-up and provides corrective actions. • Chapter 2 lists problems you may experience during normal operation and provides corrective actions.
Chapter 1 Troubleshooting During Power-Up After you turn on your system, the processor performs a series of self-tests and startup routines. After successful completion of the self-tests, if the Break Enable/Disable switch is set to disable (down), your system autoboots system software. 1.1 Autobooting the System Software Your system boots automatically (autoboots) after you enter the SET BOOT device-name command for a device or string of devices (see your Operation manual).
Problem DC OK indicator not lit. System has power; LED on CPU cover panel counting down; nothing displays on console terminal. Possible Cause Corrective Action Power switch tripped to 0. (BA215: Circuit breaker tripped.) Wait 1 minute, then set Power switch to 1. (BA215: Set Power switch to 0, push Reset button, and set Power switch to 1.) If problem persists, call your Digital service representative. Power cable incorrectly installed. Set Power switch to 0. Check that cable is seated in socket.
Problem Possible Cause Corrective Action System has power; LED counting down; DC OK indicator lit; nothing displays on console terminal. Problem with CPU. Call your Digital service representative. System has power; DC OK indicator lit; nothing displays on console terminal; LED on CPU cover panel displays E or F. Problem with CPU. Call your Digital service representative. Self-tests halt; error message or error summary displays on console terminal. System detected error while running self-tests.
Problem Possible Cause Corrective Action Countdown does not continue from 2 through 0; Break Enable/Disable switch set to disable (down). System cannot load software from disk drive, tape drive, or Ethernet. See actions listed in subsequent sections for boot device. 1.2.3 Problems Booting from an RF-Series Integrated Storage Element (ISE) Problem Possible Cause Countdown continues System from 2 through 0; con- write-protected.
Problem Possible Cause Corrective Action System will boot from fixed disk if it is on line. Place fixed disk off line. Tape not bootable. Use tape containing bootstrap program to start system software. Tape worn or damaged. Try another tape cartridge. Problem with adapter module or tape drive controller. Call your Digital service representative.
Chapter 2 Troubleshooting During Normal Operation Problems that occur during normal operation of your system may result from a problem in the system, faulty settings, or incorrect procedures. The following sections list problems, possible causes, and corrective actions. If the action(s) listed do not solve the problem, call your Digital service representative.
Problem System loses ac power; DC OK indicator not lit. Possible Cause Corrective Action Power cable incorrectly installed. Set Power switch to 0. Check that cable is seated in socket. Set Power switch to 1. Power supply failed. Turn on system again. If unsuccessful, turn off system and call your Digital service representative. System loses ac power; Current surge. Power switch at 0 (BA215: Circuit breaker tripped).
2.2 RF-Series Integrated Storage Element (ISE) Problems Problem Possible Cause Corrective Action Write error message displays; Write-Protect button glows orange. ISE write-protected. Press and release Write-Protect button (not lit). Fault indicator lit or blinking. Bus node ID plug not installed. Install that plug. Two or more devices have same node ID on same bus. Make sure all devices and controllers /adapters on same bus have unique IDs. Problem with adapter module or ISE controller.
Problem Possible Cause Corrective Action Four lights blinking. Failed self-test detected hard error. Green light on; yellow light not lit. Cartridge load error. Try another cartridge. If problem persists, call your Digital service representative. Cartridge stuck in drive; tape on takeup reel. Hard drive error. Press Unload button. If problem persists, call your Digital service representative. Cartridge does not load. Mispositioned leader. Try another cartridge. or Push Unload button.
Problem Passes self-test; not work. does Possible Cause Corrective Action Problem in controller or connection between drive and controller. Call your Digital service representative. 2.5 TK70 Tape Drive Problems Problem Possible Cause Corrective Action Green light blinking after tape insertion. Tape leader defective. Pull handle open and remove cartridge. Use another cartridge. Orange, yellow, and green lights blinking. Problem with drive. Press Unload button once.
2.6 TLZ04 Tape Drive Problems Problem Possible Cause Corrective Action Unable to copy to tape; tape indicator yellow. Cassette write-protected. Set cassette Write-Protect tab to writeenable. Unable to copy to tape; drive indicator green. No cassette in drive. Insert cassette into drive. Unable to copy to or from tape; tape and drive indicators not lit. Problem with adapter module or drive controller. Run MDM as described in Chapter 3 and call your Digital service representative.
Chapter 3 Running the MicroVAX Diagnostic Monitor (MDM) The MicroVAX Diagnostic Monitor (MDM) is an optional software package containing diagnostic tests that isolate and identify faults in your system. MDM also lets you display your system configuration and test how devices work together. MDM is on tape, in a cartridge labeled MV DIAG CUST TK70. MDM operating instructions begin in Section 3.2.
• MDM checks terminal interfaces (but not terminals) by means of feedback connectors. If your devices pass the tests but you still experience problems, call your Digital service representative for further testing. If you require more complete diagnostic testing, purchase the MicroVAX Maintenance Kit. That kit includes a system maintenance guide and the service diagnostic tests. The MDM version you receive with your system is a subset of the service version.
3.2.2 Preparing to Run MDM on an RF-Series Integrated Storage Element (ISE) or Hard Disk You can install MDM on an RF-series ISE or hard-disk drive by using the MDM Hard Disk Kit. Installation of that kit requires your completion of the Diagnostic Software Installation Acknowledgment. See the MicroVAX Diagnostic Monitor Hard Disk User’s Guide for licensing requirements and installation instructions. 3.2.
2. Set the Break Enable/Disable switch on the host to be tested to enable (up). 3. Insert a blank tape cartridge into the tape drive and lock it in place. 4. Restart the host to be tested. 5. Enter one of these commands, where n is the unit number of the disk containing the Hard Disk Kit and x is the controller port letter. BOOT/100 DIAn (CPU-based DSSI) BOOT/100 DUxn (KFQSA adapter or KDA50 controller) The system prompts for the boot file name by displaying Bootfile: 6. Enter one of these file names.
3. Insert the tape cartridge containing MDM into the tape drive and lock it in place. 4. Restart the host to be tested. 5. After the self-test countdown completes and the >>> prompt displays, enter the command BOOT MUx0, where x is the controller port letter, to load MDM from the tape. 6. Run MDM as described in Section 3.3. 7. After you complete the tests on the first host: a.
• You can boot MDM manually from a TF85, TK50 or TK70 tape drive (Section 3.3.1.1). • You can use the autoboot feature to boot MDM only from a TK50 or TK70 tape drive (Section 3.3.1.2). NOTE: If your system is part of a local area network (LAN), you can reduce the time required to load MDM by obtaining the MicroVAX Ethernet Server Customer Diagnostics Kit.
3.3.1.2 Booting MDM Automatically Use this procedure to boot MDM from a TK50 or TK70 tape drive. Do not use this procedure for a TF85 tape drive. 1. If software is installed on your system: a. Warn all users to log off. b. Perform system shutdown as described in your software manuals. 2. Enter the command SET BOOT MUx0, where x is the controller port letter. 3. Make sure the Write-Protect switch on the tape cartridge is in the writeprotect position. 4.
5. Turn on your system (and the disc drive). 6. Insert the disc caddy into the disc drive: a. Remove the disc from its package. b. Insert the disc into the caddy. c. Insert the caddy into the drive slot: For an RRD40 drive, push the caddy in until it stops. Then withdraw the caddy. The disc remains in the drive. For another RRD-series drive, the caddy stays in the drive. Do not remove it. 7.
3.5 Main Menu Options The Main Menu provides six options as shown in Figure 3–1. Figure 3–1: Main Menu MAIN MENU Release nnn Version xx.xx 1 - Test the System 2 - Display System Configuration and Devices 3 - Display the System Utilities Menu 4 - Display the Service Menu 5 - Display the Connect/Ignore Menu 6 - Select Single Device Tests Type the number; then press the RETURN key. NOTE: The MDM release and version numbers are represented by nnn and xx.xx in the displays provided throughout this chapter.
in each name shows the difference. For example, DH–CX0A indicates one option, DH–CX0B a second, and so forth. If a device fails the test, you receive a failure message which identifies: The device tested When the failure occurred The field-replaceable unit (FRU) Copy the failure message and report it to your Digital service representative. Figure 3–2 shows an example of an unsuccessful test. Figure 3–2: Example of an Unsuccessful Test BEGIN FUNCTIONAL TEST Device Result SGCA FAILURE DETECTED ...........
3.5.2 Display System Configuration and Devices Option The Display System Configuration and Devices option identifies devices recognized by MDM. After you select this option, the diagnostics are prepared for testing. If this is the first option you selected, the diagnostics are loaded. Loading takes several minutes. After loading completes, you are prompted to press Return . After you press Return , the configuration is displayed. Figure 3–3 shows a system configuration and devices display.
• KDA50 — Unit number, drive type, controller revision number • PDIA — DSSI adapter, embedded in KA660 CPU • DESQA — Ethernet station address • SGCA — On-board Ethernet controller, embedded in KA660 CPU • KZQSA — Adapter for TLZ04 tape drive or any other supported Q-bus to SCSI device • Communication devices — Type of device and whether it has modem control In addition to showing information about testable device options, MDM displays messages indicating the presence of nontestable system device
3.5.3 Display the System Utilities Menu Option The Display the System Utilities Menu option displays the System Utilities Menu. If system utilities in addition to the IOADDRES function are available on your system, they are listed on that menu. After you select this option, the diagnostics are prepared for testing. If this is the first option you selected, the diagnostics are loaded. Loading takes several minutes. After loading completes, you are prompted to press Return .
3.5.3.2 Update Drive Unit Number for RRD-Series Function The Update Drive Unit Number function lets you update the unit number for an RRD-series disc drive connected to a KRQ50 controller. Refer to the appropriate RRD-series disc drive owner’s manual for instructions. 3.5.4 Display the Service Menu Option The Display the Service Menu option is available only if you purchased the MicroVAX Maintenance Kit. That kit contains service diagnostics and the system maintenance guide.
Figure 3–5: Select Single Device Test Menu MAIN MENU SELECT SINGLE DEVICE TEST Release nnn Version xx.xx Select the device number to be tested. The functional tests will be run followed by the exercisers for 4 minutes. 1 2 3 4 5 - CPUA - MicroVAX CPU MEMA - MicroVAX memory system PDIA - SHAC DSSI Subsystem SGCA - Second Generation Ethernet Controller No Dg TKA - Diagnostic not loaded. Type the number; then press the RETURN key, or type 0 and press the RETURN key to return to the Main Menu.
Figure 3–6: Example of a Successful Test BEGIN FUNCTIONAL TEST Device Result CPUA ................... PASSED BEGIN EXERCISER TEST Results are reported at the end of the testing. SINGLE DEVICE TEST PASSED If a device fails the test, you receive a failure message which identifies: The device tested When the failure occurred The field-replaceable unit (FRU) Copy the failure message and report it to your Digital service representative. Figure 3–7 shows an example of an unsuccessful test.
3.6 Exiting MDM Exit MDM in one of the following ways. • Press • Press and then release the Halt button. • Press the Restart (BA215: Restart/Run) button. Break . Remove the tape cartridge as described in your Operation manual. If you ran MDM on a new system, you are ready to install your software. Set the Write-Protect button to write-enable and then follow the instructions in your software manuals. If you ran MDM on a system containing software, you must reboot that software.
Appendix A System Controls and Indicators The illustrations in this appendix show the controls and indicators on your system: Figure A–1: BA430 System Control Panel and Power Supply Figure A–2: BA430 CPU Cover Panel Figure A–3: BA215 CPU Cover Panel Figure A–4: BA215 Power Supply Figure A–5: BA215 Operator Control Panel Figure A–6: TK50 and TK70 Tape Drives Figure A–7: TF85 and TLZ04 Tape Drives System Controls and Indicators A–1
Figure A–1: BA430 System Control Panel and Power Supply System Control Panel Over Temperature Warning Indicator DC OK Indicator Halt Button Restart Button Power Supply Power Switch AC Present Indicator DC OK Indicator Fan Failure Indicator Over Temperature Condition Indicator Power Bus Connectors Power Cable Connector MLO-005564 System Controls and Indicators A–3
Figure A–2: BA430 CPU Cover Panel CPU Cover Panel Break Enable/ Disable Switch Standard Ethernet Connector LED Display Power-Up Mode Switch Modified Modular Jack Ethernet Connector Switch ThinWire Ethernet Connector MLO-005504 System Controls and Indicators A–5
Figure A–3: BA215 CPU Cover Panel LED Display Break Enable/ Disable Switch Power-Up Mode Switch Modified Modular Jack A1 A1 Standard Ethernet Connector ThinWire Ethernet Connector Ethernet Connector Switch MLO-000679 System Controls and Indicators A–7
Figure A–4: BA215 Power Supply DC OK Light Reset Button A1 Power Supply Circuit Breaker MLO-000680 System Controls and Indicators A–9
Figure A–5: BA215 Operator Control Panel ISE Controls and Indicators DRIVE Unit Number 2 1 0 Fault WriteProtect Ready SYSTEM Restart/ Run Halt MLO-005512 System Controls and Indicators A–11
Figure A–6: TK50 and TK70 Tape Drives TK50 TAPE DRIVE Load/Unload Button, Red Light Handle Closed Green Indicator Light TK70 TAPE DRIVE Orange Light Yellow Light Green Light Unload Button Handle MLO-005341 System Controls and Indicators A–13
Figure A–7: TF85 and TLZ04 Tape Drives Yellow Light Orange Light Green Light Bus Node ID Plug To Re Han Op P W Un ait res d e m lo s ov le n t ad his e Lig But Ta to ht pe n Orange Light 85 ad lo Un To H Ha C O Lo nd los Inse and pe Wa it ad le n le e r th th t Ta Lig is is pe ht Ta TF W rit Pr e ot ec te d pe in Us Us e e Cle a Ta nin pe g O pe Ha rat nd e le TF85 TAPE DRIVE Cartridge Insert/ Release Handle TLZ04 TAPE DRIVE Unload Button Tape Indicator Drive Indicator MLO-006361 System C
Index A Autobooting system, 1–1 B Booting power-up problems, 1–3 RF-series integrated storage element problems, 1–4 system, 1–1 tape problems, 1–4 Booting MicroVAX Diagnostic Monitor See MicroVAX Diagnostic Monitor, starting Boot sequence power-up problems, 1–3 C Caddy, disc See RRD-series disc CDROM See RRD-series disc CDROM Kit, MicroVAX Diagnostic Monitor, 3–2 Compact disc See RRD-series disc Connect/Ignore option, MicroVAX Diagnostic Monitor Main Menu, 3–14 Controls and indicators, system, A–1 Disc,
Hard Disk Kit, MicroVAX Diagnostic Monitor, 3–3 I Indicators, system, A–1 Integrated storage element See RF-series integrated storage element Introductory display, MicroVAX Diagnostic Monitor, 3–8 IOADDRES function, MicroVAX Diagnostic Monitor System Utilities Menu, 3–13 ISE See RF-series integrated storage element K KZQSA adapter, booting MicroVAX Diagnostic Monitor, 3–7 L Leaving MicroVAX Diagnostic Monitor, 3–17 Limitations, MicroVAX Diagnostic Monitor, 3–1 M Main Menu options, MicroVAX Diagnostic Mo
MicroVAX Diagnostic Monitor (Cont.
System (Cont.