7026 Enterprise Server Model M80 Model 6M1 Service Guide
7026 Enterprise Server Model M80 Model 6M1 Service Guide
7026 Enterprise Server Model M80 Model 6M1 Service Guide
SA38-0571-01
Second Edition (September 2001)
Before using this information and the product it supports, read the information in “Safety Notices” on page xi,
Appendix A, “Environmental Notices” on page 547, and Appendix B, “Notices” on page 549.
A reader’s comment form is provided at the back of this publication. If the form has been removed, address comments
to Information Development, Department H6DS-905-6C006, 11400 Burnet Road, Austin, Texas 78758-3493. To send
comments electronically, use this commercial internet address: aix6kpub@austin.ibm.com. Any information that you
supply may be used without incurring any obligation to you.
© International Business Machines Corporation, 2000, 2001. All rights reserved.
Note to U.S. Government Users -- Documentation related to restricted rights -- Use, duplication or disclosure is subject
to restrictions set forth is GSA ADP Schedule Contract with IBM Corp.
Contents
Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . xi
Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . xi
Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . xii
Laser Safety Information . . . . . . . . . . . . . . . . . . . . xiii
Laser Compliance . . . . . . . . . . . . . . . . . . . . . . xiii
iii
I/O Drawer Features . . . . . . . . . . . . . . . . . . . . . . 25
I/O Drawer Addressing . . . . . . . . . . . . . . . . . . . . 25
I/O Drawer and Power Supply LED Status. . . . . . . . . . . . . . 26
Secondary I/O Drawer Operator Panel Message Hierarchy . . . . . . . . 27
Primary I/O Drawer Operator Panel Behavior During Power-On. . . . . . . 28
Secondary I/O Drawer Operator Panel Behavior During Power-On. . . . . . 29
Logical and Physical Locations . . . . . . . . . . . . . . . . . . 29
Physical Location Codes . . . . . . . . . . . . . . . . . . . . 29
Location Code Format . . . . . . . . . . . . . . . . . . . . 29
Multiple FRU Callout Instructions . . . . . . . . . . . . . . . . . 30
AIX Location Codes . . . . . . . . . . . . . . . . . . . . . . 32
AIX and Physical Location Code Reference Tables . . . . . . . . . . . 33
Cabling the CEC Drawer, Primary I/O Drawer, and Secondary I/O Drawer(s) . . . 53
Connecting JTAG and V/S COMM Cables . . . . . . . . . . . . . . 53
Connecting RIO and SPCN Cables . . . . . . . . . . . . . . . . 53
System Cable Diagrams. . . . . . . . . . . . . . . . . . . . . 63
CEC Drawer Cable . . . . . . . . . . . . . . . . . . . . . 63
Primary I/O Drawer Cable . . . . . . . . . . . . . . . . . . . 64
Secondary I/O Drawer Cable . . . . . . . . . . . . . . . . . . 65
DC Power Cable Routings . . . . . . . . . . . . . . . . . . . 66
Specifications . . . . . . . . . . . . . . . . . . . . . . . . 67
CEC Drawer . . . . . . . . . . . . . . . . . . . . . . . 68
Noise Emission Notes . . . . . . . . . . . . . . . . . . . . 71
Power Cables . . . . . . . . . . . . . . . . . . . . . . . . 72
Service Inspection Guide . . . . . . . . . . . . . . . . . . . . 73
iv Service Guide
Chapter 4. Checkpoints . . . . . . . . . . . . . . . . . . . . 147
IPL Flow . . . . . . . . . . . . . . . . . . . . . . . . . 149
Service Processor Checkpoints . . . . . . . . . . . . . . . . . . 150
Firmware Checkpoints . . . . . . . . . . . . . . . . . . . . . 156
Boot Problems and Concerns . . . . . . . . . . . . . . . . . . 167
Step 1 . . . . . . . . . . . . . . . . . . . . . . . . . 167
Step 2 . . . . . . . . . . . . . . . . . . . . . . . . . 167
Step 3 . . . . . . . . . . . . . . . . . . . . . . . . . 168
Step 4 . . . . . . . . . . . . . . . . . . . . . . . . . 168
Step 5 . . . . . . . . . . . . . . . . . . . . . . . . . 169
Contents v
Menu Inactivity . . . . . . . . . . . . . . . . . . . . . . 413
General User Menu . . . . . . . . . . . . . . . . . . . . . . 413
Privileged User Menus . . . . . . . . . . . . . . . . . . . . . 414
Main Menu. . . . . . . . . . . . . . . . . . . . . . . . 414
Service Processor Setup Menu . . . . . . . . . . . . . . . . . 416
Passwords . . . . . . . . . . . . . . . . . . . . . . . . 417
System Power Control Menu . . . . . . . . . . . . . . . . . . 420
System Information Menu . . . . . . . . . . . . . . . . . . . 424
Memory Riser Card 1 Memory DIMM Locations for Service Processor Menus 428
Memory Riser Card 2 Memory DIMM Locations for Service Processor Menus 430
Language Selection Menu . . . . . . . . . . . . . . . . . . 431
Call-In/Call-Out Setup Menu . . . . . . . . . . . . . . . . . . 432
Modem Configuration Menu . . . . . . . . . . . . . . . . . . 433
Serial Port Selection Menu . . . . . . . . . . . . . . . . . . 434
Serial Port Speed Setup Menu . . . . . . . . . . . . . . . . . 434
Telephone Number Setup Menu . . . . . . . . . . . . . . . . . 435
Call-Out Policy Setup Menu . . . . . . . . . . . . . . . . . . 436
Customer Account Setup Menu . . . . . . . . . . . . . . . . . 437
Service Processor Procedures in Service Mode . . . . . . . . . . . . 437
Service Processor Functions . . . . . . . . . . . . . . . . . . . 438
System Power-On Methods . . . . . . . . . . . . . . . . . . . 439
Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . 440
Boot (IPL) Speed . . . . . . . . . . . . . . . . . . . . . 440
Failure During Boot Process . . . . . . . . . . . . . . . . . . 440
Failure During Normal System Operation . . . . . . . . . . . . . . 440
Service Processor Reboot/Restart Policy Controls . . . . . . . . . . . 441
System Firmware Updates . . . . . . . . . . . . . . . . . . . 442
General Information on System Firmware Updates . . . . . . . . . . 442
Determining the Level of Firmware on the System. . . . . . . . . . . 443
System Firmware Update Using a Locally Available Image . . . . . . . . 443
Updating System Firmware From the Service Processor Menus . . . . . . 444
Updating System Firmware from the AIX Service Aids . . . . . . . . . 444
Updating System Firmware from the AIX Command Line . . . . . . . . 444
Recovery Mode . . . . . . . . . . . . . . . . . . . . . . 445
Configuring and Deconfiguring Processors or Memory . . . . . . . . . . 445
Run-Time CPU Deconfiguration (CPU Gard) . . . . . . . . . . . . 446
Service Processor System Monitoring - Surveillance . . . . . . . . . . . 446
System Firmware Surveillance . . . . . . . . . . . . . . . . . 446
Operating System Surveillance . . . . . . . . . . . . . . . . . 446
Call-Out (Call-Home) . . . . . . . . . . . . . . . . . . . . . 447
Console Mirroring . . . . . . . . . . . . . . . . . . . . . . 449
System Configuration . . . . . . . . . . . . . . . . . . . . 449
Service Processor Error Log . . . . . . . . . . . . . . . . . . . 450
LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . 451
Service Processor Operational Phases . . . . . . . . . . . . . . . 452
Pre-Standby Phase . . . . . . . . . . . . . . . . . . . . . 452
Standby Phase . . . . . . . . . . . . . . . . . . . . . . 452
Bring-Up Phase . . . . . . . . . . . . . . . . . . . . . . 453
Run-Time Phase . . . . . . . . . . . . . . . . . . . . . . 454
vi Service Guide
Chapter 9. Using System Management Services . . . . . . . . . . . 456
455
Password Utilities . . . . . . . . . . . . . . . . . . . . . 456
Display Error Log . . . . . . . . . . . . . . . . . . . . . 457
Remote Initial Program Load Setup . . . . . . . . . . . . . . . 457
SCSI Utilities . . . . . . . . . . . . . . . . . . . . . . . 461
Select Console . . . . . . . . . . . . . . . . . . . . . . 461
MultiBoot . . . . . . . . . . . . . . . . . . . . . . . . 462
Select Language . . . . . . . . . . . . . . . . . . . . . . 465
OK Prompt. . . . . . . . . . . . . . . . . . . . . . . . 465
Exiting System Management Services. . . . . . . . . . . . . . . 465
Contents vii
Replacement . . . . . . . . . . . . . . . . . . . . . . . 498
I/O Drawer Backplane Assembly . . . . . . . . . . . . . . . . . 498
Removal . . . . . . . . . . . . . . . . . . . . . . . . 498
Replacement . . . . . . . . . . . . . . . . . . . . . . . 500
I/O Drawer RIO Bus Adapter . . . . . . . . . . . . . . . . . . . 501
Removal . . . . . . . . . . . . . . . . . . . . . . . . 501
Replacement . . . . . . . . . . . . . . . . . . . . . . . 502
I/O Drawer Power Distribution Board . . . . . . . . . . . . . . . . 503
Removal . . . . . . . . . . . . . . . . . . . . . . . . 503
Replacement . . . . . . . . . . . . . . . . . . . . . . . 504
I/O Drawer Power Supplies and Power Supply Fan Assemblies . . . . . . . 505
Removal . . . . . . . . . . . . . . . . . . . . . . . . 506
Replacement . . . . . . . . . . . . . . . . . . . . . . . 507
Power Supply Test Switch Procedure . . . . . . . . . . . . . . . . 508
Internal Disk Drives . . . . . . . . . . . . . . . . . . . . . . 509
Removal . . . . . . . . . . . . . . . . . . . . . . . . 509
Replacement . . . . . . . . . . . . . . . . . . . . . . . 511
Fan Controller Card . . . . . . . . . . . . . . . . . . . . . . 512
Removal . . . . . . . . . . . . . . . . . . . . . . . . 512
Replacement . . . . . . . . . . . . . . . . . . . . . . . 512
CD-ROM Drive . . . . . . . . . . . . . . . . . . . . . . . 513
Removal . . . . . . . . . . . . . . . . . . . . . . . . 513
Replacement . . . . . . . . . . . . . . . . . . . . . . . 514
Tape Drive . . . . . . . . . . . . . . . . . . . . . . . . . 515
Removal . . . . . . . . . . . . . . . . . . . . . . . . 515
Replacement . . . . . . . . . . . . . . . . . . . . . . . 515
Diskette Drive . . . . . . . . . . . . . . . . . . . . . . . . 516
Removal . . . . . . . . . . . . . . . . . . . . . . . . 516
Replacement . . . . . . . . . . . . . . . . . . . . . . . 516
Operator Panel . . . . . . . . . . . . . . . . . . . . . . . 517
Removal . . . . . . . . . . . . . . . . . . . . . . . . 517
Replacement . . . . . . . . . . . . . . . . . . . . . . . 518
Rear Service Position . . . . . . . . . . . . . . . . . . . . . 519
Front Service Position . . . . . . . . . . . . . . . . . . . . . 520
Battery . . . . . . . . . . . . . . . . . . . . . . . . . . 521
Removal . . . . . . . . . . . . . . . . . . . . . . . . 521
Replacement . . . . . . . . . . . . . . . . . . . . . . . 522
Contents ix
Appendix G. Rack Drawer Rail Instructions . . . . . . . . . . . . . 587
Front Drawer-Stop Installation . . . . . . . . . . . . . . . . . . 587
Drawer Rail Assembly and Installation. . . . . . . . . . . . . . . . 588
Rails for an S00 Type Rack . . . . . . . . . . . . . . . . . . . 588
Rails for a T00 Type Rack . . . . . . . . . . . . . . . . . . . 588
Rail Installation . . . . . . . . . . . . . . . . . . . . . . . 588
Cable Routing . . . . . . . . . . . . . . . . . . . . . . . 589
Index . . . . . . . . . . . . . . . . . . . . . . . . . . 593
x Service Guide
Safety Notices
A danger notice indicates the presence of a hazard that has the potential of causing
death or serious personal injury. Danger notices appear on the following pages:
v xii
v 94
v 467
v 506
A caution notice indicates the presence of a hazard that has the potential of causing
moderate or minor personal injury. Caution notices appear on the following pages:
v xii
v xiii
v 94
v 467
v 513
v 521
Note: For a translation of these notices, see the System Unit Safety Information
manual, SA23-2652.
xi
Electrical Safety
Observe the following safety instructions any time you are connecting or disconnecting
devices attached to the workstation.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage
on metal parts of the system or the devices that attach to the system. It is the
responsibility of the customer to ensure that the outlet is correctly wired and
grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for
the system unit and all attached devices are unplugged.
During an electrical storm, do not connect cables for display stations, printers,
telephones, or station protectors for communications lines.
CAUTION:
This product is equipped with a three–wire power cable and plug for the user’s
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
DANGER
To prevent electrical shock hazard, disconnect all power cables from the
electrical outlet before relocating the system.
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical
shock, disconnect two power supply cords before servicing.
CAUTION:
This product may contain a CD-ROM which is a class 1 laser product.
Laser Compliance
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR
Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in
compliance with the IEC 825 (first edition 1984) as a class 1 laser product. Consult the
label on each part for laser certification numbers and approval information.
CAUTION:
All IBM laser modules are designed so that there is never any human access to
laser radiation above a class 1 level during normal operation, user maintenance,
or prescribed service conditions. Data processing environments can contain
equipment transmitting on system links with laser modules that operate at
greater than class 1 power levels. For this reason, never look into the end of an
optical fiber cable or open receptacle. Only trained service personnel should
perform the inspection or repair of optical fiber cable assemblies and receptacles.
Preface xiii
xiv Service Guide
Data Integrity and Verification
IBM computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who
experience unplanned outages, system failures, power fluctuations or outages, or
component failures must verify the accuracy of operations performed and data saved or
transmitted by the system at or near the time of the outage or failure. In addition, users
must establish procedures to ensure that there is independent data verification before
relying on such data in sensitive or critical operations. Users should periodically check
the IBM support websites for updated information and fixes applicable to the system and
related software.
xv
xvi Service Guide
About This Book
This book provides maintenance information that is specific to the RS/6000 Enterprise
Server Model M80 and Eserver pSeries 660 Model 6M1, as well as to adapters and
attached devices that do not have their own service information. In this book, the
RS/6000 Enterprise Server Model M80 and Eserver pSeries 660 Model 6M1 are
hereafter referred to as the ″system″ or the ″server.″
This book contains maintenance analysis procedures (MAPs) that are not common to
other systems. MAPs that are common to all systems are contained in the RS/6000 and
Eserver pSeries Diagnostic Information for Multiple Bus Systems.
This book is used by the service technician to repair system failures. This book
assumes that the service technician has had training on the system unit.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing
of this product.
Online Publications
RS/6000 and pSeries publications are available online. To access the online books, visit
our Web site at: http://www.rs6000.ibm.com/resource/hardware_docs/
Related Publications
The following publications provide additional information about your system unit:
v The System Unit Safety Information, order number SA23-2652, contains translations
of safety information used throughout this book.
v The RS/6000 Enterprise Server Model M80, Eserver pSeries 660 Model 6M1
User’s Guide, order number SA38-0565, contains information on how to use the
system, use diagnostics, use service aids, and verify system operations.
v The 7014 Series Model T00 and T42 Rack Installation and Service Guide, order
number SA38-0577, contains information regarding the 7014 Series Model T00 Rack,
in which the Enterprise Server Model M80 and Eserver pSeries 660 Model 6M1
may be installed.
v The RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems,
order number SA38-0509, contains common diagnostic procedures, error codes,
service request numbers, and failing function codes. This manual is intended for
trained service technicians.
v The RS/6000 and Eserver pSeries Adapters, Devices, and Cable Information for
Multiple Bus Systems, order number SA38-0516 contains information about adapters,
devices, and cables for your system. This manual is intended to supplement
information found in the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems.
xvii
v The PCI Adapter Placement Reference, order number SA38-0538, contains
information regarding slot restrictions for adapters that can be used in this system.
v The Site and Hardware Planning Information, order number SA38-0508, contains
information to help you plan your installation.
Trademarks
The following terms are trademarks of International Business Machines Corporation in
the United States, other countries, or both:
v AIX
v e (logo)
v pSeries
v RS/6000
Other company, product, and service names may be trademarks or service marks of
others.
Overview
The RS/6000 Enterprise Server Model M80 and Eserver pSeries 660 Model 6M1
systems are multiprocessor, multibus systems packaged in two to five drawers. The
processors and memory are packaged in an 8 EIA-unit central electronics complex
(CEC) drawer, and the optional DASD and I/O devices are in 5 EIA-unit I/O drawers.
The basic system consists of one CEC drawer and one I/O drawer in the same rack.
You can expand the system by adding up to three additional I/O drawers in a minimum
of two racks.
A number of cables connect the CEC and I/O drawers. These cables include:
v SPCN (System Power Control Network) cables
v V/S COMM cables
v RIO (Remote Input Output) cables
v JTAG cable
An ac power cord (dc power cord in a dc-powered system) is connected to the CEC
drawer (drawer 1), primary I/O drawer (drawer 0), and secondary I/O drawer (drawer 2).
You can also connect two additional secondary drawers (drawers 3 and 4) in another
rack with a separate power distribution unit and ac power cord (dc power cord in a
dc-powered system). A dc-powered system has no power distribution unit.
1
Processors
The CEC drawer is powered independently from the I/O drawer and supports two
processor cards.
Model M80:
v Can have any combination of 2-way or 4-way processor cards running at 500 MHz
v All 500 MHz processors can have their own 4 MB L2 cache
Model 6M1:
v Can be ordered with 500 MHz processors or 750 MHz processors
v Systems with 500 MHz processors can have one or two 2-way processor cards
v Systems with 750 MHz processors can have any combination of 2-way and 4-way
processor cards
v All 500 MHz processors can have their own 4 MB L2 cache
v All 750 MHz processors can have their own 8 MB L2 cache
The Capacity Upgrade on Demand feature is available for this system when a minimum
of four processors running at 750 MHz are installed. The Capacity Upgrade on Demand
feature is not available on systems in which 500 MHz processors are installed. For
more information about Capacity Upgrade on Demand, see Chapter 7, “Using the
Capacity Upgrade on Demand Feature” on page 403.
Note: In order for the system to boot, the first processor card slot must be populated.
Memory
v Model M80: 1 GB (minimum) to 32 GB (maximum)
v Model 6M1: 1 GB (minimum) to 64 GB (maximum)
v One or two memory riser cards; each riser card has thirty-two sockets. 128 MB, 256
MB, 512 MB, and 1 GB dual inline memory modules (DIMMs) are available.
v Certain 32 MB DIMMs from older RS/6000 systems can also be used when
upgrading the system memory.
2 Service Guide
Primary I/O Drawer
The primary I/O drawer has the following:
v 14 PCI slots
v System operator panel
v SPCN controller
v System media
v Service processor
v Optional boot DASDs that occupy PCI slots 13 and 14
v Various connectors, including four serial port connectors, on the back of the drawer
for the attachment of external devices
Note: Serial ports 1 and 2 can only be used to access the service processor menus.
No ″heartbeat″-type devices or cables can be used on these ports.
″Heartbeat″-type devices or cables must be installed on serial port 3 or serial
port 4.
Slots 1-5, 8-10, 13 and 14 support either 32- or 64-bit PCI adapters. The remaining
slots are 32-bit only.
The CEC drawer includes the interrupt and system bus control logic.
The primary I/O drawer has space for up to three media devices. The diskette drive and
CD-ROM drive are standard; the third media device is optional.
4 Service Guide
Powering the System On and Off
The system can be powered on after the following cables are connected:
v V/S COMM Cable
v All RIO cables
v All SPCN cables
v JTAG Cable
v All PCI cables to supported drawers
After the required cables are installed, and the power cables are connected, the power
button on the primary I/O drawer operator panel can be pushed to initialize the system.
Progress indicators, also referred to as checkpoints, are visible on the primary I/O
drawer operator panel display and the power LED on the primary I/O drawer stops
blinking and stays on, indicating the system power is on.
The CEC drawer and secondary I/O drawers are powered on through the primary I/O
drawer system power control network (SPCN). When power is applied, the power LEDs
on the primary I/O drawer go from blinking to on continuously, and the power LED on
the CEC drawer and secondary I/O drawer(s) comes on and stays on. This indicates
that power levels are satisfactory in the drawers.
If the system is operating under AIX, type the shutdown command to power off the
system. If you cannot use this method, you can power off the system by pressing the
power button on the primary I/O drawer operator panel.
Attention: Using the power button on the operator panel to power off the system can
cause unpredictable results in the data files, and the next IPL will take longer to
complete.
For complete details on how to power on and off the system, go to “Powering Off and
Powering On the System” on page 469 and “System Power-On Methods” on page 439.
The console-selection sequence allows you to select (from the appropriate input device)
any one of the available console devices. If no console is selected within approximately
60 seconds, serial port 1 (S1) is selected as the console and the selection sequence
times-out.
Attention: If an ASCII terminal is attached to serial port 1 (S1), and there is any
interaction with this terminal, then:
v After OK displays in the operator panel,
AND
v Before the power-on sequence is initiated,
the firmware will use this terminal as the console, regardless of the previous console
selection.
After a console has been selected, the console-selection sequence is only started at
boot time if there is a change in the system configuration (as previously described), or
the contents of the system’s nonvolatile memory (NVRAM) are lost.
Note: Moving an ASCII terminal from one serial port to another (from S1 to S2) cannot
be detected by the firmware, so it does not constitute a configuration change.
You can also initiate a system console-selection sequence from the SMS menus.
Power-On Self-Test
After power is turned on and before the operating system is loaded, the system does a
power-on self-test (POST). This test performs checks to ensure that the hardware is
functioning correctly before the operating system is loaded. During the POST, a POST
screen displays, and POST indicators appear on the system console (if one is
connected). The next section describes the POST indicators and functions that can be
accessed during the POST.
6 Service Guide
POST Indicators
POST indicators indicate tests that are being performed as the system is preparing to
load the operating system. The POST indicators are words that display on the system
console. Each time that the system starts a different step in the POST, a POST
indicator word appears on the console. Each word is an indicator of the tests that are
being performed.
POST Keys
The POST keys, if pressed after the keyboard POST indicator displays and before the
last POST indicator (speaker) displays, cause the system to start services or to initiate
service mode boots used for configuring the system and diagnosing problems. The keys
are described below:
Note: The program function keys (F1-F12) on a keyboard attached to the system unit
are no longer used and will be ignored during POST. After the keyboard POST
indicator displays, you must use the numeric number keys to enter input.
1 Key
The numeric 1 key, when pressed during POST, starts the System Management
Services (SMS) interface.
5 Key
The numeric 5 key, when pressed during POST, initiates a system boot in service mode
using the default service mode boot list.
This mode attempts to boot from the first device of each type found in the list. It does
not search for other bootable devices of that type if the first device is not bootable.
Instead, it continues to the next device type in the list. The firmware supports up to five
entries in the boot list.
6 Key
The numeric 6 key works like the numeric 5 key, except that firmware uses the
customized service mode bootlist that was set up in System Management Services or in
the AIX service aids.
8 Key
To enter the open firmware command line, press the numeric 8 key after the word
keyboard displays and before the last word (speaker) displays during startup. After you
press the 8 key, the remaining POST indicators display until initialization completes.
When initialization and POST are complete, the open firmware command line (an OK
prompt) displays.
Use the open firmware command line to set up adapters that are not configurable with
the System Management Services. Your adapter documentation directs you to use this
option if it is needed.
To exit from the open firmware command prompt, type reset-all or power off the
system and reboot.
8 Service Guide
System Locations
The system consists of a minimum of two drawers in one rack: the CEC drawer and the
primary I/O drawer. These two components are connected by cables that transmit data
and control signals. Additional I/O drawers can be added if further expansion of the
system is required. The following figure shows the system units.
Primary
R
Primary
R
Secondary
I/O Drawer
(Optional)
(5 EIA Units)
Note: For information about installing rack rails, see Appendix G, “Rack Drawer Rail
Instructions” on page 587.
3 2
10 Service Guide
CEC Drawer Rear View
1
4
7
6 5
12 Service Guide
Primary I/O Drawer Front View With Cover On
1
2
14 Service Guide
Primary I/O Drawer Rear View
18
17 19
1 2 3 4 5 6 7 8 9 10 11 12 13 14
P1
1 2 3 4 5 6 7 8
9 10 11 12 13 14 15 16
4
5 6 7
16 Service Guide
Secondary I/O Drawer Rear View
2
8
1 2 3 4 5 6 7 8 9 10 11 12 13 14
P1
3 4 5
6 7
18 Service Guide
System Board Locations
2 3
4 5
1
Rear
Note: In order for the system to boot, the first processor card slot (M03) must be
populated.
(31) D (32) D
(29) C (30) C
(27) B (28) B
(25) A (26) A
(23) D (24) D
(21) C (22) C
(19) B (20) B
(17) A (18) A
(15) D (16) D
(13) C (14) C
(11) B (12) B
(9) A (10) A
(7) D (8) D
(5) C (6) C
(3) B (4) B
(1) A (2) A
20 Service Guide
Location Code Memory DIMMs
U1.1-P1-M1.1 x8 Memory Octal A (DIMMs 1, 2, 9, 10, 17, 18, 25,
26)
U1.1-P1-M1.1 x32 All Memory DIMMs on Riser Card 1
U1.1-P1-M1.2 x4 Memory Quad A Even (DIMMs 2, 10, 18, 26)
U1.1-P1-M1.3 x4 Memory Quad B Odd (DIMMs 3, 11, 19, 27)
U1.1-P1-M1.3 x8 Memory Octal B (DIMMs 3, 4, 11, 12, 19, 20,
27, 28)
U1.1-P1-M1.4 x4 Memory Quad B Even (DIMMs 4, 12, 20, 28)
U1.1-P1-M1.5 x4 Memory Quad C Odd (DIMMs 5, 13, 21, 29)
U1.1-P1-M1.5 x8 Memory Octal C (DIMMs 5, 6, 13, 14, 21, 22,
29, 30)
U1.1-P1-M1.6 x4 Memory Quad C Even (DIMMs 6, 14, 22, 30)
U1.1-P1-M1.7 x4 Memory Quad D Odd (DIMMs 7, 15, 23, 31)
U1.1-P1-M1.7 x8 Memory Octal D (DIMMs 7, 8, 15, 16, 23, 24,
31, 32)
U1.1-P1-M1.8 x4 Memory Quad D Even (DIMMs 8, 16, 24, 32)
U1.1-P1-M2.n Individual Memory DIMMs on Riser Card 2 (n
denotes DIMM number)
U1.1-P1-M2.1 x4 Memory Quad A Odd (DIMMs 1, 9, 17, 25)
U1.1-P1-M2.1 x8 Memory Octal A (DIMMs 1, 2, 9, 10, 17, 18, 25,
26)
U1.1-P1-M2.1 x32 All Memory DIMMs on Riser Card 2
U1.1-P1-M2.2 x4 Memory Quad A Even (DIMMs 2, 10, 18, 26)
U1.1-P1-M2.3 x4 Memory Quad B Odd (DIMMs 3, 11, 19, 27)
U1.1-P1-M2.3 x8 Memory Octal B (DIMMs 3, 4, 11, 12, 19, 20,
27, 28)
U1.1-P1-M2.4 x4 Memory Quad B Even (DIMMs 4, 12, 20, 28)
U1.1-P1-M2.5 x4 Memory Quad C Odd (DIMMs 5, 13, 21, 29)
U1.1-P1-M2.5 x8 Memory Octal C (DIMMs 5, 6, 13, 14, 21, 22,
29, 30)
U1.1-P1-M2.6 x4 Memory Quad C Even (DIMMs 6, 14, 22, 30)
U1.1-P1-M2.7 x4 Memory Quad D Odd (DIMMs 7, 15, 23, 31)
U1.1-P1-M2.7 x8 Memory Octal D (DIMMs 7, 8, 15, 16, 23, 24,
31, 32)
U1.1-P1-M2.8 x4 Memory Quad D Even (DIMMs 8, 16, 24, 32)
For physical locations, see “Memory Riser Card and Memory DIMM Locations” on
page 20.
!
9 R
8
7 5
6
Note: You must activate the service processor reset button very carefully. An insulated
paper clip is recommended. Unbend the clip so that it has a straight section
about two inches long. Insert the clip straight into the hole, keeping the clip
perpendicular to the plastic bezel. When you engage the test reset switch, you
should feel the detent of the switch. After you press the switch, the service
processor resets and then shuts down the system.
22 Service Guide
SCSI IDs and Bay Locations
The following figure shows the SCSI IDs for media devices. The SCSI IDs for the media
devices installed at the factory are set using jumpers on the drives when the drives are
installed.
Note: The SCSI IDs shown for the media indicate how installed devices are set when
shipped from the factory.
Four sizes of DIMMs are available: 128 MB, 256 MB, 512 MB and 1 GB. In addition,
certain 32 MB DIMMS from earlier RS/6000 systems can also be used.
24 Service Guide
I/O Drawer Features
I/O drawers may have two redundant power supplies. One of these power supplies is
capable of providing the necessary voltages and currents independently of the other.
The left and right power supply output voltages are connected and monitored by the
power distribution board contained in the 5 EIA-unit I/O drawer. Both power supplies
provide +12 V dc to the fans in the drawer to ensure sufficient cooling if one of the
supplies fails. However, the short circuit prevention is on the I/O board to protect
against shorts in one output, thus causing the +12 V dc to fail completely.
The left and right power supplies are hot-pluggable and may be changed one-at-a-time
while the system is operational. Each power supply provides 5 V dc, 3.3 V dc, 12 V dc,
and 5 V dc standby power.
The 5 V dc standby power is provided by both power supplies; this can be done
because the power supplies contain overload protection against one supply shorting the
other. Standby power (5 V dc) is provided to the part of the primary I/O drawer planar
on which the SPCN and service processor logic reside.
When ac power (dc power in a dc-powered system) has been disconnected from an I/O
drawer and then reconnected, the I/O drawer addresses on the secondary drawers will
temporarily appear in the format *0n, where n is the drawer number. After the system is
powered on and the boot sequence has been initiated, the drawer numbers will be
restored to the U0.n format.
26 Service Guide
Secondary I/O Drawer Operator Panel Message Hierarchy
The messages that are displayed in the operator panel on secondary I/O drawers are
arranged hierarchically, from highest priority to lowest priority. The message hierarchy is
as follows:
1. 8-character error code
This indicates a critical error; the drawer power will be shut down. Standby power is
still on, so the error message is still displayed in the operator panel. The I/O drawer
Attention LED will be on. The error code displays on the top line of the display
panel, left-justified. A FRU part number may also be displayed, right-justified, in the
second line of the display.
2. 8-character informational message
This is a noncritical message, so the drawer power will not be shut down. The
message should be investigated to see if service actions are indicated. A FRU part
number may also be displayed, right-justified, in the second line of the display.
When 8-character codes are displayed, they are in the form 10n1xxxx, where n is
the drawer number.
3. 8-character checkpoint
There is one 8-character checkpoint that displays, 10n100AA, where n is the drawer
number. This checkpoint indicates that the system power control network firmware is
being updated, and the checkpoint displays for approximately 15 seconds. This is
the only checkpoint that displays on a secondary I/O drawer operator panel.
4. Drawer number
This will be in the form U0.n, where n is the drawer number. This message will
normally display unless a higher-level message is required.
When ac power (dc power in a dc-powered system) has been disconnected from
the system, then reconnected, the I/O drawer addresses on the secondary drawers
will temporarily appear in the format *0n, where n is the drawer number. After the
system is powered on, and the boot sequence has been initiated, the drawer
numbers are restored to the U0.n format.
Critical error codes are posted on the top line (left-justified) of the drawer display with
the attention light on solid. Any location codes are posted on the bottom line
(left-justified).
Noncritical error codes are not posted. The drawer display remains blank with the
attention light on solid.
28 Service Guide
Secondary I/O Drawer Operator Panel Behavior During Power-On
The period (.) identifies sublocations (DIMMs on a memory card, SCSI addresses,
cables). The following are examples:
v P1-M1.4 identifies DIMM 4 on memory card 1 plugged into planar P1.
v P1-C1.1 identifies processor 1 plugged into processor card 1 which is plugged into
planar P1.
v U0.1-P1/Z1–A3 identifies a SCSI device with SCSI ID 3 attached to SCSI bus 1 on
planar 1 in the primary I/O drawer.
v P2-Z1-A3.1 identifies a SCSI device with SCSI address of LUN 1 at SCSI ID 3
attached to SCSI bus 1 from planar 2.
v P1-I2#E3.2 identifies the second in a series of cables attached to Ethernet controller
3 on the card in slot 2 (I2) on planar P1.
Depending on the AIX and firmware levels, AIX Diagnostics may include the extended
location information when identifying a planar or card. The extended location
information or cable information is always included when identifying a cable or
connector. Location codes with extended location information that are displayed without
a description identifying the devices always identify the cable attached to the port.
For example, if the location code U1.1-P1-M1.1 x4 was displayed, this indicates
memory quad A (four DIMMs) on the first memory riser card was suspected.
You can determine the FRU part numbers of the electronic assemblies in the system in
one of the following ways:
v Using the service processor menus
From the general user menu, select Read VPD Image from Last System Boot, then
enter 90 to display detailed vital product data (VPD).
v Using the lscfg -vp | pg command on the AIX command line
30 Service Guide
Enter the following command: lscfg -vp | pg to display the detailed VPD of all
assemblies. Notice that the FRU part number information for processors and memory
DIMMs may be at the bottom of the command output.
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
v The AB value identifies a bus type or PCI parent bus as assigned by the firmware.
v The CD value identifies adapter number, adapter’s devfunc number, or physical
location. The devfunc number is defined as the PCI device number times 8, plus the
function number.
v The EF value identifies a connector.
v The GH value identifies a port, address, device, or FRU.
The possible values for CD depend on the adapter/card. For pluggable PCI
adapters/cards, CD is the device’s devfunc number (PCI device number times 8, plus
the function number). The C and D are characters in the range of 0-9, and A-F (hex
numbers). The location codes therefore uniquely identify multiple adapters on individual
PCI cards.
EF is the connector ID, used to identify the adapter’s connector to which a resource is
attached.
For integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For
example, the location code for a diskette drive is 01-D1-00-00. A second diskette drive
is 01-D1-00-01.
32 Service Guide
For SCSI devices, the location code is defined as follows:
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit address of the SCSI Device
| | | Control Unit Address of the SCSI Device
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
Where:
v AB-CD-EF are the same as non-SCSI devices.
v G defines the control unit address of the device. Values of 0 to 15 are valid.
v H defines the logical unit address of the device. Values of 0 to 255 are valid.
34 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
Memory Octal B on U1.1-P1-M1.3 x8 00-00 Refer to
Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad B U1.1-P1-M1.4 x4 00-00 Refer to
Even on Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad C Odd U1.1-P1-M1.5 x4 00-00 Refer to
on Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Octal C on U1.1-P1-M1.5 x8 00-00 Refer to
Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad C U1.1-P1-M1.6 x4 00-00 Refer to
Even on Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad D Odd U1.1-P1-M1.7 x4 00-00 Refer to
on Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Octal D on U1.1-P1-M1.7 x8 00-00 Refer to
Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad D U1.1-P1-M1.8 x4 00-00 Refer to
Even on Riser Card 1 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
36 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
Memory Octal B U1.1-P1-M2.4 x4 00-00 Refer to
Even on Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad C Odd U1.1-P1-M2.5 x4 00-00 Refer to
on Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Octal C on U1.1-P1-M2.5 x8 00-00 Refer to
Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad C U1.1-P1-M2.6 x4 00-00 Refer to
Even on Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad D Odd U1.1-P1-M2.7 x4 00-00 Refer to
on Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Octal D on U1.1-P1-M2.7 x8 00-00 Refer to
Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Memory Quad D U1.1-P1-M2.8 x4 00-00 Refer to
Even on Riser Card 2 “Memory Riser
Card and
Memory DIMM
Locations” on
page 20.
Processor Card 1 U1.1-P1-C1 Connector M03
in CEC Drawer
2W Card 1, U1.1-P1-C1 00-00 Connector M03
Processor 1 in CEC Drawer
38 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
Regulator 1 U1.1-P1-V1 Connector M01
in CEC Drawer
Regulator 2 U1.1-P1-V1 Connector M01
in CEC Drawer
Power Distribution U1.1-X1
Card
LED Card U1.1-L1
Power Supply 1 U1.1-V1 PP1 in CEC
drawer
Power Supply 2 U1.1-V2 PP2 in CEC
drawer
Fan 1 U1.1-F1 BP1 in CEC
drawer
Fan 2 U1.1-F2 BP2 in CEC
drawer
Primary I/O Drawer Locations
Primary I/O Drawer U0.1
RIO Bus Adapter U0.1-P1.1
RIO Port 0 Connector U0.1-P1.1/Q1
RIO Port 1 Connector U0.1-P1.1/Q2
All RIO Port U0.1-P1.1
Connectors
Primary I/O drawer U0.1-P1
backplane
PCI to PCI bridge 1 U0.1-P1 10-58, 10-5A,
10-5C, 10-5E
PCI slot 1 U0.1-P1/I1 10-58
PCI slot 1 content U0.1-P1-I1 11-08 through
(card) 11-0F or 12-xx
or 13-xx
PCI slot 2 U0.1-P1/I2 10-5A
PCI slot 2 content U0.1-P1-I2 14-08 through
(card) 14-0F
or 15-xx or 16-xx
PCI slot 3 U0.1-P1/I3 10-5C
PCI slot 3 content U0.1-P1-I3 17-08 through
(card) 17-0F
or 18-xx or 19-xx
PCI slot 4 U0.1-P1/I4 10-5E
40 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
PCI slot 12 content U0.1-P1-I12 37-08 through
(card) 37-0F
or 38-xx or 39-xx
PCI slot 13 U0.1-P1/I13 20-64
PCI slot 13 content U0.1-P1-I13 3A-08 through
(card) 3A-0F
or 3B-xx or
3C-xx
PCI slot 14 U0.1-P1/I14 20-66
PCI slot 14 content U0.1-P1/I14 3D-08 through
(card) 3D-0F
or 3E-xx or
3F-xx
PCI bus controller 0 U0.1-P1 00-FFF7F08000
PCI bus controller 1 U0.1-P1 00-FFF7F09000
PCI bus controller 2 U0.1-P1 00-FFF7F0A000
ISA Bus U0.1-P1 10-80
Service Processor U0.1-P1
JTAG Connector U0.1-P1/Q4
NVRAM U0.1-P1
Service Processor U0.1-P1
ISA Bridge
Real Time Clock U0.1-P1
(RTC)
Timer U0.1-P1
Interrupt Controller U0.1-P1
DMA Controller U0.1-P1
Diskette controller U0.1-P1-D1 01-D1
Diskette drive U0.1-P1/D1 01-D1-00-00
Keyboard controller / U0.1-P1/K1 01-K1-00
connector
Keyboard U0.1-P1/K1 01-K1-00-00
Mouse controller U0.1-P1/K1 01-K1-01
Mouse connector U0.1-P1-01 01-K1-01
Mouse U0.1-P1-01 01-K1-01-00
Parallel port U0.1-P1/R1 01-R1
connector
Serial port 1 U0.1-P1/S1 01-S1
connector
42 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
Fan 2 U0.1-F2 Fan connector
P3 on power
supply 1
Fan 3 U0.1-F3 Fan connector
P3 on power
supply 2
Fan 4 U0.1-F4 Fan connector
P3 on power
supply 2
Cooling unit (w/fan 9) U0.1-F9 In cooling unit
System firmware U0.1-P1/Y1
Secondary I/O Drawer 2 Locations
Secondary I/O U0.2
Drawer
RIO Bus Adapter U0.2-P1.1
RIO Port 0 Connector U0.2-P1.1/Q1
RIO Port 1 Connector U0.2-P1.1/Q2
All RIO Port U0.2-P1.1
Connectors
Secondary I/O drawer U0.2-P1
backplane
PCI to PCI bridge 1 U0.2-P1 50-58, 50-5A,
50-5C, 50-5E
PCI slot 1 U0.2-P1/I1 50-58
PCI slot 1 content U0.2-P1-I1 51-08 through
(card) 51-0F
or 52-xx or 53-xx
PCI slot 2 U0.2-P1/I2 50-5A
PCI slot 2 content U0.2-P1-I2 54-08 through
(card) 54-0F
or 55-xx or 56-xx
PCI slot 3 U0.2-P1/I3 50-5C
PCI slot 3 content U0.2-P1-I3 57-08 through
(card) 57-0F
or 58-xx or 59-xx
PCI slot 4 U0.2-P1/I4 50-5E
PCI slot 4 content U0.2-P1-I4 5A-08 through
(card) 5A-0F
or 5B-xx or
5C-xx
44 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
PCI slot 13 content U0.2-P1-I13 7A-08 through
(card) 7A-0F
or 7B-xx or
7C-xx
PCI slot 14 U0.2-P1/I14 60-66
PCI slot 14 content U0.2-P1/I14 7D-08 through
(card) 7D-0F
or 7E-xx or
7F-xx
PCI bus controller 1 U0.2-P1 00-FFF7E09000
PCI bus controller 2 U0.2-P1 00-FFF7E0A000
SPCN controller U0.2-P1
External SCSI U0.2-P1/Z1
connector
Secondary I/O drawer U0.2-L1-N1
VPD module
Thermal sensor U0.2-L1
(ambient)
Thermal sensor (left) U0.2-P1
Thermal sensor U0.2-P1
(right)
Power distribution U0.2-X1
board
PDB power connector U0.2-X1/V1
1
PDB power connector U0.2-X1/V2
2
Fan Controller Card U0.2-X2 I/O board
connector J3
Power Supply 1 U0.2-V1 PDB connector
(w/Fan 5) J2
Power Supply 2 U0.2-V2 PDB connector
(w/Fan 6) J1
Fan 1 U0.2-F1 Fan connector
P3 on power
supply 1
Fan 2 U0.2-F2 Fan connector
P3 on power
supply 1
Fan 3 U0.2-F3 Fan connector
P3 on power
supply 2
46 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
PCI slot 5 content U0.3-P1-I5 A1-08 through
(card) A1-0F
or A2-xx or
A3-xx
PCI slot 6 (5V) U0.3-P1/I6 A0-5A
PCI slot 6 content U0.3-P1-I6 A4-08 through
(card) A4-0F
or A5-xx or
A6-xx
PCI slot 7 (5V) U0.3-P1/I7 A0-5B
PCI slot 7 content A0.3-P1-I7 A7-08 through
(card) A7-0F
or A8-xx or
A9-xx
PCI slot 8 A0.3-P1/I8 A0-5C
PCI slot 8 content U0.3-P1-I8 AA-08 through
(card) AA-0F
or AB-xx or
AC-xx
PCI slot 9 U0.3-P1/I9 A0-5E
PCI slot 9 content U0.3-P1-I9 AD-08 through
(card) AD-0F
or AE-xx or
AF-xx
PCI to PCI bridge 3 U0.3-P1 A0-60, A0-62,
A0-63, A0-64,
A0-66
PCI slot 10 U0.3-P1/I10 A0-60
PCI slot 10 content U0.3-P1-I10 B1-08 through
(card) B1-0F
or B2-xx or
B3-xx
PCI slot 11 (5V) U0.3-P1/I11 A0-62
PCI slot 11 content U0.3-P1-I11 B4-08 through
(card) B4-0F
or B5-xx or
B6-xx
PCI slot 12 (5V) U0.3-P1/I12 A0-63
PCI slot 12 content U0.3-P1-I12 B7-08 through
(card) B7-0F
or B8-xx or
B9-xx
PCI slot 13 U0.3-P1/I13 A0-64
48 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
Fan 4 U0.3-F4 Fan connector
P3 on power
supply 2
Cooling unit (w/fan 9) U0.3-F9 In cooling unit
SPCN firmware U0.3-P1/Y3
SPCN VPD U0.3-P1/Y4
Secondary I/O Drawer 4 Locations
Secondary I/O U0.4
Drawer
RIO Bus Adapter U0.4-P1.1
RIO Port 0 Connector U0.4-P1.1/Q1
RIO Port 1 Connector U0.4-P1.1/Q2
All RIO Port U0.4-P1.1
Connectors
Secondary I/O drawer U0.4-P1
backplane
PCI to PCI bridge 1 U0.4-P1 D0-58, D0-5A,
D0-5C, D0-5E
PCI slot 1 U0.4-P1/I1 D0-58
PCI slot 1 content U0.4-P1-I1 D1-08 through
(card) D1-0F
or D2-xx or
D3-xx
PCI slot 2 U0.4-P1/I2 D0-5A
PCI slot 2 content U0.4-P1-I2 D4-08 through
(card) D4-0F
or D5-xx or
D6-xx
PCI slot 3 U0.4-P1/I3 D0-5C
PCI slot 3 content U0.4-P1-I3 D7-08 through
(card) D7-0F
or D8-xx or
D9-xx
PCI slot 4 U0.4-P1/I4 D0-5E
PCI slot 4 content U0.4-P1-I4 DA-08 through
(card) DA-0F
or DB-xx or
DC-xx
PCI to PCI bridge 2 U0.4-P1 E0-58, E0-5A,
E0-5B, E0-5C,
E0-5E
PCI slot 5 U0.4-P1/I5 E0-58
50 Service Guide
Location AIX Physical Logical
FRU Name Code Location Connection Connection
Code
PCI slot 13 content U0.4-P1-I13 FA-08 through
(card) FA-0F
or FB-xx or
FC-xx
PCI slot 14 U0.4-P1/I14 E0-66
PCI slot 14 content U0.4-P1/I14 FD-08 through
(card) FD-0F
or FE-xx or
FF-xx
PCI bus controller 1 U0.4-P1 00-FFF7C09000
PCI bus controller 2 U0.4-P1 00-FFF7C0A000
SPCN controller U0.4-P1
External SCSI U0.4-P1/Z1
connector
Secondary I/O drawer U0.4-L1-N1
VPD module
Thermal sensor U0.4-L1
(ambient)
Thermal sensor (left) U0.4-P1
Thermal sensor U0.4-P1
(right)
Power distribution U0.4-X1
board
PDB power connector U0.4-X1/V1
1
PDB power connector U0.4-X1/V2
2
Fan controller card U0.4-X2
Power supply 1 U0.4-V1 PDB connector
(w/Fan 5) J2
Power supply 2 U0.4-V2 PDB connector
(w/Fan 6) J1
Fan 1 U0.4-F1 Fan connector
P3 on power
supply 1
Fan 2 U0.4-F2 Fan connector
P3 on power
supply 1
Fan 3 U0.4-F3 Fan connector
P3 on power
supply 2
52 Service Guide
Cabling the CEC Drawer, Primary I/O Drawer, and Secondary I/O Drawer(s)
The minimum system consists of two drawers. The CEC drawer contains the system
processors, memory and supporting hardware. The primary I/O drawer contains PCI
adapters, operator panel, disk drives, and media drives. The system must have one
primary I/O drawer and it contains the service processor for the system. The system
supports up to three additional I/O drawers in various rack configurations.
To connect the CEC drawer to the I/O drawers, use the following steps:
1. Connect the cables that only connect between the CEC drawer and the primary I/O
drawer such as the JTAG and the V/S COMM cable.
2. Connecting the cables that connect between the CEC drawer and all the I/O
drawers in loops, such as the RIO cables and the SPCN cables.
All power cables connect to the PDU. If a second PDU is specified along with
secondary drawer power supplies for all drawers, the secondary power supplies
connect to the secondary PDU.
CEC Drawer
RIO Cabling
The following rules apply to RIO cable connections:
v I/O drawers 0 and 1 are connected using the first RIO loop, which uses RIO ports 0
and 1 on the CEC drawer.
v The primary I/O drawer must be installed and connected to RIO port 0 of the CEC
drawer. The connection must be made from RI0 port 0 of the CEC drawer to RIO
port 0 of the primary I/O drawer. This connection is required to make the primary
drawer the first drawer in the loop, which allows the firmware to initialize the system.
The following figures provide cabling examples for all valid cabling configurations.
Match your configuration to the correct figure and connect your RIO cables as shown.
A0
CEC Drawer
A1
R1
Primary Drawer
R0
54 Service Guide
CEC Drawer Attached to Two I/O Drawers Type A
A0
CEC Drawer
A1
R1
Secondary
I/O Drawer R0
R1
Primary
I/O Drawer R0
A0 B0
CEC Drawer
A1 B1
R1 R1
56 Service Guide
CEC Drawer Attached to Three I/O Drawers
A0 B0
CEC Drawer
A1 B1
R1 R1
R1
Primary I/O
R0
Drawer
A0 B0
CEC Drawer
A1 B1
R1 R1
R1 R1
Primary I/O Secondary I/O
Drawer R0 Drawer R0
58 Service Guide
SPCN Cabling
The following figures provide cabling examples for all valid cabling configurations.
Match your configuration to the correct figure and connect your SPCN cables as shown.
CEC Drawer
V/S COMM
V/S COMM
CEC Drawer
V/S COMM
Primary
I/O Drawer
V/S COMM J15 J16
Secondary
I/O Drawer
J15 J16
60 Service Guide
CEC Drawer Attached to Three I/O Drawers
CEC Drawer
V/S COMM
Primary I/O
Drawer
V/S COMM J15 J16
Secondary I/O
Drawer
J15 J16
Secondary I/O
Drawer
J15 J16
CEC Drawer
V/S COMM
Primary I/O
Drawer
V/S COMM J15 J16
Secondary I/O
Drawer
J15 J16
Secondary I/O
Drawer
J15 J16
Secondary I/O
Drawer
J15 J16
62 Service Guide
System Cable Diagrams
The following diagrams show system cable connections.
P3 J 12
+12 V G +3.3 V
J3
P 13
J 10 J 11
J 13 J 15
S 14
S 16
System Board
S 17 Power Power
Supply 1 Supply 2
V/S COMM
JTAG
S 15 S 18
J21
Diskette
CD-ROM
PCI Board
Optional
3.3 V
Media
Grd
Term 5V
DASD 1
J6 J7
Media Misc.
Power Power B
DASD 2 u
3.3 V
Ground
l
Term
5V k
h
J5
e
DASD I/P
Power J4 a
J3
Fan 3 Fan 4 Base SPCN Op Panel d
Power
J28
Distribution
Power Fan 6 Diskette
Board
Supply 2 or J1
Cooling J2
Unit Fan 9 Base SPCN
Power Fan 5 J2 J3
Supply 1 I/O Board
Fan 1 Fan 2 J9
Fan
JTAG
Controller
Card J5 J30
SCSI
Op Panel J4
V/S COMM
64 Service Guide
Secondary I/O Drawer Cable
J21
Optional
Media
PCI Board
Optional
3.3 V
Media
Grd
Term 5V
B
J6 J7 u
Media Misc.
Power
l
Power
k
3.3 V h
Ground
e
5V
a
d
I/P
J4 J3
Fan 3 Fan 4 Base SPCN Op Panel
Power
Distribution I/O Board
Power Fan 6
Board
Supply 2 or J1
Cooling J2
Unit Fan 9 Base SPCN
Power Fan 5 J2 J3
Supply 1
Fan 1 Fan 2
Fan SCSI
Controller
Card
Op Panel
PS2
CEC
PS1
PS2 PS1
66 Service Guide
Specifications
This section contains specifications for the system. Individual sections are provided for
CEC drawer, rack, and I/O drawer specifications.
CEC Drawer
The mechanical packaging, cooling, power supply, and environmental requirements for
the CEC drawer are shown in the following table:
Dimensions
Height 355.6 mm 14.0 in.
Width 445.5 mm 17.5 in.
Depth 825.5 mm 32.5 in.
Weight
Minimum 69.7 kg 158 lbs.
Maximum 74.6 kg 169 lbs.
Electrical
Power source loading typical in kVA 0.39
Power source loading maximum in kVA 0.6
Voltage range (V ac) 200 to 240
Frequency (hertz) 50 or 60
Thermal output (typical) M80: 1265 Btu/hr
6M1: 772 Btu/hr
Thermal output (maximum) M80: 1877 Btu/hr
6M1: 1378 Btu/hr
Power requirements (typical) M80: 370 watts
6M1: 226 watts
Power requirements (maximum) M80: 550 watts
6M1: 406 watts
Power factor 0.95
Inrush current 34 amps
Maximum altitude 2135 m (7000 ft.)
Install/Air Flow Maintenance of a proper service clearance should allow proper air flow.
Service (See service clearances for the 7014 Series Model T00 Rack)
1. See “Noise Emission Notes” on page 71 for definitions of noise emissions positions.
2. Noise emissions data are based on the following configuration: the drawer is mounted in a
7014 Series Model T00 Rack, a power distribution unit is installed in the rack, and the system
is operating in a normal environment of 25 °C (78 °F).
3. Inrush currents occur only at initial application of power; no inrush occurs during normal
power off-on cycle.
Dimensions
Height 1804 mm 71.0 in.
Width 644 mm 25.5 in.
Depth (including doors) 1098 mm 43.3 in.
Weight1 (Base Rack) 244 kg 535 lbs.
Electrical (see specifications for drawers or enclosures)
Temperature Range (see specifications for drawers or enclosures)
Humidity Requirements (see specifications for drawers or enclosures)
Noise Emissions (see specifications for drawers or enclosures)
Clearances Front Back Left Right
Install/Air Flow Maintenance of a proper service clearance should allow proper air
flow.
Service See “System Service Clearances” on page 70.
1. Configuration-dependent, base weight not including weight of drawers.
68 Service Guide
I/O Drawer Specifications
Dimensions
Height 218.0 mm 8.6 in.
Width 445.0 mm 17.5 in.
Depth 820.0 mm 32.3 in.
Weight
Minimum configuration 41 kg 90 lbs.
Maximum configuration 52 kg 115 lbs.
Electrical
Power source loading (typical in kVA) 0.23
Power source loading (maximum in kVA) 0.54
Voltage range 200 to 240 V ac
Frequency (hertz) 50 / 60
Thermal output (typical) 750 Btu/hr
Thermal output (maximum) 1750 Btu/hr
Power requirements (typical) 220 watts
Power requirements (maximum) 515 watts
Power factor 0.95
Inrush current 41 amps
Maximum altitude 2135 m (7000 ft.)
Install/Air Flow Maintenance of a proper service clearance should allow proper air flow.
Service (See “System Service Clearances” on page 70)
1. See “Noise Emission Notes” on page 71 for definitions of noise emissions positions.
2. Noise emissions data are based on the following configuration: the drawer is mounted in a
7014 Series Model T00 Rack and a power distribution unit is installed in the rack.
3. Inrush currents occur only at initial application of power; no inrush occurs during normal
power off-on cycle.
For multiple racks placed side by side, the left and right clearances apply only to the
leftmost and rightmost rack.
Rack Configuration
80
Rear cover (3.1)
thickness
20 mm (0.8 in.) Caster
Location
2921 mm
915 mm 915 mm (115 in.)
(36 in.) (36 in.)
Front cover
Side cover
thickness
thickness 2x
58 mm (2.4 in.)
10 mm (0.4 in.)
Front
915 mm (36 in.)
Note: Rack units are large and heavy and are not easily moved. Because maintenance
activities require access at both the front and back, allow for extra room. The
footprint shows the radius of the swinging doors on the I/O rack. The figure
shows the minimum space required.
70 Service Guide
Noise Emission Notes
1. LWAd is the declared (upper limit) sound power emission level for a production
series of machines.
2. LpAm is the mean value of the A-weighted sound pressure emission levels at the
operator position (if any) for a production series of machines.
3. <LpA>m is the mean value of the space-averaged A-weighted sound pressure
emission levels at the one-meter positions for a production series of machines.
4. N/A = Not Applicable (no operator position).
5. All measurements are made in accordance with ISO 7779 and reported in
conformance with ISO 9296.
Power cables used in the United States and Canada are listed by Underwriter’s
Laboratories (UL) and certified by the Canadian Standards Association (CSA). These
power cords consist of the following:
v Electrical cables, Type SVT or SJT
v Attachment plugs complying with National Electrical Manufacturers Association
(NEMA) 5-15P
″For 230 V operation in the United States use a UL listed cable set consisting of a
minimum 18 AWG, Type SVT or SJT three-conductor cable a maximum of 15 feet in
length, and a tandem blade, grounding type attachment plug rated at 15 A, 250 V.″
v Appliance couplers complying with International Electrotechnical Commission (IEC)
Standard 320, Sheet C13
″For units set at 230 V (outside of U.S.): use a cable set consisting of a minimum 18
AWG cable and grounding type attachment plug rated 15 A, 250 V. The cable set
should have the appropriate safety approvals for the country in which the equipment will
be installed and should be marked HAR.
Refer to Chapter 11, “Parts Information” on page 523 to find the power cables that are
available.
72 Service Guide
Service Inspection Guide
Perform a service inspection on the system when:
v The system is inspected for a maintenance agreement.
v Service is requested and service has not recently been performed.
v An alterations and attachments review is performed.
v Changes have been made to the equipment that may affect its safe operation.
v External devices with their own power cables have those cables attached.
Note: The owner of the system is responsible to correct any unsafe conditions.
The Entry Map is the starting point for problem determination. The purpose of this MAP
is to quickly point to the appropriate MAP or service reference information either in this
book, or in the common book set, which includes the RS/6000 and Eserver pSeries
Diagnostic Information for Multiple Bus Systems and the PCI Adapter Placement
Reference.
The Quick Entry MAP is a subset of the Entry MAP and helps to save time for some
types of problems.
The Problem Determination MAP provides a structured analysis method to get an error
code if one is not provided by the customer, or if diagnostics cannot be loaded.
75
The Power MAP deals with isolation of components to diagnose a power problem. The
nature of power problems can be related to powering on and down the system, or
power failures that can occur after power is turned on.
The Minimum Configuration MAP is used to locate defective components not found by
normal diagnostics or error-isolation methods. This MAP provides a systematic method
of isolation to the failing item or items.
Checkpoints
These servers use various types of checkpoints, error codes, and SRNs, which are
referred to throughout this book (primarily in Chapter 4, “Checkpoints” on page 147,
Chapter 5, “Error Code to FRU Index” on page 171, Chapter 6, “Loading the System
Diagnostics In Service Mode” on page 401, and Chapter 11, “Parts Information” on
page 523). These codes may appear in the service processor boot progress log, the
AIX error log, and the operator panel display. Understanding the definition and
relationships of these codes is important to the service personnel who are installing or
maintaining these servers.
Codes that can appear on the operator panel or in error logs are as follows:
Checkpoints
Checkpoints display in the operator panel from the time ac power (dc power in
a dc-powered system) is connected to the system until the AIX login prompt is
displayed after a successful operating system boot. These checkpoints have
the following forms:
8xxx 8xxx checkpoints display from the time ac power (dc power in a
dc-powered system) is connected to the system until the OK prompt is
displayed by the service processor on the operator panel display.
(Several 9xxx checkpoints are displayed just before the OK prompt
displays.)
9xxx 9xxx checkpoints are displayed by the service processor after the
power-on sequence is initiated. A system processor takes over control
when 91FF displays on the operator panel.
Exxx Exxx checkpoints indicate that a system processor is in control and is
initializing the system resources. Control is being passed to AIX when
E105 (normal mode boot) or E15B (service mode boot) displays on the
operator panel.
76 Service Guide
Note: Certain checkpoints may remain in the display for long periods of time.
A spinning cursor is visible in the upper-right corner of the display
during these periods to indicate that system activity is continuing.
Error Codes
If a fault is detected, an 8-digit error code is displayed in the operator panel
display. A location may be displayed at the same time on the second line of
the display.
Checkpoints can become error codes if the system fails to advance past the
point at which the code was presented.
For a list of checkpoints, see Chapter 4, “Checkpoints” on page 147. Each
entry provides a description of the event and the recommended action if the
system fails to advance.
SRNs Service request numbers, in the form xxx-xxx or xxx-xxxx, may also be
displayed on the operator panel display and be noted in the AIX error log.
SRNs are listed in the RS/6000 and Eserver pSeries Diagnostic Information
for Multiple Bus Systems, order number SA38-0509.
FRU Isolation
For a list of error codes and recommended actions for each code, see Chapter 5, “Error
Code to FRU Index” on page 171. These actions can refer to Chapter 11, “Parts
Information” on page 523, Chapter 3, “Maintenance Analysis Procedures (MAPs)” on
page 81, or provide informational message and directions. If a replacement part is
indicated, direct reference is made to the part name. The respective AIX and physical
location codes are listed for each occurrence as required. For a list of locations codes,
see “AIX and Physical Location Code Reference Tables” on page 33.
To look up part numbers and view component diagrams, see Chapter 11, “Parts
Information” on page 523. The beginning of the chapter provides a parts index with the
predominant field replaceable units (FRUs) listed by name. The remainder of the
chapter provides illustrations of the various assemblies and components which make up
these servers.
Service Processor
The service processor runs on its own power boundary and continually monitors
hardware attributes, the AIX operating system, and the environmental conditions within
the system. Any system failure which prevents the system from returning to an
operational state (a fully functional AIX operating system) is reported by the service
processor. The service processor is controlled by firmware and does not require the AIX
operating system to be operational to perform its tasks. If any system failures are
detected, the service processor has the ability to take predetermined corrective actions.
The methods of corrective actions are:
v Surveillance
v Call home
v AIX operating system monitoring
Surveillance is a function in which the service processor monitors the system through
heartbeat communication with the system firmware. The heartbeat is a periodic signal
that the firmware can monitor. During system startup, the firmware surveillance monitor
is automatically enabled to check for heartbeats from the firmware. If a heartbeat is not
detected within a default period, the service processor cycles the system power and
attempts to restart until the system either restarts successfully, or a predetermined retry
threshold is reached. In the event the service processor is unsuccessful in bringing the
system online (or in the event that the user asked to be alerted to any service
processor-assisted restarts), the system can call home to report the error.
The call home function can be initialized to call either a service center telephone
number, a customer administration center, or a digital pager telephone number. The
service processor can be configured to stop at the first successful call to any of the
numbers listed, or can be configured to call every number provided. If connected to the
service center, the service processor transmits the relevant system information (the
system’s serial number and model type) and service request number (SRN). If
connected to a digital pager service, the service processor inputs a customer voice
telephone number defined by the customer. An established sequence of digits or the
telephone number to a phone near the failed system could be used to signal a system
administrator to a potential system failure.
During normal operations, the service processor can also be configured to monitor the
AIX operating system. If AIX does not respond to the service processor heartbeat, the
service processor assumes the operating system is hung. The service processor can
automatically initiate a restart and, if enabled, initiate the call home function to alert the
appropriate people to the system hang. Enabling operating system surveillance also
enables AIX detect any service processor failures and report those failures to the
Electronic Service Agent application.
78 Service Guide
Unlike the Electronic Service Agent, the service processor cannot be configured in a
client/server environment where one system can be used to manage all dial-out
functions for a set of systems.
Prior to installing the Electronic Service Agent feature, ensure that you have the latest
level of system firmware. You also need a properly configured modem. For more
information on configuring a modem, see “Modem Configuration Menu” on page 433.
The service center receives the machine type/serial number, host name, SRN, and a
problem description. The service center analyzes the problem report and, if warranted,
dispatches a service person to the customer site. The service center also determines if
any hardware components need to be ordered prior to the service person’s arrival.
The Electronic Service Agent code also gives the user the option to establish a single
system as the problem reporting server. A single system, accessible over the user
network, can be used as the central server for all the other systems on the local area
network (LAN) that are running the Electronic Service Agent application. If the
Electronic Service Agent application on a remote client decides a service request needs
to be placed, the client forwards the information to the Electronic Service Agent server
that dials the service center telephone number from its locally attached modem. In this
scenario, the user only needs to maintain a single analog line for providing call-out
capabilities for a large set of servers.
A modem is required for enabling automated problem reporting to the service center.
Configuration files for several types of modems are included as part of the Electronic
Service Agent package. Refer to “Modem Configuration Menu” on page 433 for more
information on configuring your modem.
Entry MAP
Use the following table to determine your starting point.
When possible, run online diagnostics in service mode. Online diagnostics perform
additional functions, compared to standalone diagnostics. This ensures that the error
state of the system is captured in NVRAM for your use in fixing the problem. The AIX
error log and SMIT are only available when diagnostics are run from the hard drive.
Notes:
1. Licensed programs frequently rely on system information stored in the VPD module
on the primary I/O drawer operator panel. If the MAPs indicate that the primary I/O
drawer operator panel should be replaced, swap the VPD module from the old
primary I/O drawer operator panel to the new one. If the old VPD module has to be
replaced call technical support for recovery instructions. If recovery is not possible,
notify the system owner that new keys for licensed programs may be required.
2. If a network adapter is replaced, the network administrator must be notified so that
the client IP addresses used by the server can be changed. In addition, the
operating system configuration of the network controller may need to be changed in
order to enable system startup. Also check to ensure that any client or server that
addresses this system is updated.
81
Symptom Starting Point
The system stops and a 4-digit number Go to the ″Fast Path MAP″ in the RS/6000 and
beginning with 0 or 2 is displayed in the Eserver pSeries Diagnostic Information for
operator panel display. Multiple Bus Systems.
All other symptoms. Go to “Quick Entry MAP”.
Attention: If you replace FRUs and the problem is still not corrected, go to “MAP
1540: Minimum Configuration” on page 122 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ″MAP 0410: Repair Checkout″
in the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Symptom Action
Service Actions
You have parts to exchange or a corrective 1. Go to Chapter 10, “Removal and
action to perform. Replacement Procedures” on page 467.
2. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic
Information for Multiple Bus Systems.
82 Service Guide
Symptom Action
You need to verify that a part exchange or Go to ″MAP 0410: Repair Checkout″ in the
corrective action corrected the problem. RS/6000 and Eserver pSeries Diagnostic
Information for Multiple Bus Systems.
You need to verify correct system operation. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic
Information for Multiple Bus Systems.
A Repeating Pattern of Error Code Data Appears on the Primary I/O Drawer Operator Panel
Display
A sequence of displays, each appearing for five 1. Record the 8-digit error code seen after the
seconds, repeats continuously as follows: a five second blank display.
blank display, followed by an eight-digit error
code, followed by up to nine full panels of dump 2. Reset the service processor via the pinhole
data. reset.
3. Go to “Checkpoint and Error Code Index”
on page 173 with the error code you have
recorded.
OK does not appear in the primary I/O drawer operator panel display before pressing the
power on button
A bouncing or scrolling ball remains on the If an ASCII terminal is available, connect it to
primary I/O drawer operator panel display or the primary I/O drawer through serial port 1.
the primary I/O drawer operator panel display is 1. If the service processor menu is displayed,
filled with dashes then:
a. Replace the primary I/O drawer operator
panel assembly, location: U0.1-L1. Refer
to Chapter 10, “Removal and
Replacement Procedures” on page 467.
b. Replace the primary I/O drawer
backplane, location: U0.1-P1. Refer to
Chapter 10, “Removal and Replacement
Procedures” on page 467.
2. If the service processor menu is not
displayed, then replace the primary I/O
drawer backplane, location: U0.1-P1. Refer
to Chapter 10, “Removal and Replacement
Procedures” on page 467.
84 Service Guide
Symptom Action
You have an 8-digit error code of the form 10n1 Shut down the system and remove all but one
xxxx and two or more drawers have the same of the drawers with duplicate drawer numbers.
drawer number. Refer to the steps in “MAP 1540: Minimum
Configuration” on page 122 for a procedure to
connect/reconnect I/O drawers. Attempt to
reboot the system.
1. If the system fails, you have found the
failing drawer. Substitute 2 for n when
looking up the error code in “Checkpoint
and Error Code Index” on page 173.
2. If the system boots, shut down and add
another of the detached drawers back into
the system. Repeat steps one and two until
the failing drawer is isolated.
86 Service Guide
Symptom Action
The system stops and all of the POST Go to “MAP 1540: Minimum Configuration” on
indicators are displayed on the system console. page 122 to isolate the problem.
POST indicators refers to the device
mnemonics (words memory, keyboard, network,
scsi, and speaker) that appear on the system
console during the power-on self-test (POST).
The system stops and the message STARTING Go to Chapter 4, “Checkpoints” on page 147.
SOFTWARE PLEASE WAIT... is displayed on the
system console.
The system does not respond to the password v If the password is being entered, or the
being entered or the system login prompt is service mode boot is being initiated on a
displayed when booting in service mode. keyboard attached to the system, then the
keyboard or its controller may be faulty. In
this case, replace the following, one at a
time, in the sequence listed:
1. Keyboard
2. Primary I/O backplane, location: U0.1-P1
v If the password is being entered, or the
service mode boot is being initiated, from an
ASCII terminal, first insure that the terminal
is connected to S1 or S2. Use the problem
determination procedures for the ASCII
terminal.
If the problem persists, replace the primary
I/O backplane, location: U0.1-P1.
No codes are displayed on the operator panel Reseat the operator panel cable. If problem not
within a few seconds of turning on the system. resolved, replace these parts, one at a time, in
The operator panel displays OK before the the order listed:
system is powered on. 1. Operator panel assembly. Swap the VPD
module.
2. Primary I/O drawer backplane, location:
U0.1-P1. (See notes on 81.)
The SMS configuration list or boot sequence A device may be set to use the same SCSI bus
selection menu shows more SCSI devices ID as the control adapter. Note the ID being
attached to a controller/adapter than are used by the controller/adapter (this can be
actually attached. checked and/or changed via an SMS utility),
and verify that no device attached to the
controller is set to use that ID.
88 Service Guide
MAP 1020: Problem Determination
Use this MAP to get an error code if you were not provided one by the customer or you
are unable to load diagnostics. If you are able to load the diagnostics, go to ″MAP
0020: Problem Determination Procedure″ in the RS/6000 and Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
The service processor may have recorded one or more symptoms in its error log. If the
system has attempted to boot since an error terminated normal operation, there may be
specific fault information in the service processor error log. Examine this error log
before proceeding (see “System Information Menu” on page 424).
The service processor may have been set by the user to monitor server operations and
to attempt recoveries. You can disable these actions while you diagnose and service
the system. If the system was set up according to the recommendations of the RS/6000
Enterprise Server Model M80, Eserver pSeries 660 Model 6M1 User’s Guide, all the
settings of the service processor (except language) were saved by using the
SAVE/RESTORE HARDWARE MAINTENANCE POLICIES service aid. You can use the
same service aid to restore the settings at the end of your service action.
Just in case the service processor settings were not saved by the user, if you disable
them, make notes of their current settings for restoration before you leave.
In addition to the parameters in the table below, you can disconnect the modem to
prevent incoming signals that could cause the system to power on.
Following are the service processor settings. The service processor menus are
described in Chapter 8, “Using the Service Processor” on page 411.
Be prepared to record code numbers and use those numbers in the course of analyzing
a problem. Go to “Step 1020-1” on page 90.
Note: Be prepared to answer questions regarding the operator panel display and
perform certain actions based on displayed POST indicators.
1. Power off the system. Refer to “Powering Off the System” on page 469.
2. Select slow boot mode (select disable fast boot) on the System Power Control
menu from the service processor main menu.
3. Power on the system. Refer to “Powering On the System” on page 469
4. Insert the diagnostic CD-ROM into the CD-ROM drive.
5. When the keyboard indicator is displayed (the word Keyboard), press the 5 key on
the system console.
6. Enter a password, if requested.
7. Wait until the diagnostics are loaded or the system appears to stop.
8. Find your symptom in the following table, then follow the instructions given in the
Action column.
Symptom Action
The system stopped and a code is displayed on Go to the “Entry MAP” on page 81.
the operator panel.
The system stops with a prompt to enter a Enter the password. You are not allowed to
password. continue until a correct password has been
entered. When you have entered a valid
password go to the beginning of this table and
wait for one of the other conditions to occur.
The diagnostic operating instructions are Go to ″MAP 0020: Problem Determination
displayed. Procedure″ in the RS/6000 and Eserver
pSeries Diagnostic Information for Multiple Bus
Systems.
The primary I/O drawer power on/off LED does Go to “MAP 1520: Power” on page 94.
not come on or does not stay on or you have a
power problem.
The system login prompt is displayed. You may not have pressed the correct key or
you may not have pressed the key soon
enough when you were to trying to indicate a
service mode IPL of the diagnostic programs. If
this is the case, start over at the beginning of
this step.
Note: Perform the system shutdown procedure
before turning off the system.
90 Service Guide
Symptom Action
The system stopped. A POST indicator is If the POST indicator represents:
displayed on the system console and an 1. Memory, go to “MAP 1540: Minimum
eight-digit error code is not displayed. Configuration” on page 122.
2. Keyboard
a. Replace the keyboard.
b. Go to “MAP 1540: Minimum
Configuration” on page 122.
3. Network, go to “MAP 1540: Minimum
Configuration” on page 122.
4. SCSI, go to “MAP 1540: Minimum
Configuration” on page 122.
5. Speaker, “MAP 1540: Minimum
Configuration” on page 122.
The System Management Services menu is Go to “Step 1020-4” on page 93.
displayed
All other symptoms. If you were directed here from the Entry MAP,
go to “MAP 1540: Minimum Configuration” on
page 122. Otherwise, find the symptom in the
“Entry MAP” on page 81.
Find the type of keyboard you are using in the following table; then follow the
instructions given in the Action column.
Step 1020-3
Perform the following:
1. Find the eight-digit error code in Chapter 5, “Error Code to FRU Index” on page 171.
Note: If the eight-digit error code is not listed in the Error Code to FRU Index, look
for it in the following:
v Any supplemental service manuals for attached devices
v The diagnostic problem report screen for additional information
v The Service Hints service aid
v The CEREADME file (by using the Service Hints service aid).
Note: Service aids can be found in RS/6000 and Eserver pSeries Diagnostic
Information for Multiple Bus Systems.
2. Perform the action listed.
92 Service Guide
Step 1020-4
1. Turn off, then turn on the system unit.
2. When the keyboard indicator (the word keyboard) is displayed, press the 1 key on
the system console.
3. When the System Management Services menus appear, check the error log for any
errors.
a. Choose Error Log
b. If an error is logged, check the time stamp.
c. If the error was logged during the current boot attempt, record it.
d. Look up the error in Chapter 5, “Error Code to FRU Index” on page 171 and do
the listed action.
e. If no recent error is logged in the error log, go to “MAP 1540: Minimum
Configuration” on page 122.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage
on metal parts of the system or the devices that attach to the system. It is the
responsibility of the customer to ensure that the outlet is correctly wired and
grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for
the system unit and all attached devices are unplugged.
During an electrical storm, do not connect cables for display stations, printers,
telephones, or station protectors for communications lines.
CAUTION:
This product is equipped with a three–wire power cable and plug for the user’s
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical
shock, disconnect two power supply cords before servicing.
CAUTION:
Energy hazard, remove power before servicing. Disconnect two power supply
cords.
94 Service Guide
Find the symptom in the following table; then follow the instructions given in the Action
column.
Symptom Action
Power Problems
The primary I/O drawer does not become Go to “Map 1521: Cannot Power On the
powered on, no OK prompt. Primary I/O Drawer, Operator Panel Does Not
Show OK Prompt” on page 96.
The CEC drawer does not become powered on. Go to “Map 1522: Cannot Power On the CEC
Drawer” on page 103.
The secondary I/O drawer does not become Go to “Map 1523: Cannot Power On the
powered on, no power present LED. Secondary I/O Drawer, Operator Panel Power
Present LED Does Not Come On” on page 110.
You have been directed here by the SPCN Go to “Map 1524: The SPCN Error Codes
Error Codes table and the error code is 1011 Table Directed You Here and the Error Code is
1C0x. 1011 1C0x” on page 118.
Step 1521-1
Attempt to power on the primary I/O drawer (see “Powering Off and Powering On the
System” on page 469)
Step 1521-2
Look at the power supply LEDs at the front of the primary I/O drawer.
Step 1521-3
Is the power supply LED flashing?
NO Go to “Step 1521-7” on page 97.
YES Go to “Step 1521-4”.
Step 1521-4
Use the information in the “I/O Drawer and Power Supply LED Status” on page 26 to
answer the following question.
Is the power supply LED blinking quickly?
NO Check the SPCN cabling.
Check internal drawer cabling from power distribution card to primary I/O
backplane. See “Primary I/O Drawer Cables” on page 538.
Replace the I/O drawer power supply.
Replace the primary I/O drawer backplane.
Replace the I/O drawer power distribution card.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Replace the power supply with the flashing LED.
96 Service Guide
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1521-5
Is there a power supply with a flashing LED?
NO Go to “Step 1521-7”.
YES Go to “Step 1521-6”.
Step 1521-6
Are both I/O drawer power supply LEDs flashing?
NO Replace the power supply with the flashing LED.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Check internal drawer cabling from power distribution card to primary I/O
backplane. See “Primary I/O Drawer Cables” on page 538.
Replace the following, one at a time:
v Primary I/O drawer BACKPLANE
v Power distribution board
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1521-7
Are all SPCN cables seated firmly and connected correctly between the primary
I/O drawer, secondary I/O drawers, and CEC drawer?
NO Fix the cable connections.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1521-8” on page 98.
Step 1521-9
Are all circuit breakers reset at the rack power distribution panel?
NO Reset the popped breaker by first turning off, then turning on.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1521-10”.
Step 1521-10
Are all the dc power cables seated firmly and connected correctly from the rack
power distribution panel to the rack drawers?
NO Fix the connection.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1521-11”.
Step 1521-11
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
98 Service Guide
Step 1521-12
Are all circuit breakers reset at the rack power distribution unit(s)?
NO Reset the popped breaker.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1521-13”.
Step 1521-13
Are all the ac power cables seated firmly and connected correctly from the rack
power distribution unit(s) to the rack drawers?
NO Fix the connection.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1521-14”.
Step 1521-14
Follow the ac power cable(s) from the primary I/O drawer to the appropriate power
distribution unit.
Step 1521-15
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1521-17
Check internal drawer cabling from power distribution card to primary I/O backplane.
See “Primary I/O Drawer Cables” on page 538.
Replace the following in the primary I/O drawer, one at a time, disconnecting the ac
power cable (dc power cable for a dc-powered system) to each power supply before
replacement and reconnecting the ac power cable (dc power cable for a dc-powered
system) to each supply after replacement:
v Primary I/O drawer operator panel
v Fan control card
v I/O drawer power supply (including redundant power supply, if present)
v Primary I/O drawer backplane
Does the primary I/O drawer operator panel show the OK prompt?
NO Contact your next level of support for assistance.
YES This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1521-19
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1522-1
Attempt to power on the CEC drawer (see “Powering Off and Powering On the System”
on page 469).
Step 1522-2
Look at the power supply LEDs at the back of the CEC drawer.
Step 1522-3
Are all SPCN cables seated firmly and connected correctly between the primary
I/O drawer, secondary I/O drawers and CEC drawer?
NO Fix the SPCN cable connections
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1522-4”.
Step 1522-4
Is this machine connected to dc power?
NO Go to “Step 1522-8” on page 104.
YES Go to “Step 1522-5” on page 104.
Step 1522-6
Are all the dc power cables seated firmly and connected correctly from the rack
power distribution panel to the rack drawers?
NO Fix the connection.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1522-7”.
Step 1522-7
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1522-8
Are all circuit breakers reset at the rack power distribution unit(s)?
NO Reset the popped breaker.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1522-9” on page 105.
Step 1522-10
Follow the ac power cable(s) from the CEC drawer to the appropriate power distribution
unit.
Step 1522-11
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1522-13
Disconnect the ac power cable (dc power cable for a dc-powered system) to each
power supply of the CEC drawer and remove the following components from the CEC
drawer, one at a time, in the sequence listed:
1. CEC processor card(s)
2. CEC RIO adapter card
3. CEC memory card(s)
Reconnect the ac power cables (dc power cable for a dc-powered system) to each
power supply of the CEC drawer and attempt to power on the CEC.
Step 1522-15
Have all cards in this step been reinserted?
NO Go to “Step 1522-14”.
YES The symptom has changed. Call your next level of support for assistance.
This ends the procedure.
Step 1522-16
Replace the following in the CEC drawer, one at a time, disconnecting the ac power
cable (dc power cable in a dc-powered system) to each power supply before
replacement and reconnecting the ac power cable (dc power cable for a dc-powered
system) to each power supply after replacement:
1. CEC regulator card
2. CEC backplane
Step 1522-18
Is the ac power cable from the rack power distribution unit to the customer’s ac
power outlet connected and seated correctly at both ends?
NO Connect the ac power cable correctly at both ends.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1522-19”.
Step 1522-19
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1522-20
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1523-1
Attempt to power on the secondary I/O drawer (see “Powering Off and Powering On the
System” on page 469)
Step 1523-2
Look at the power supply LED(s) at the front of the secondary I/O drawer.
Step 1523-3
Is the power supply LED flashing?
NO Go to “Step 1523-7” on page 112.
YES Go to “Step 1523-4” on page 111.
Step 1523-5
Is there a power supply with a flashing LED?
NO Go to “Step 1523-7” on page 112.
YES Go to “Step 1523-6”.
Step 1523-6
Are both I/O drawer power supply LEDs flashing?
NO Replace the power supply with the flashing LED.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Check internal drawer cabling from power distribution card to secondary I/O
backplane. See “Secondary I/O Drawer Cables” on page 540.
Replace the following, one at a time:
v Secondary I/O drawer backplane
v Power distribution board
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1523-8
Is this machine connected to dc power?
NO Go to “Step 1523-12” on page 113.
YES Go to “Step 1523-9”.
Step 1523-9
Are all circuit breakers reset at the rack power distribution panel?
NO Reset the popped breaker by first turning off, then turning on.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1523-10” on page 113.
Step 1523-11
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1523-12
Are all circuit breakers reset at the rack power distribution unit(s)?
NO Reset the popped breaker.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1523-13” on page 114.
Step 1523-14
Follow the ac power cable(s) from the secondary I/O drawer to the appropriate power
distribution unit.
Step 1523-15
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1523-17
Check internal drawer cabling from power distribution card to secondary I/O backplane.
See “Secondary I/O Drawer Cables” on page 540.
Replace the following in the secondary I/O drawer, one at a time, disconnecting the ac
power cable (dc power cable for a dc-powered system) to each power supply before
replacement and reconnecting the ac power cable (dc power cable for a dc-powered
system) to each supply after replacement:
v Secondary I/O drawer operator panel
v Fan control card
v Power distribution card
v I/O drawer power supply (including redundant power supply, if present)
v Secondary I/O drawer backplane
Does the secondary I/O drawer operator panel turn on the power present LED?
NO Contact your next level of support for assistance.
YES This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1523-19
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Note: Read the danger and caution notices under “Safety Notices” on page xi before
continuing with this procedure.
Step 1524-1
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. At the top of the CEC drawer, remove the following cards, in the sequence listed:
a. CEC regulator at M01
b. CEC memory riser 1 at M02
c. CEC processor at M03
d. CEC processor at M04 (if present)
e. CEC memory riser 2 at M05 (if present)
f. CEC RIO adapter at M06
4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the
CEC drawer.
5. Turn on the power.
Step 1524-2
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. Remove CEC power supply 2.
4. Reconnect the ac power cable (dc power cable in a dc-powered system) to the CEC
power supply 1.
5. Turn on the power.
Step 1524-4
1. Turn off the power.
2. Reinstall CEC power supply 2 that was removed in “Step 1524-2” on page 118.
3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the CEC
power supply 2.
4. Disconnect the ac cable (dc power cable in a dc-powered system) to CEC power
supply 1.
5. Remove CEC power supply 1.
6. Turn on the power.
Step 1524-6
1. Turn off the power.
2. Reinstall CEC fan 1 that was removed in “Step 1524-5”.
3. Remove CEC fan 2. Disregard a CEC fan error code if it occurs during this step.
4. Turn on the power.
Step 1524-8
Have all the cards that were removed in “Step 1524-7” been reinstalled?
NO Go to “Step 1524-7” and reinstall the next card in the list.
YES Replace the CEC backplane, location: U1.1-P1.
This ends the procedure. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Step 1540-2
1. When the keyboard indicator is displayed (the word keyboard), press the 5 key on
the system console.
2. Enter the appropriate password when you are prompted to do so.
Step 1540-3
The system is unable to boot standalone diagnostics.
Check the service processor error log and the primary I/O drawer operator panel for
additional error codes resulting from the slow boot in “Step 1540-1”.
Did the slow boot generate a different error code from the one that originally sent
you to MAP 1540?
YES Restore fast boot mode (select enable fast boot) on the System Power Control
menu from the service processor main menu. Go to “Checkpoint and Error
Code Index” on page 173 and follow the actions for the new error code.
NO Continue “Step 1540-3” below.
The boot attempts that follow will attempt to get the Please define the System Console
prompt on the system console. Ignore any codes that may appear on the operator
panel unless stated otherwise.
Use the following table to locate, if possible, the error code that sent you to minimum
configuration.
First Four Characters of the Error Code Last Four Characters of the Error Code
406x 0009
00A1
00A3
00A6
00A7
450x 245B
460x 245C
4B2x 245D
24A5
24A6
25B3
25B4
267D
2733
27AA
27AB
288A
B006 1406
1407
1408
B1xx 4601
4602
4648
4660
4690
4691
469E
469F
Use the following table tolocate, if possible, the checkpoint that sent you to minimum
configuration.
Did you find the error code or checkpoint that sent you here in the previous
tables or did the action that sent you to MAP 1540 explicitly tell you to run the
CEC drawer minimum configuration?
NO Go to “Step 1540-13” on page 130.
YES Go to “Step 1540-4”.
Step 1540-4
1. Turn off the power.
2. At the CEC drawer, disconnect the RIO cable at RIO port 0.
3. At the primary I/O drawer, disconnect the RIO cable at RIO port 0. The RIO cable
should now be loose and can be removed.
4. At the CEC drawer, disconnect the RIO cable at RIO port 1 and reconnect it to RIO
port 0.
5. Follow the RIO cable you have connected to RIO port 0 of the CEC drawer back to
an I/O drawer, disconnect the I/O drawer end (RIO port 1, note location) of the RIO
cable and reconnect it to the primary I/O drawer RIO port 0.
6. With only one RIO cable connected between the CEC drawer RIO port 0 and the
primary I/O drawer RIO port 0, power the system on to boot standalone diagnostics
from CD.
Step 1540-6
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. Reinsert the following cards and DIMMs, in the sequence listed:
a. Memory DIMMs in octals B, C, and D on memory riser card 1 at M02, if
previously present.
b. Memory riser card 2 at M05, if previously present.
4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the
CEC drawer.
5. Turn on the power to boot standalone diagnostics from CD.
6. When the keyboard indicator is displayed (the word Keyboard), press the 5 key on
the system console.
7. Enter the appropriate password when you are prompted to do so.
Is the Please define the System Console screen displayed?
NO The card you just reinserted is defective; replace with a new card. Return the
machine to the original configuration. Go to ″MAP 0410: Repair Checkout″ in
the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
YES Go to “Step 1540-7” on page 127.
Step 1540-8
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. Replace the memory DIMMs in octal A on memory riser card 1 with memory DIMMs
previously removed, if any, or replace with new memory DIMMs.
4. Connect the ac power cables (dc power cables in a dc-powered system) to the CEC
drawer and turn on the power to boot standalone diagnostics from CD.
Step 1540-9
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. Replace the memory riser card at M02 with a new memory riser card.
4. Connect the ac power cables (dc power cables in a dc-powered system) to the CEC
drawer and turn on the power to boot standalone diagnostics from CD.
Step 1540-11
Attention: Contact your second level of support before performing this action.
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the
CEC drawer.
3. Replace the CEC backplane, processor at M03, and processor at M04 (if installed)
with new components.
4. Reinsert the original CEC RIO bus adapter at M06.
5. Connect the ac power cables (dc power cables in a dc-powered system) to the CEC
drawer and turn on the power to boot standalone diagnostics from CD.
Are there any devices missing from the list of all adapters and devices?
NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the
system to its original configuration. Be sure to select fast boot mode (select
enable fast boot) on the System Power Control menu from the service
processor main menu. Go to ″MAP 0410: Repair Checkout″ in the RS/6000
and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
YES The boot attempts that follow will attempt to isolate any remaining I/O drawer
problems with missing devices. Ignore any codes that may appear on the
operator panel unless stated otherwise.
Go to “Step 1540-13” on page 130.
Select fast boot mode (select enable fast boot) on the System Power Control menu
from the service processor main menu.
Look at the rear of the CEC drawer and locate the JTAG cable (connected to the rear
of the CEC drawer connector position shown in Chapter 1).
Follow the JTAG cable back to the primary I/O drawer. Examine SPCN cable
connections at J15 and J16. These cables, if any, go to secondary I/O drawers.
Note: Before continuing, check the cabling from the CEC drawer to I/O drawer(s) to
ensure that the system is cabled correctly. Refer to “Cabling the CEC Drawer,
Primary I/O Drawer, and Secondary I/O Drawer(s)” on page 53 for valid
configurations. Record the current cabling configuration and then continue with
the following steps.
1. Turn off the power.
2. At the CEC drawer, disconnect the cable connection at RIO port 0.
3. At the primary I/O drawer, disconnect the cable connection at RIO port 0. The RIO
cable that was connected to RIO port 0 should now be loose and can be removed.
4. At the CEC drawer, disconnect the cable connection at RIO port 1 and reconnect it
to RIO port 0.
5. Follow the RIO cable you have connected to RIO port 0 of the CEC drawer back
to a secondary I/O drawer; disconnect the I/O drawer end (RIO port 1, note
location) of the RIO cable and reconnect it to the primary I/O drawer RIO port 0.
6. Verify that primary I/O drawer RIO port 0 is connected to the CEC drawer RIO port
0.
7. Turn on the power to boot standalone diagnostics from CD.
8. If the Please define the System Console screen is displayed, follow directions to
select the system console.
9. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
10. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-15” on page 132.
YES The RIO cable that was removed in step 3 above is defective. Replace the
RIO cable. Connect the new RIO cable from CEC drawer RIO port 1 to the
secondary I/O drawer RIO port 1 noted in step 5 above. Reconfigure the
system to its original condition. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Note: The primary I/O drawer should be cabled as shown in “Cabling the CEC
Drawer, Primary I/O Drawer, and Secondary I/O Drawer(s)” on page 53. Be
sure to look at both RIO cables and SPCN cables.
7. Disconnect ac power cables (dc power cables in a dc-powered system) from
remaining I/O drawer(s).
Attention: For ac-powered systems, the rack that you are working on may have
I/O drawers that are connected to other CEC drawers or other third-party drawers
attached to the power distribution unit. To avoid removing power from drawers that
are used by other systems, when you are instructed to unplug power from an I/O
drawer, remove only the power distribution unit (PDU) side of the single ac cables
connected to the I/O drawer in question.
8. All I/O drawers except the primary I/O drawer should now be physically
disconnected from the CEC drawer.
Go to “Step 1540-16”.
Step 1540-16
The primary I/O drawer is the only remaining I/O drawer connected to the CEC drawer.
1. Turn on the power to boot standalone diagnostics from CD.
2. If the Please define the System Console screen is displayed, follow directions to
select the system console.
3. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
4. Check that all attached devices and adapters are listed.
If the Please define the System Console prompt did not display or all attached devices
and adapters are not listed, the problem is in the primary I/O drawer.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-17” on page 133.
YES Go to “Step 1540-27” on page 139.
Step 1540-19
Are there any adapters in slots 10, 11, 12, 13 or 14?
NO Go to “Step 1540-25” on page 138.
YES Go to “Step 1540-22” on page 135.
Step 1540-20
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 1, 2, 3, and 4 that are not attached to the boot
device.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the
TTY terminal’s keyboard.
7. If the Please define the System Console screen is displayed, follow directions to
select the system console.
8. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-18”.
YES Go to “Step 1540-23” on page 136.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-19” on page 134.
YES Go to “Step 1540-23” on page 136.
Step 1540-22
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 10, 11, 12, 13, and 14 that are not attached to the
boot device.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the
TTY terminal’s keyboard.
7. If the Please define the System Console screen is displayed, follow directions to
select the system console.
8. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-25” on page 138.
YES Go to “Step 1540-23” on page 136.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-24” on page 137.
YES Reinstall the next adapter or device and return to the beginning of this step.
Continue repeating this process until an adapter or device causes the Please
define the System Console screen to not display or all attached devices and
adapters to not be listed.
After installing all of the adapters and the Please define the System Console
screen does display and all attached devices and adapters are listed, return
the system to its original configuration. Go to ″MAP 0410: Repair Checkout″ in
the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO The I/O drawer backplane is defective. Replace the I/O drawer backplane; then
continue below.
YES The adapter was defective. Continue below.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Replacel the next part in the list and return to the beginning of this step.
Continue repeating this process until a part causes the Please define the
System Console screen to be displayed and all attached devices and adapters
to be listed.
If you have replaced all the items listed above and the Please define the
System Console screen does not display or all attached devices and adapters
are not listed, check any external devices and cabling. If you do not find a
problem, contact your next level of support for assistance.
YES Go to “Step 1540-26” on page 139.
Reconfigure the system to its original condition. Go to ″MAP 0410: Repair Checkout″ in
the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Step 1540-27
If the Please define the System Console screen does display and all attached devices
and adapters are listed, the problem is with a remaining secondary I/O drawer. Turn off
the power.
Is the primary I/O drawer the only drawer connected at this time?
NO Go to “Step 1540-28” on page 140.
YES Reconnect a secondary I/O drawer in the configuration that you recorded
earlier in this MAP.
Note: The I/O drawers should be cabled as shown in “Cabling the CEC
Drawer, Primary I/O Drawer, and Secondary I/O Drawer(s)” on page 53.
Be sure to look at both RIO cables and SPCN cables.
1. Connect ac power (dc power in a dc-powered system) to this secondary
I/O drawer.
2. All I/O drawers except the primary I/O drawer and one secondary I/O
drawer should now be physically disconnected from the CEC drawer.
Note: The I/O drawers should be cabled as shown in “Cabling the CEC Drawer,
Primary I/O Drawer, and Secondary I/O Drawer(s)” on page 53. Be sure to look
at both RIO cables and SPCN cables.
Go to “Step 1540-29”
Step 1540-29
1. Turn on the power to boot standalone diagnostics from CD.
2. If the Please define the System Console screen is displayed, follow directions to
select the system console.
3. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
4. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Is the SRC that sent you to MAP 1540 20FB99xy?
NO Go to “Step 1540-30”.
YES Go to “Step 1540-41” on page 146.
YES Is there another I/O drawer to connect?
NO The problem has changed. Call your next level of support.
YES Go to “Step 1540-28”.
Step 1540-30
The problem is in the last secondary I/O drawer that was connected. To deconfigure
this secondary I/O drawer, perform the following steps.
Step 1540-32
Are there any adapters in slots 10, 11, 12, 13, or 14?
NO Go to “Step 1540-38” on page 144.
YES Go to “Step 1540-35” on page 142.
Step 1540-33
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 1, 2, 3, and 4 that are not attached to the boot
device.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the
TTY terminal’s keyboard.
7. If the Please define the System Console screen is displayed, follow directions to
select the system console.
8. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-31”.
YES Go to “Step 1540-36” on page 143.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-32” on page 141.
YES Go to “Step 1540-36” on page 143.
Step 1540-35
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 10, 11, 12, 13, and 14 that are not attached to the
boot device.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the
TTY terminal’s keyboard.
7. If the Please define the System Console screen is displayed, follow directions to
select the system console.
8. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-38” on page 144.
YES Go to “Step 1540-36” on page 143.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-37”.
YES Reinstall the next adapter or device and return to number 1 in this step.
Continue repeating this process until an adapter or device causes the Please
define the System Console screen to not be displayed or all attached devices
and adapters to not be listed.
Step 1540-37
Replace the adapter or device that is causing the Please define the System Console
screen to not be displayed or all attached devices and adapters to not be listed with a
new adapter.
Reinstall all remaining adapters and reconnect all devices. Return the system to its
original configuration. Go to ″MAP 0410: Repair Checkout″ in the RS/6000 and
Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Replace the next part in the list and return to the beginning of this step.
Continue repeating this process until a part causes the boot to be successful.
If you have replaced all the items listed above and the boot is not successful,
check any external devices and cabling. If you do not find a problem, contact
your next level of support for assistance.
YES Go to “Step 1540-39”.
Step 1540-39
1. Turn off the power.
2. The item just replaced fixed the problem.
3. Plug in all adapters that were previously removed but not reinstalled.
Reconfigure the system to its original condition. Go to ″MAP 0410: Repair Checkout″ in
the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Note: Before continuing, check the cabling from the CEC drawer to I/O drawer(s) to
ensure that the system is cabled correctly. Refer to “Cabling the CEC Drawer,
Primary I/O Drawer, and Secondary I/O Drawer(s)” on page 53 for valid
configurations. Record the current cabling configuration and then continue with
the following steps.
1. Turn off the power.
2. At the CEC drawer, disconnect the cable connection at RIO port 0.
3. At the primary I/O drawer, disconnect the cable connection at RIO port 0. The RIO
cable that was connected to RIO port 0 should now be loose and can be removed.
4. At the CEC drawer, disconnect the cable connection at RIO port 1 and reconnect it
to RIO port 0.
5. At the primary I/O drawer, disconnect the RIO connection at RIO port 1. Reconnect
this RIO cable end to RIO port 0.
6. Verify that primary I/O drawer RIO port 0 is connected to the CEC drawer RIO port
0.
7. Turn the power on to boot standalone diagnostics from CD.
8. If the Please define the System Console screen is displayed, follow directions to
select the system console.
9. Use the Display Configuration and Resource List to list all attached devices and
adapters (refer to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems if necessary).
10. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Go to “Step 1540-17” on page 133
YES The RIO cable that was removed in step 3 above is defective. Replace the
RIO cable. Connect the new RIO cable from CEC drawer RIO port 1 to the
primary I/O drawer RIO port 1. Reconfigure the system to its original condition.
Go to ″MAP 0410: Repair Checkout″ in the RS/6000 and Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Did the Please define the System Console screen display and are all attached
devices and adapters listed?
NO Replace the next part in the list and return to the beginning of this step.
Continue repeating this process until a part causes the Please define the
System Console screen to be displayed and all attached devices and adapters
to be listed
If you have replaced all the items listed above and the Please define the
System Console screen does not display or all attached devices and adapters
are not listed, check any external devices and cabling. If you do not find a
problem, contact your next level of support for assistance.
YES The last item that was replaced in step 3 above was defective. Reconfigure the
system to its original condition. Go to ″MAP 0410: Repair Checkout″ in the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
IPL Flow
The IPL process starts when ac power (dc power in a dc-powered system) is connected
to the system. The IPL process has four phases:
v Phase 1: Service Processor Initialization
Phase 1 starts when ac power (dc power in a dc-powered system) is connected to
the system and ends when OK is displayed in the primary I/O drawer operator panel.
8xxx checkpoints are displayed during this phase. Several 9xxx codes may also be
displayed. Service processor menus are available at the end of this phase by
pressing any key on the console keyboard.
v Phase 2: Hardware Initialization by the Service Processor
Phase 2 starts when system power up is initiated by pressing the power on button on
the primary I/O drawer operator panel. 9xxx checkpoints are displayed during this
time. 91FF, the last checkpoint in this phase, indicates the transition to Phase 3 is
taking place.
v Phase 3: System Firmware initialization
At Phase 3, a system processor takes over control and continues initializing system
resources. During this phase, checkpoints in the form Exxx are displayed. E105, the
last checkpoint in this phase, indicates that control is being passed to the AIX boot
program. Location codes may also be displayed in the second line of the primary I/O
drawer operator panel.
v Phase 4: AIX Boot
When AIX starts to boot, checkpoints in the form 0xxx are displayed. Location codes
may also be displayed in the second line of the primary I/O drawer operator panel.
This phase ends when the AIX login prompt appears on the AIX console. The
primary I/O drawer operator panel should be blank.
147
The following table outlines the IPL phases from power-on to the AIX login prompt,
matching the phases to corresponding operator-panel checkpoints for a typical system
boot. Also listed are the approximate boot times for each phase, along with their
dependencies for each phase.
Before taking actions listed with a checkpoint, check for additional symptoms in the
service processor error log. See the “System Information Menu” on page 424 for
information on how to access the service processor error log.
Note: Go to “MAP 1540: Minimum Configuration” on page 122 for any of the following
checkpoint conditions. If mentioned as an action, remember to run the System
Rack Minimum Configuration:
v A four-digit code in the range of 8xxx through EFFF not listed in the
checkpoint tables.
v A four-digit code is in the checkpoint tables, but does not contain a repair
action or FRU listing.
v All of the FRUs listed in the repair action have been replaced and the problem
has not been corrected.
Checkpoints listed in the following tables are in the format 8xxx, 9xxx, Axxx,
Bxxx, or Exxx, where x is any hexadecimal digit from 1-9 or A-F. If your system
hangs with a displayed checkpoint that begins with anything other than 8, 9, A,
B, or E, go to the RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems.
Symptom Action
You have a code that is of the form 8xxx, 9xxx, Go to “Service Processor Checkpoints” on
Axxx, or Bxxx. page 150.
You have a code of the form Exxx. Go to “Firmware Checkpoints” on page 156.
You have a four-character code that is not of Refer to the RS/6000 and Eserver pSeries
the form 8xxx, 9xxx, Axxx, Bxxx, or Exxx. Diagnostic Information for Multiple Bus Systems
manual.
Note: A spinning (or twirling) slash will be shown in the upper-right corner of the
operator panel display while some checkpoints are being displayed. If the slash
stops spinning, a hang condition is indicated.
Notes:
1. If the system fails to progress after this checkpoint is put in the display, do the
following:
a. Activate the service processor pinhole reset switch on the system’s operator
panel.
b. If the hang repeats, check with service support to see if there is a firmware
update that fixes the problem.
c. If not, or the update does not fix the problem, change the primary I/O backplane.
d. If this procedure does not fix the problem, call service support.
2. If the system fails to progress after this checkpoint is put in the display, do the
following:
a. Activate the service processor pinhole reset switch on the system’s operator
panel.
b. If the hang repeats, check with service support to see if there is a firmware
update that fixes the problem.
c. If not, or the update does not fix the problem, go to “MAP 1540: Minimum
Configuration” on page 122.
d. If this procedure does not fix the problem, call service support.
3. Recovery mode flash update has failed. Do the following:
a. Activate the service processor pinhole reset switch on the system’s operator
panel and try the recovery mode flash update again.
b. If the system hangs again, make a new set of firmware update diskettes, using
new, blank diskettes. Retry the flash update with the new diskettes.
c. If the same hang occurs with the new diskettes, replace the following FRUs one
at a time in this order:
1) Diskette drive
2) I/O backplane in the primary I/O drawer
d. If this procedure does not fix the problem, call service support.
4. The service processor was unable to configure the modem(s) attached to the serial
port(s). Do the following:
a. Check the modem, modem cable, and serial port for loose or unconnected
cables. Reset the service processor using the pinhole reset switch on the
primary I/O drawer operator panel.
b. If the hang repeats, check with service support to see if there is a firmware
update that fixes the problem.
c. If there is not, or the update does not fix the problem, replace the following
FRUs one at a time in this order:
1) Modem cable
Attention: If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs or perform an action and the problem is corrected, go to ″MAP
0410: Repair Checkout″ in the RS/6000 and Eserver pSeries Diagnostic Information
for Multiple Bus Systems.
Note:
Firmware can hang due to multiple causes. In many cases, it can be caused by
the primary processor or a memory problem in the first 64 MB of real memory. If
a location code is not provided with the checkpoint, the following procedure will
use the primary processor and low memory as the probable FRUs.
1. If firmware is hung (E100 through EFFF) and a location code is displayed on
the second line of the operator panel, use that location code as the primary
FRU.
2. If no location code is present or if replacing the primary FRU indicated in the
location code did not fix the problem, then perform the following actions:
a. Power off the system and go to the service processor menus.
b. From the service processor main menu, select option 3, System
Information Menu, then select option 5, Read NVRAM.
c. Enter 330 when prompted for an address.
d. Look at the byte of data at address 330. If the value is FF, go to note 3. If
the value is 00-0F, replace the processor card, location: U1.1-P1-C1.
Attempt to reboot the system. If the system hangs at the same
checkpoint, go to 3. If the system hangs at a new checkpoint, perform the
actions associated with the new checkpoint.
3. Power off the system. From the service processor main menu, select option
3, System Information Menu, then option 8, Memory
Configuration/Deconfiguration Menu. Identify the first configured memory
riser card and octal of DIMMs. Replace the first octal of DIMMs.
4. If replacing the first octal of configured DIMMs does not fix the problem,
replace the memory riser card. If the problem is still not fixed, go to “MAP
1540: Minimum Configuration” on page 122
Note: If you receive a four-digit code in the range of E1xx to EFFF that is not listed in
the table above, go to “MAP 1540: Minimum Configuration” on page 122.
Note: For network boot attempts, if the system is not connected to an active network or
if the target server is inaccessible (this can also result from incorrect IP
parameters being supplied), the system will still attempt to boot. Because
time-out durations are necessarily long to accommodate retries, the system may
appear to be hung.
This procedure assumes that a CD-ROM drive is connected to the internal SCSI
connector and a diagnostics CD-ROM is available.
Step 1
Restart the system and access the firmware SMS Main Menu. Select Multiboot.
1. If the intended boot device is correctly specified in the boot list:
a. Remove all removable media from devices in the boot list from which you do not
want to boot.
b. If attempting to boot from the network, go to “Step 2”.
c. If attempting to boot from a disk drive or CD-ROM, go to “Step 3” on page 168.
2. If the intended boot device is not correctly identified in the boot sequence, add it to
the boot sequence using the SMS menus. If the intended boot device cannot be
added to the boot sequence, go to “Step 3” on page 168.
3. Remove all removable media from devices in the boot list from which you do not
want to boot.
4. If attempting to boot from the network, go to “Step 2”.
5. If attempting to boot from a disk drive or CD-ROM, go to “Step 3” on page 168.
Step 2
If attempting to boot from the network:
1. Verify that IP parameters are correct.
2. Attempt to ″ping″ the target server using the SMS Ping utility. If the ping is not
successful, have the network administrator verify the server configuration for this
client.
3. Check with the network administrator to ensure that the network is up.
4. Check the network cabling to the adapter.
5. Turn the power off, then on and retry the boot operation.
Step 4
It is possible that another installed adapter is causing the problem. Do the following:
1. Remove all installed adapters except the one the CD-ROM drive is attached to and
the one used for the console.
2. Try to boot the standalone diagnostics again.
3. If unable to load standalone diagnostics, go to “Step 5” on page 169.
4. If standalone diagnostics load, reinstall adapters (and attached devices as
applicable) one at a time and retry the boot operation until the problem recurs. Then
replace the adapter or device that caused the problem. (The system board can also
cause the device or adapter to fail.) Go to ″MAP 0410: Repair Checkout″ in
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
The Error Code to FRU Index lists fault symptoms and possible causes. The most likely
cause is listed first. Use this index to help you decide which FRUs to replace when
servicing the system.
v If you replace FRUs or perform an action and the problem is still not corrected, go to
“MAP 1540: Minimum Configuration” on page 122 unless otherwise indicated in the
tables.
v If you replace FRUs or perform an action and the problem is corrected, go to ″MAP
0410: Repair Checkout″ in RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Four-Character Checkpoints
Four-character progress codes and checkpoints in the form 8xxx, 9xxx, Axxx, Bxxx, and
Exxx are listed in “Firmware Checkpoints” on page 156. If you have a four-character
code that begins with any character other than 8, 9, A, B, or E, refer to the RS/6000
and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
171
Analyzing SPCN Errors on Secondary Drawers
The drawer number of a secondary I/O drawer (which displays in the operator panel in
the form U0.n, where n is the drawer number) does not change unless a repair action
or a configuration change has occurred. However, after a secondary I/O drawer has
been removed, added, or had its backplane replaced, there is a window of time during
the IPL process in which the drawer number may appear to be out of range. For
example, if there is one I/O drawer in the system and one is added, it may temporarily
be assigned a drawer number of 5 or greater. The drawer numbers are reallocated and
finalized during the latter stages of the boot process.
If a fatal SPCN error occurs while the drawer has a temporary number, the system halts
with an 8-character error code displayed in the form 10n1xxxx, where n is the drawer
number. To determine the error, if n is not equal to 1, substitute the number 2 for n,
then look up the error code in Chapter 5, “Error Code to FRU Index” on page 171.
Perform the indicated actions on the failing drawer.
If a nonfatal SPCN error occurs while the drawer has a temporary number, this
temporary number may appear as part of a location code in the service processor error
log or while running AIX diagnostics. If the nonfatal error became fatal or is the cause of
a fatal error, the drawer address will have been resolved and the original error log entry
can be ignored.
To determine which I/O drawer is failing, you may find the attention LED on an I/O
drawer indicates a fault. Check the LEDs on the front of each I/O drawer to isolate
failures to a single drawer. See “Primary I/O Drawer Operator Panel” on page 22 and
“Secondary I/O Drawer Operator Panel” on page 22 for the location and meaning of the
LEDs on the I/O drawer operator panels.
Go to “Operator Panel Error Codes” on page 176 and follow the instructions
in the operator panel error code table.
1xxx xxxx System Power Control Network (SPCN) Error Codes
Go to “SPCN Error Codes” on page 177 and follow the instructions in the
SPCN error code table.
2xxx xxxx Firmware Error Codes
In the service processor error log, the most recent error is at the top of the list and the
oldest error is at the bottom. Examine the time stamp associated with each error. (The
time stamps in the service processor error log reflect coordinated universal time (UTC),
which is also referred to as Greenwich mean time.)
On this system, the error code displayed in the operator panel may not be indicative of
the root cause of the problem. This is also true of the most recent error in the service
processor error log.
Find the error code in the service processor error log that directed you to this note. (If
the error code is not in the error log, perform the service actions for the error code that
was first reported.) Examine the time stamps to look for a group of failures that
happened within minutes prior to this error code. In this group, start at the error code
with the earliest time stamp. To get a list of possible FRUs, select the detail screen for
each error code and perform the listed repair actions. If the repair actions for an error
code are ineffective, continue to work upward through this group of error codes (which
is forward in time from the earliest time stamp to the latest) until the problem is fixed, or
you reach the error code that was first reported. If the system is still not fixed, go to
“MAP 1540: Minimum Configuration” on page 122.
Boot the system to the SMS menus. Power off the system and go to the service
processor Memory Configuration/Deconfiguration Menu to verify that all memory
DIMMs are in the configured state after the repair action. Go to ″MAP 0410: Repair
Checkout″ in RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems.
Attention: If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs and the problem is corrected, go to ″MAP 0410: Repair Checkout″
in RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Attention: If a power supply or fan is hot-swapped, wait five minutes, then recheck
the AIX error log. If the system has to be shut down to replace a power supply or fan,
reboot the system, wait five minutes after the AIX login prompt appears, then recheck
the AIX error log. If the detailed information for the most recent error with the same
identifier indicates that the original problem has not been fixed by performing the first
action in the action column, continue with the next action in the action column.
Attention: If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs and the problem is corrected, go to ″MAP 0410: Repair Checkout″
in RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Attention: If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs and the problem is corrected, go to ″MAP 0410: Repair Checkout″
in the RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
1 = Reserved
Example: 000.000.000.000
Example: 000.000.000.000
20EE 0005 Invalid IP parameter (>255) Enter valid IP parameter.
Example: 255.192.002.000
20EE 0006 No recognized SCSI adapter present This warning occurs when the
selected SMS function cannot locate
any SCSI adapter supported by the
function. If a supported SCSI adapter
is installed:
1. Replace SCSI adapter(s).
2. Replace the I/O drawer
backplane in drawer(s) containing
SCSI adapter(s).
20EE 0007 Keyboard not found 1. Plug in keyboard.
2. Replace primary I/O drawer
backplane
Location: U0.1-P1
20EE 0008 No configurable adapters were found This warning occurs when the remote
in the system by the remote IPL IPL menu in the SMS utilities cannot
menu in the SMS utilities locate any LAN adapters that are
supported by the remote IPL function.
If a supported device is installed:
1. Replace the device or adapter.
2. Replace the I/O drawer planar in
thedrawer(s) containing the
adapter.
20EE 0009 Unable to communicate with the Replace primary I/O drawer
service processor backplane
Location: U0.1-P1
20EE 000B The system was not able to find an Go to “Boot Problems and Concerns”
operating system boot record on the on page 167.
devices listed in the boot list. This
can be caused by the wrong devices
being in the boot record, a problem
that prevents the system from
communicating with the device that
contains the boot record, or the boot
record being corrupted.
58 32 MB memory DIMM
68 256 MB memory DIMM
78 1 GB memory DIMM
Notes:
1. Refer to “System Memory” on
page 24 for the memory DIMM
configuration rules. Memory
DIMMs must be replaced with a
supported type of memory DIMM,
and all memory DIMMs in a
group must be of the same size.
2. Check the system error logs to
determine the location code
associated with the error code.
Refer to the “Logical and Physical
Locations” on page 29 to locate
the memory DIMM and the rest of
its group.
Attention: Follow the procedure defined in “Checkpoint and Error Code Index” on
page 173. If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs or perform an action and the problem is corrected, go to ″MAP
0410: Repair Checkout″ in RS/6000 and Eserver pSeries Diagnostic Information for
Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Table 7. Service Processor Error Codes
Error Code Description Action/
Possible Failing FRU
4066 0001 Scan interface basic assurance tests CEC backplane
detected CEC backplane failure Location: U1.1-P1
4066 0003 Scan interface basic assurance tests 1. Verify loose or defective JTAG
detected processor fault cable or connector.
2. All processors
Locations: U1.1-P1-C1
U1.1-P1-C2
4066 0009 Scan interface basic assurance tests All memory
detected memory fault Location: U1.1-P1-M1 x2
4066 00A0 Scan interface basic assurance tests Update system firmware
detected CSP service processor Location: U0.1-P1/Y1
firmware fault
4066 00A1 Scan interface basic assurance tests 1. Check for AIX updates
detected AIX or system firmware fault 2. Check for system firmware
updates
Location: U0.1-P1/Y1
4066 00A2 Scan interface basic assurance tests Check for system firmware updates
detected service processor firmware Location: U0.1-P1/Y1
(processor runtime diagnostics) fault
4066 00A3 Scan interface basic assurance tests All pluggable memory
detected memory plugging fault
4066 00A4 Scan interface basic assurance tests Application software
detected application-caused fault
4066 00A5 Scan interface basic assurance tests Verify system components are
detected invalid system configuration supported and properly installed.
Attention: Follow the procedure defined in “Checkpoint and Error Code Index” on
page 173. If you replace FRUs or perform an action and the problem is still not
corrected, go to “MAP 1540: Minimum Configuration” on page 122 unless otherwise
indicated in the tables.
If you replace FRUs and the problem is corrected, go to ″MAP 0410: Repair Checkout″
in RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Table 9. Common Firmware Error Codes
Error Code Description Action/
Possible Failing FRU
B006 1403 Missing service processor resource 1. Update system firmware
Location: U0.1-P1/Y1
2. Primary drawer I/O backplane
Location: U0.1-P1
B006 1404 Service processor firmware fault 1. Update system firmware
Location: U0.1-P1/Y1
2. Primary drawer I/O backplane
Location: U0.1-P1
B006 1405 Service processor firmware fault 1. Update system firmware Location:
U0.1-P1/Y1
2. Primary drawer I/O backplane
Location: U0.1-P1
B006 1406 Service processor program error 1. Update system firmware
Location: U0.1-P1/Y1
2. Check for AIX updates
3. Primary drawer I/O backplane
Location: U0.1-P1
B006 1407 Service processor resource allocation 1. Update system firmware
problem Location: U0.1-P1/Y1
2. Check for AIX updates
3. Primary drawer I/O backplane
Location: U0.1-P1
B006 1408 Service processor program error 1. Update system firmware
Location: U0.1-P1/Y1
2. Check for AIX updates
3. Primary drawer I/O backplane
Location: U0.1-P1
B006 1409 Service processor program error 1. Update system firmware
Location: U0.1-P1/Y1
2. Primary drawer I/O backplane
Location: U0.1-P1
B10F 1000 Service processor failure Primary I/O drawer backplane
Location: U0.1-P1
For more information about using standalone or online diagnostics, see “Using the
Standalone and Online Diagnostics” in RS/6000 and Eserver pSeries Diagnostic
Information for Multiple Bus Systems, order number SA38-0509.
Attention: If this system unit is attached to another system, be sure you isolate this
system unit before stopping the operating system or running diagnostic programs.
Some system-cabling changes (such as installing wrap plugs or removing a device from
the configuration) may require action by the operator of the attached system before
making the cabling changes on this system.
To load standalone diagnostics from the default boot list, perform the following
procedure:
1. Verify with the system administrator and users that all programs may be stopped,
then do so.
2. Turn off the system.
Online diagnostics can be loaded by booting from a hard file in the custom boot list by
pressing the numeric 6 key on the keyboard.
To boot from the devices listed in the service mode boot list, perform the following
procedure:
1. Verify with the system administrator and users that all programs may be stopped,
then do so.
2. Turn off the system.
401
5. Enter any requested passwords.
After any requested passwords have been entered, the system attempts to boot from
the first device of each type found on the list. If no bootable image is found on the first
device of each type on the list, the system does not search through the other devices of
that type for a bootable image; instead, it polls the first device of the next type.
If all types of devices in the boot list have been polled without finding a bootable image,
the system restarts. This action gives the user the opportunity to start the System
Management Services (by pressing the number 1 key on a directly attached keyboard
or the number 1 on an ASCII terminal) before the system attempts to boot again.
Pressing the numeric 5 key on the keyboard causes the system to load standalone
diagnostics if a diagnostic CD is present in the CD-ROM drive.
Pressing the numeric 6 key on the keyboard loads online diagnostics from the hard
file in the customized service mode boot list. No diagnostic CD should be in the
CD-ROM drive because online diagnostics are typically run from a hard disk. Like the
default boot list, the service mode boot list can contain five entries. The numeric 6 key
works like the numeric 5 key with the following exceptions:
v The system searches for a boot record according to the service mode boot list.
v If the service mode boot list is discovered by a cyclical redundancy check to be
corrupted, the system rebuilds the service mode boot list according to the default
boot list. (The default boot list contains five entries, and for each matching device
type found in the system unit, the system makes an entry in the custom boot list.)
v If no service mode boot list is present, the system uses the default boot list.
Note: The Capacity Upgrade on Demand feature is available for this system when a
minimum of four processors running at 750 MHz processor are installed. The
Capacity Upgrade on Demand feature is not available on systems in which 500
MHz processors are installed.
System Processors
The processors are ordered by feature code (FC) for the system on which you are
installing the additional capacity. The following feature codes can be ordered with the
system, or added as upgrades to allow additional processors to be installed in the
system and then enabled later.
FC 8306
This Capacity Upgrade on Demand feature code causes a processor board
with four processors to be installed, but none of the processors are enabled.
Only one of these feature codes can be installed.
FC 8307
This Capacity Upgrade on Demand feature code causes a processor board
with four processors to be installed, and two of the processors are enabled.
After the system is running with any of the above features installed, if the system
administrator enables additional processors on a processor board, the records for the
system are changed to show that the appropriate feature code is in use.
As processors are enabled by the system administrator, the feature records for the
system are automatically updated in the AIX error logs to reflect the number of
processors in use. For system records purposes, the error logs record changes to the
processor configuration and the changes are sent, using the Electronic Service Agent,
to the CUoD (Capacity Upgrade on Demand) project office.
403
Process Flow for Capacity Upgrade on Demand
The following flowchart provides an overview of the process that is required to enable
Capacity Upgrade on Demand on systems that need reserve capacity.
If the Capacity Upgrade on Demand feature is already installed in your system and you
want to upgrade to use additional processors, go to “Enabling Additional Capacity
Upgrade on Demand Processors” on page 409.
Note: Ensure that the system and all new hardware have been installed before
beginning this procedure.
To install a system with the Capacity Upgrade on Demand feature hardware, do the
following:
1. Log in to the system as root user or as a user with system group privileges.
2. Type the following command:
chcod
where (xx) is the number of processors installed in the system, (yy) is the number of
Capacity Upgrade on Demand-enabled processors.
If the number of installed processors is not correct, use the normal maintenance
package procedures to correct any problems.
6. To test the system, type the following at the command line:
diag
7. Select Advanced Diagnostics.
8. Select System Verification mode on sysplanar0 and all processors. Use the normal
maintenance package procedures to correct any problems.
Note: You can obtain the customer number and the customer phone number from
the customer or the marketing representative. Use that information when you
use the chcod command. For example:
chcod -c Jane_Doe-CustomerNumber_9999999-Phone_(333)_444-5555
To verify that the information was entered correctly, type the following at the
command line:
chcod
The RESOURCE INSTALLED is equal to the total number of installed processors (xx).
OLD VALUE
0
NEW VALUE
........
The NEW VALUE must equal (yy), recorded in step 2 on page 406.
5. Shutdown and reboot the system, as follows:
shutdown -Fr
6. Log in to the system as root user or as a user with system group privileges.
7. Type the following at the command line:
chcod
The values of (xx) and (yy) should match the values recorded in step 2 on
page 406.
The processor Capacity Upgrade on Demand feature is enabled and ready to be used.
If you want to enable additional processors, go to “Enabling Additional Capacity
Upgrade on Demand Processors” on page 409.
If the Capacity Upgrade on Demand feature is already installed in your system and you
want to upgrade to use additional processors, continue with this section.
The values of (xx) and (yy) are the current values for the number of processors
installed (xx) and the number of authorized processors (yy). Use this information
about the current configuration to determine the number of processors to enable.
3. Continue with “Adding Processor Capacity” on page 410 to enable any additional
processors that you need.
In the example above, the value of -n 6 is equal to the total number of processors
that you want to have authorized (yy) by entering this command. To upgrade your
system to use more of the installed processors, increase the value of (yy) in
increments of 2. In this case, 6 would represent 4 base processors plus 2 additional
Capacity Upgrade on Demand processors.
The values of (yy) should match the value you entered for the
total_number_of_processors in step 1.
For details about the flag variables entered in the chcod command, see Appendix D,
“Using the chcod Command” on page 555.
Service processor menus are available using an ASCII terminal when OK is displayed on
the primary I/O drawer operator panel or when the service processor has detected a
server problem (such as a surveillance failure).
For a summary of the service processor functions and the methods used to invoke
them, see the following table.
411
Service Processor Menus
The service processor menus are divided into two groups:
v General user menus - the user must know the general-access password.
v Privileged user menus - the user must know the privileged-access password.
If the server is powered off, the service processor menus can be accessed locally or
remotely.
When you gain access, the service processor prompts you for a password (if set), and
when verified, displays the service processor menus.
The service processor menu prompt, represented by 0> or 1>, indicates the serial port
to which the terminal is connected.
v 0> indicates serial port 1 (S1)
v 1> indicates serial port 2 (S2)
The AIX service aid, Save or Restore Hardware Management Policies, can be used to
save your settings after initial setup or whenever the settings must be changed for
system operation purposes.
It is strongly recommended that you use this AIX service aid for backing up service
processor settings to protect the usefulness of the service processor and the availability
of the server. Refer to “Save or Restore Hardware Management Policies,” in the
“Introducing Tasks and Service Aids” section of the RS/6000 and Eserver pSeries
Diagnostic Information for Multiple Bus Systems, SA38-0509.
1. Power-on System
2. Power-off System
1>
v Power-on System
Allows the user to start the system using the current ASCII terminal as the active
console.
v Power-off System
Allows the user to power off the system using the current ASCII terminal as the
active console.
v Read VPD Image from Last System Boot
Displays manufacturer vital product data, such as serial numbers, part numbers, and
so on, that were stored during the system boot prior to the one in progress now.
v Read Progress Indicators from Last System Boot
Displays a number of the boot progress indicators, which may include service
processor checkpoints, IPL checkpoints, and/or AIX configuration codes, from the
previous system boot. This information can be useful in diagnosing system faults.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This information is not stored in non-volatile storage. If the system is powered off
using the power-on button on the operator panel, this information is retained. If the
ac power (dc power in a dc-powered system) is disconnected from the system, this
information will be lost. For an example, refer to “LCD Progress Indicator Log” on
page 451.
v Read Service Processor Error Logs
Main Menu
At the top of the Main Menu is a listing containing:
v Your system’s current firmware version
v The firmware copyright notice
v The system name given to your server during setup
You need the firmware version for reference when you either update or repair the
functions of your service processor.
The system name, an optional field, is the name that your server reports in problem
messages. This name helps your support team (for example, your system administrator,
network administrator, or service representative) to more quickly identify the location,
configuration, and history of your server. The system name is set from the Main Menu
using option 6.
Note: The information under the Service Processor Firmware heading in the following
Main Menu illustration is example information only.
MAIN MENU
1>
1>
Note: Unless otherwise stated in menu responses, settings become effective when a
menu is exited using option 98 or 99.
For security purposes, the service processor counts the number of attempts to enter
correct passwords. The results of not recognizing a correct password within this error
threshold are different, depending on whether the attempts are being made locally (at
the server) or remotely (through a modem). The error threshold is three attempts.
If the error threshold is reached by someone entering passwords at the server, the
service processor commands the server to resume the initial program load (IPL). This
action is taken based on the assumption that the server is in an adequately secure
location with only authorized users having access. Such users must still successfully
enter a login password to access AIX.
If the error threshold is reached by someone entering passwords remotely, the service
processor commands the server to power down to prevent potential security attacks on
the server by unauthorized remote users. The following table lists what you can access
with the privileged-access password and the general-access password.
Note: If you forget the password, you must remove the battery for at least 30 seconds
to disable the password.
v Change Privileged-Access Password
Set or change the privileged-access password. It provides the user with access to all
service processor functions. This password is usually used by the system
administrator or root user.
v Change General-Access Password
Set or change the general-access password. It provides limited access to service
processor menus, and is usually available to all users who are allowed to power on
the server, especially remotely.
Note: The general-access password can only be set or changed after the
privileged-access password is set.
1. Surveillance:
Currently Enabled
3. Surveillance Delay:
10 minutes
1>
– Surveillance
Can be set to Enabled or Disabled.
– Surveillance Delay
Can be set to any number from 0 through 255.
Refer to “Service Processor System Monitoring - Surveillance” on page 446 for more
information about surveillance.
v Reset Service Processor
If this option is selected, entering Y causes the service processor to reboot.
1>
Use the system reset string option to enter the system reset string, which resets the
machine when it is detected on the main console on serial port 1.
Use the snoop serial port option to select the serial port to snoop.
After serial port snooping is correctly configured, at any point after the system unit is
booted to AIX, whenever the reset string is typed on the main console, the system
unit reboots.
Pressing Enter after the reset string is not required, so make sure that the string is
not common or trivial. A mixed-case string is recommended.
4. Power-On System
5. Power-Off System
1>
1. Ring-indicate power-on :
Currently Enabled
2. Number of rings:
Currently 3
Ring-indicate is enabled by default on both serial port 1 (S1) and serial port 2 (S2).
When ring-indicate power-on is enabled, call-in is disabled.
If ring-indicate power-on is enabled and call-in is already enabled, you will be asked
to confirm your choice. Refer to the message displayed on your screen.
Note: In order for the ring-indicate power-on settings to take effect and ring-indicate
power-on to be enabled, the service processor menus must be exited
completely. Enter 99 at the Main Menu screen to completely exit the service
processor menus.
v Reboot/Restart Policy Setup Menu
The following menu controls reboot/restart policy:
1>
Reboot is the process of bringing up the system hardware; for example, from a
system reset or power on. Restart is activating the operating system after the system
hardware is re-initialized. Restart must follow a successful reboot.
– Number of reboot attempts - If the server fails to successfully complete the boot
process, it attempts to reboot the number of times specified. Values equal to or
greater than 0 are valid. Only successive failed reboot/restart attempts are
counted.
– Use OS-Defined restart policy - Allows the service processor to react or not
react in the same way as the operating system to major system faults by reading
the setting of the operating system parameter Automatically Restart/Reboot
After a System Crash. This parameter may or may not be defined, depending on
the operating system or its version/level. If the operating system automatic restart
setting is defined, then it can be set to respond to a major fault by restarting or by
not restarting. See your operating system documentation for details on setting up
operating system automatic restarts. The default value is Yes.
1>
To select a boot mode, select a number and press Enter. The item corresponding to
the selected number toggles from disabled to enabled. If the same number is
selected again, the item toggles from enabled to disabled. If a boot mode is enabled,
the boot mode selected is performed, and the disabled/enabled selection is reset to
disabled. Following is a description for each boot mode:
– Boot to SMS Menu
When this selection is enabled, the system boots to the System Management
Services (SMS) menu.
– Service Mode Boot from Saved List
This selection causes the system to perform a service mode boot using the
service mode boot list saved in NVRAM. If the system boots AIX from the disk
drive and AIX diagnostics are loaded on the disk drive, AIX boots in service mode
to the diagnostics menu.
Using this option to boot the system is the preferred way to run online diagnostics.
– Service Mode Boot from Default List
This selection is similar to “Service Mode Boot from Saved List”, except the
system boots using the default boot list that is stored in the system firmware. This
is normally used to try to boot customer diagnostics from the CD-ROM drive.
Using this option to boot the system is the preferred way to run standalone
diagnostics.
– Boot to Open Firmware
When this selection is enabled, the system boots to the open firmware prompt.
This option should only be used by service personnel to obtain additional debug
information.
5. Read NVRAM
1>
1>
Note: This table is built from vital product data collected during the last boot
sequence. The first time the system is powered up, or after the system’s
non-volatile RAM (NVRAM) has been erased, this table may be empty. The
table is rebuilt during the next boot into AIX.
Note: The processor numbering scheme used by the service processor is different
from the numbering scheme used by AIX. Consult the AIX documentation
before configuring or deconfiguring a processor to ensure that the correct
processor is selected.
0>
Note: This table is built from vital product data collected during the last boot
sequence. The first time the system is powered up, or after the system’s
non-volatile RAM (NVRAM) has been erased, this table may be empty. The
table is rebuilt during the next boot into AIX.
The DIMMs in the system are presented by group. Octal A is shown first, followed by
octals B, C, and D, if present. The menu selection index/DIMM number and the
DIMM address correspond to the numbers shown in “Memory Riser Card 1 Memory
DIMM Locations for Service Processor Menus” on page 428 and “Memory Riser Card
2 Memory DIMM Locations for Service Processor Menus” on page 429.
7: 11.40 8: 11.41
5: 11.32 6: 11.33
3: 11.24 4: 11.25
1: 11.16 2: 11.17
7: 10.72 8: 10.73
5: 10.64 6: 10.65
3: 10.56 4: 10.57
1: 10.48 2: 10.49
1>
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
6. Svenska
1>
Note: Your ASCII terminal must support the ISO-8859 character set to correctly display
languages other than English.
7. Call-Out Test
1>
Note: Modem configuration is only supported on serial port 1 (S1) and serial port 2
(S2).
To make changes, First select the port and then the configuration file
name
Modem Ports:
1. Serial port 1
2. Serial port 2
0>
Call-in and ring-indicate power-on cannot be enabled at the same time. If ring-indicate
power-on is already enabled and you try to enable call-in, a message prompts you for
confirmation. Refer to the message displayed on the screen.
1>
A serial port speed of 9600 baud or higher is recommended. Valid serial port speeds
are as follows:
50 600 4800
75 1200 7200
110 1800 9600
134 2000 19200
150 2400 38000
300 3600 57600
115200
1>
v Service Center Telephone Number is the number of the service center computer.
The service center usually includes a computer that takes calls from servers with
call-out capability. This computer is referred to as the ″catcher.″ The catcher expects
messages in a specific format to which the service processor conforms.
For more information about the format and catcher computers, refer to the README
file in the AIX /usr/samples/syscatch directory. Contact your service provider for the
correct service center telephone number to enter. Until you have that number, leave
this field unassigned.
v Customer Administration Center Telephone Number is the number of the system
administration center computer (catcher) that receives problem calls from servers.
Contact your system administrator for the correct telephone number to enter here.
Until you have that number, leave this field unassigned.
v Digital Pager Telephone Number is the number for a numeric pager carried by
someone who responds to problem calls from your server. Contact your
administration center representative for the correct telephone number to enter. For
test purposes, use a test number, which you can change later. See the note on page
447 for information about using certain modems for paging.
Note: At least one of the preceding three telephone numbers must be assigned in
order for the call-out test to execute successfully.
v Customer Voice Telephone Number is the telephone number of a phone near the
server or answered by someone responsible for the server. This is the telephone
number left on the pager for callback. For test purposes, use a test number, which
you can change later.
v Customer System Telephone Number is the telephone number to which your
server’s modem is connected. The service or administration center representatives
need this number to make direct contact with your server for problem investigation.
This is also referred to as the call-in phone number.
4. Number of retries:
Currently 2
1>
v Call-Out policy can be set to first or all. If call-out policy is set to first, the
service processor stops at the first successful call-out to one of the following
numbers in the order listed:
1. Service center
2. Customer administrative center
3. Pager
If call-out policy is set to all, the service processor attempts a call-out to all of the
following numbers in the order listed:
1. Service center
2. Customer administrative center
3. Pager
v Remote timeout and remote latency are functions of your service provider’s catcher
computer. Either use the defaults or contact your service provider for recommended
settings.
v Number of retries is the number of times you want the server to retry calls that fail
to complete.
1>
When service mode is exited, the service processor parameters revert to the customer
settings.
Restart describes activating the operating system after the system hardware is
re-initialized. Restart must follow a successful reboot.
If you are using the AIX operating system, the menu item under SMIT for setting the
restart policy is Automatically Reboot After Crash (True/False). The default is false.
When the setting is true, and if the service processor parameter ″Use OS-Defined
Restart Policy″ is yes (the default), the service processor takes over for AIX to
reboot/restart after a hardware or surveillance failure.
If the operating system has no automatic restart policy, or if it is disabled, then the
service processor restart policy can be controlled from the service processor menus.
Use the Enable Supplemental Restart Policy selection.
When this setting is no, or if the operating system did not set a policy, the service
processor refers to enable supplemental restart policy for its action.
Enable supplemental restart policy - The default setting is no. If set to yes, the
service processor restarts the server when the operating system loses control and
either:
The Use OS-Defined restart policy is set to no.
OR
The Use OS-Defined restart policy is set to yes and the operating system has no
automatic restart policy.
The following table describes the relationship among the operating system and service
processor restart controls:
1
Service processor default
2
AIX default
Always check the pSeries & RS/6000 Support Web site for the latest firmware images
and update instructions. The Web address is: http://www.rs6000.ibm.com/support/micro
Downloading the firmware update image and update instructions from the Web is
preferred.
If the system is running, but access to the Web is not available, see “System Firmware
Update Using a Locally Available Image” on page 443.
If the system cannot be powered on, but the service processor menus are available,
see “Updating System Firmware From the Service Processor Menus” on page 444.
If the service processor programming has been corrupted, the service processor will
automatically enter recovery mode when power is applied to the system. Recovery
mode is described later in this section.
To check the level of firmware that is currently on the system, see “Determining the
Level of Firmware on the System” on page 443.
It is strongly recommended that you create a set of firmware update diskettes from the
Web site as soon as possible after a system is installed. As noted below, the diskette
images can be downloaded to any personal computer, pSeries, or RS/6000 system with
Internet access. This method is the only way to acquire a set of firmware update
diskettes; they cannot be created by using the service processor menus or by using
AIX.
Retain and store the latest firmware diskettes each time the firmware gets updated in
the event that the firmware becomes corrupted and must be reloaded.
Download the detailed instructions, as well as the latest flash images, from this Web
site. Before doing a system firmware update, check the Web site for the latest code and
images. Do not attempt a firmware update if you are unsure of the image you have, or
of the procedure required for the update.
A line that begins with ROM level (alterable).. displays the firmware level that is
currently on the system.
v Looking at the top of the service processor main menu.
The update file will have the format MMyymmdd.img. The MM indicates that this is
an update image for a Model M80 or Model 6M1 system: yy is the year, mm is the
month, and dd is the day of the update file.
4. After the update file has been written to the /tmp/fwupdate directory, enter the
following commands:
./update_flash -f /tmp/fwupdate/MMyymmdd.img
Attention: Do not overlook the periods (.) in the commands shown above. AIX
commands are case-sensitive. Type them exactly as shown.
You will be asked for confirmation to proceed with the firmware update and the required
reboot. If you confirm, the system will apply the new firmware, reboot, and return to the
AIX prompt. This may take up to ten minutes depending on the configuration of the
system.
Attention: On some systems, the message Wait for rebooting before stopping
may display on the system console. Do not turn off the system until the system has fully
rebooted to the AIX login prompt. If a shutdown is necessary at that time, log in as root
user and issue the shutdown command.
While the update is in progress, you will see Rebooting... on the display for several
minutes.
The service processor menus are available while the system is powered off. As a
privileged user, from the service processor main menu, select Service Processor
Setup, then select Reprogram Flash EPROM Menu. The update process requests
update diskettes as needed.
Refer to the detailed instructions that are provided on the Web site with the latest
image.
Note: The update process can take up to sixty minutes, and the system reboots itself
during the update process.
Recovery Mode
If the service processor detects that its programming has been corrupted when it is
powering up to standby mode, it will enter recovery mode. In recovery mode, it will ask
for the firmware update diskettes by putting eight-digit codes on the operator panel.
Code Action
A1FD 0000 System firmware has been corrupted and must be reflashed.
A1FD 0001 Insert update diskette 1.
A1FD 0002 Insert update diskette 2.
A1FD 0003 Insert update diskette 3.
A1FD 000n Insert update diskette n.
Notes:
1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive after
it has been read, B1FD 001F is displayed, indicating that the wrong diskette is in the
drive.
2. If B1FD 001A is displayed at any time during the process, the service processor must
be reset by activating the pinhole reset switch on the primary drawer operator panel.
After the update process is complete, the service processor resets itself and goes
through a complete reboot.
During boot time, the service processor does not configure processors or memory
DIMMs that are marked ″bad.″
You can enable or disable CPU Repeat Gard or Memory Repeat Gard using the
Processor Configuration/Deconfiguration Menu, which is a submenu under the System
Information Menu.
If the service processor detects no heartbeats during system IPL (for a set time period),
it cycles the system power to attempt a reboot. The maximum number of retries is set
from the service processor menus. If the fail condition persists, the service processor
leaves the machine powered on, logs an error, and displays menus to the user. If
call-out is enabled, the service processor calls to report the failure and displays the
operating system surveillance failure code on the operator panel.
Operating system surveillance is not enabled by default, allowing you to run operating
systems that do not support this service processor option.
For operating system surveillance to work correctly, you must set the following
parameters:
v Surveillance enable/disable
v Surveillance interval
The maximum time the service processor waits between heartbeats from the
operating system before reporting a surveillance failure.
v Surveillance delay
The maximum time the service processor waits for the first heartbeat from the
operating system, after the operating system has been started, before reporting a
surveillance failure.
Surveillance does not take effect until the next time the operating system is started after
the parameters have been set.
You can initiate surveillance mode immediately from service aids. In addition to the
three options above, a fourth option allows you to select immediate surveillance, and
rebooting of the system is not necessarily required.
If operating system surveillance is enabled (and system firmware has passed control to
the operating system), and the service processor does not detect any heartbeats from
the operating system within the surveillance delay period, the service processor
assumes the system is hung and takes action according to the reboot/restart policy
settings. See “Service Processor Reboot/Restart Recovery” on page 440.
If surveillance is selected from the service processor menus that are only available at
bootup, then surveillance is enabled by default as soon as the system boots. From
service aids, the selection is optional.
Call-Out (Call-Home)
The service processor can call out (call-home) when it detects one of the following
conditions:
v System firmware surveillance failure
v Operating system surveillance failure (if supported by operating system)
v Restarts
v Critical hardware failure
v Abnormal operating system termination
Note: Some modems, such as IBM 7857-017, are not designed for the paging function.
Although they can be used for paging, they will return an error message when
they do not get the expected response from another modem. Therefore, even
though the paging was successful, the error message will cause the service
processor to retry, continuing to place pager calls for the number of retries
specified in the Call-Out Policy Setup Menu. These retries result in redundant
pages.
System Configuration
The following describes the configuration for console mirroring:
v Service processor
v Modem connected to one serial port and enabled for incoming calls
v Local ASCII terminal connected to the other serial port. This local terminal can be
connected directly to your server or connected through another modem.
Error Log
Note: The time stamp in this error log is coordinated universal time (UTC), which is
also referred to as Greenwich mean time (GMT). The AIX error log has
additional information and time stamps with local time.
Entering an error number provides nine words of system reference code (SRC) data; an
example screen follows:
Detail: 6005
SRC
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
word11:B1004999 word12:0110005D word13:00000000
word14:00000000 word15:00001111 word16:00000 000
word17:B1004AAA word18:0114005D word19:A4F1E909
B1004999
If Return is pressed, the contents of NVRAM will be dumped 320 bytes at a time,
starting at address 0000.
The progress indicator codes are listed from top (latest) to bottom (oldest).
LCD Progress Indicator Log
B0FF
0539..17
0538..17
0539..17
0538..17
0539..17
0581
0538..17
0539..12
0538..12
0539..
0821..01-K1-00
0539..
0728..01-R1-00-00
0539..
0664..40-60-00-1,0
0539..
0777..U0.2-P1-I14/E1
0539..
0742..U0.2-P1-I12/E1
0539..
0776..U0.2-P1-I10/T1
E139
E1FB
E139
E183
Press Return to continue, or ’x’ to return to menu. >
EAA1..U0.1-P1-I14
E172..U0.1-P1
E172..U0.1-P1-I14
E172..U0.1-P1
94BB
9109
9380
9108
9107
9106
9105
9118
9104
9103
9102
90FD
Pre-Standby Phase
Pre-Standby Phase
This phase is entered when the server is first connected to a power source. This phase
is exited when the power-on self-test (POST) and configuration tasks are completed.
Standby Phase
The standby phase can be reached in either of the following ways:
v With the server off and power connected (the normal path), recognized by OK in the
LCD display.
OR
v With the server on after an operating system fault, recognized by an 8-digit code in
the LCD display.
In the standby phase, the service processor takes care of some automatic duties and
its menus are available. The service processor remains in the standby phase until a
power-on request is detected.
Bring-Up Phase
This phase is entered upon power-on, and exited upon loading of the operating system.
The text-based open firmware command line (the OK prompt), which is available from
the System Management Services Utilities menu, allows you to configure certain
adapters.
Note: The System Management Services can also be started using the Service
Processor Boot Mode Menu. See ″Boot Mode Menu″ on page 423.
After the System Management Services starts, the following Utilities screen displays:
Utilities
1 Password Utilities
2 Display Error Log
3 Remote Initial Program Load Setup
4 SCSI Utilities
5 Select Console
6 MultiBoot
7 Select Language
8 OK Prompt
.------.
|X=Exit|
`------’
===>
455
Each option on the System Management Services Utilities menu is described in this
section.
Password Utilities
Entering this selection permits access to the options shown on the following screen:
Password Utilities
.------.
|X=Exit|
`------’
===>
You can set the password from either the service processor or System Management
Services menus. After the password is set, both the service processor and System
Management Services automatically use the same password. The password is limited
to eight characters.
Note: If you forget your password, you must shut down the system and remove the
battery for at least 30 seconds.
Unattended Start Mode <ON>: This selection is used to enable or disable unattended
start mode. Use this option to instruct the service processor to restore the power state
of the server after a temporary power failure, which is necessary on servers that require
automatic power-on after a power failure. The default setting is ON.
.--------------------------------------------------------.
| Error Log |
| |
| |
| Date Time ErrorCode Location |
| Entry 1. 99/06/30 02:15:35 25a80011 U1.1-P1.1/Q1 |
| Entry 2. 99/07/01 03:18:36 20300026 U1.1-P1.1/Q1 |
`--------------------------------------------------------’
.-----------------. .-----------.
|C=Clear Error Log| |X=Exit Log |
`-----------------’ `-----------’
Note: The time stamp in this error log is Coordinated Universal Time (UCT), which is
also referred to as Greenwich Mean Time (GMT). The AIX error log has more
information available and time stamps with your local time.
Network Parameters
1. IP Parameters
2. Adapter Parameters
3. Ping
.------.
|X=Exit|
`------’
===>
Note: Some applications may require that IP addresses contain leading zeroes for
numbers less than 100. For example, 129.132.4.20 may need to be entered as
129.132.004.020.
IP Parameters
.------.
|X=Exit|
`------’
===>
Attention: If the client system and the server are on the same subnet, set the
gateway IP address to [0.0.0.0].
Adapter Parameters
.------.
|X=Exit|
`------’
(PRESS ENTER AFTER MAKING A SELECTION) ===>
Select an adapter on this screen to display configuration menus for that adapter. If
ethernet is selected, you can set parameters that control how the ethernet adapter
communicates with the network. For example, the following screen shows the menu for
the ethernet adapter configuration:
Adapter Parameters
.------.
|X=Exit|
`------’
(PRESS ENTER AFTER MAKING A SELECTION) ===>
After determining which adapter to use to ping the remote system, you must provide the
addresses needed to communicate with the remote system. Select the appropriate
device from the Ping menu to display a screen similar to the following:
PING IP Address
.---------. .------.
|E=Execute| |X=Exit|
`---------’ `------’
===>
SCSI Utilities
.--------.
| X=Exit |
`--------’
===>
Using the Harddrive Spin Up Delay option allows you to change the spin up delay time
for SCSI hard disk drives attached to your system. Spin up delay values can be entered
manually, or you can use a default setting (two seconds). All values are measured in
seconds.
Attention: Setting the value lower than the default can prevent successful booting
because the system power handling capabilities might be exceeded. If the system no
longer boots after setting this value, access this utility again and increase the value. It is
also possible that adding disk drives can cause similar symptoms. Changing this value
changes the surge current requirements on the system. As you increase the value, the
surge current requirements on the system are decreased.
Select Console
Select this option to define which display is used by the system for system
management. If no console is selected, the console defaults to serial port 1.
Selecting this option causes a screen similar to the one that follows to display on each
console that can be used as a firmware console. Follow the instructions on the screen
to select a particular console.
1 Select Software
2 Software Default
3 Select Install Device
4 Select Boot Devices
5 OK Prompt
6 Multiboot Startup <OFF>
.------.
|X=Exit|
`------’
===>
Select Software
If this option is supported by the operating system, the names of the operating system
installed are shown. This option might not be supported by all operating systems.
If you are running AIX, and you receive the following message:
No Operating System Installed
This indicates that information in nonvolatile storage could have been lost, as would
happen if the battery had been removed. To re-create this value, run the bootlist
command under AIX with the appropriate parameters. See the bootlist command in
your AIX documentation.The AIX Documentation library is available at the following Web
address: http://www-1.ibm.com/servers/aix/library/. Select Technical Publications. AIX
documentation is also contained on the AIX Documentation CD. The documentation is
made accessible by loading the documentation CD onto the hard disk or by mounting
the CD in the CD-ROM drive.
Software Default
If this option is supported by the operating system, you can select the default operating
system to start the system. This option might not be supported by all operating
systems.
.------.
|X=Exit|
`------’
====>
Lists the current order of devices in the boot list. The following is an example of this
screen.
Current Boot Sequence
1. Diskette
2. SCSI CD-ROM ( loc=P1/Z1-A1 )
3. SCSI 9100 MB Harddisk ( loc=P1/Z1-A5 )
4. Ethernet ( loc=P1-I4/E1 )
5. None
.------.
|X=Exit|
`------’
===>
1 1 Diskette
2 4 Ethernet ( loc=P1-I4/E1 )
3 - Token-Ring ( loc=P1-I5/T1 )
4 2 SCSI CD-ROM ( loc=P1/Z1-A1 )
5 3 SCSI 9100 MB Harddisk ( loc=P1/Z1-A5 )
.------.
|X=Exit|
`------’
===>
OK Prompt
Provides access to the open firmware command prompt.
This option should only be used by service personnel to obtain additional debug
information.
Select Language
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
6. Svenska
===> .------.
|X=Exit|
`------’
Note: To correctly display languages other than English, your ASCII terminal must
support the ISO-8859 character set.
OK Prompt
This option provides access to the open firmware command prompt.
This option should only be used by service personnel to obtain additional debug
information.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage
on metal parts of the system or the devices that attach to the system. It is the
responsibility of the customer to ensure that the outlet is correctly wired and
grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for
the system unit and all attached devices are unplugged.
During an electrical storm, do not connect cables for display stations, printers,
telephones, or station protectors for communications lines.
CAUTION:
This product is equipped with a three–wire power cable and plug for the user’s
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical
shock, disconnect two power supply cords before servicing.
467
Handling Static-Sensitive Devices
Attention: Disk drives, tape drives, and CD-ROM drives are sensitive to static
electricity discharge. These devices are wrapped in antistatic bags to prevent this
damage.
If you cannot use this method, you can power off the system by using the following
operator panel power button procedure.
Attention: Using the primary I/O drawer operator panel power button to power off the
system might cause unpredictable results in the data files, and the next IPL will take
longer to complete.
1. Open the rack door.
2. Press the power button on the primary I/O drawer operator panel.
The power LED on the primary I/O drawer operator panel starts blinking at a fast
rate. B0FF displays in the primary operator panel display.
When the power-off sequence is complete, the system goes into Standby mode, as
evidenced by the following:
v OK displays in the primary operator panel display.
v The primary I/O drawer operator panel LED will start blinking at a slow rate.
v The CEC drawer power LED will start blinking at a slow rate.
v If a secondary I/O drawer is present, its power LED will go off.
3. Disconnect the ac power cords (dc power cords in a dc-powered system) from the
drawers.
The power LED on the primary I/O drawer operator panel starts blinking at a fast rate.
9xxx checkpoints appear in the operator panel display.
Powering Off and Powering On the System Using the Service Processor
The system can be powered off and on using the System Power Control menu, which is
a service processor menu that is available to the privileged user. See “System Power
Control Menu” on page 420.
Note: You do not have to power off the system to remove a hot-pluggable FRU from
the CEC drawer or from I/O drawers.
1 2 3 4 5 6 7 8 9 10 11 12 13 14
P1
LED
PCI
Slot
Before you install the adapter, determine if the PCI adapter you are installing is
hot-pluggable. See the PCI Adapter Placement Reference, order number SA38-0538.
Removal
To remove a non-hotpluggable adapter, do the following:
1. Turn off power and unplug the system unit power cable from the electrical outlet.
If the system is operating under AIX, type shutdown to power off the system.
If you cannot use this method, you can power off the system by using the following
operator panel power button procedure.
Attention: Using the primary I/O drawer operator panel power button to power
off the system may cause unpredictable results in the data files, and the next IPL
will take longer to complete.
a. Open the rack door.
b. Press the power button on the primary I/O drawer operator panel.
The primary I/O drawer operator panel Power LED starts blinking at a fast rate.
B0FF displays in the primary operator panel display.
When the power-off sequence is complete, the system will go into Standby mode,
as evidenced by:
v OK displays in the primary operator panel display.
v The primary I/O drawer operator panel LED will start blinking at a slow rate.
v The CEC drawer power LED will start blinking at a slow rate.
2. Place the I/O drawer in the rear service position, as follows:
a. If you have not already done so, open the front door of the rack unit.
b. Loosen the two thumbscrews that attach the front bezel to the drawer.
c. Grasp both sides of the front bezel and pull the bezel off the drawer.
d. If you have not already done so, open the rear door of the rack unit.
e. If present, remove the two retaining screws that secure the drawer to the rails.
f. Ensure that the cables do not restrain drawer movement.
Replacement
To replace an adapter, perform the removal steps in the reverse order.
Note: If the adapter or device has a problem that prevents it from replacement by
hot swap, it must be replaced by shutting down the system. Go to the
RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus
Systems Map 210 - 3.
7. Press F4 to display the Device Names menu.
8. Select the adapter you are removing from the menu.
9. Answer YES to Keep Definition. Press Enter.
10. The ARE YOU SURE screen displays. Press Enter to verify the information.
Successful unconfigure is indicated by the OK message displayed next to the
Command field at the top of the screen.
11. Press F3 to return to the PCI Hot-Plug Manager menu.
12. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The
Replace/Remove a PCI Hot-Plug Adapter menu displays.
13. Move the cursor to select the adapter that you are removing and press Enter.
14. Press the Tab key until the entry field displays the replace operation and then
Press the Enter key. Follow the instructions that display on the screen until you are
instructed to remove the adapter.
Note: A PCI adapter is only hot-pluggable if the PCI adapter is supported for hot-plug
applications. See the PCI Adapter Placement Reference, order number
SA38-0538.
For additional information about taking an adapter offline or removing it from the
system configuration, see the AIX System Management Guide: Operating
System and Devices. This publication is located on the AIX Documentation CD.
The documentation information is made accessible by loading the documentation
CD onto the hard disk or by mounting the CD in the CD-ROM drive.
Note: For information about the PCI slot LED states, see “I/O Drawer PCI Slot LED
Definitions” on page 470.
Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing
for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices
Added after IPL to configure the adapter in that slot.
Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system
turned on. You will be asked to identify the PCI slot that you have selected prior to the
actual operation. The selected PCI slot will go into the Action state and finally into the
On state.
Note: The system will indicate the slot holds an “Unknown” device until you perform
the Install/Configure Devices Added After IPL option to configure the adapter.
Allows the user to remove an existing adapter, or replace an existing adapter with an
identical one. For this option to work, the adapter must be in the Defined state. See
Unconfigure a Device option below.
You will be asked to identify the PCI slot prior to the actual operation. The selected PCI
slot will go into the Action state.
Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify
state. See “I/O Drawer PCI Slot LED Definitions” on page 470.
Allows the user to put an existing PCI adapter into the Defined state if the device is no
longer in use.
This step must be completed successfully before starting any removal or replacement
operation. If this step fails, the customer must take action to release the device.
Allows a new PCI adapter to be configured into the system if software support is
already available for the adapter. The selected PCI slot will go into the On state.
The system attempts to configure any new devices and tries to find and install any
required software from a user-selected source.
The add, remove, and replace functions return information to the user indicating
whether the operation was successful. If additional instructions are provided on the
screen, complete the recommended actions. If the instructions do not resolve the
problem, see the following:
v If the adapter is listed as Unknown, perform the Install/Configure Devices Added
After IPL option to configure the adapter.
v If you receive a warning indicating that needed device packages are not installed, the
system administrator must install the specified packages before you can configure or
diagnose the adapter.
v If you receive a failure message indicating a hardware error, the problem might be
either the adapter or the PCI slot. Isolate the problem by retrying the operation in a
different PCI slot, or trying a different adapter in the slot.
Removal
Note: For -48 vdc systems, you must first remove a long jackscrew. See steps 1
through 4 in “CEC Backplane” on page 496.
1. Turn off power as described in “Powering Off the System” on page 469.
2. Remove the card-cage retaining screws in the back of the CEC drawer.
3. Grasp the handle located in the rear center of the CEC card-cage and pull it straight
back.
4. Pull up on the release latches of the memory card, and carefully pull the card out of
the slot.
5. Locate the memory module connectors and determine which memory modules you
want to remove. Refer to “Memory Riser Card and Memory DIMM Locations” on
page 20 for memory module locations.
Replacement
1. With one hand, touch any metal surface of the chassis to minimize static electrical
charges, and then pick up the memory module.
2. Locate the memory module connector on the memory card.
3. Insert the memory module firmly into the connector.
4. Secure the memory module with the locking tabs located at each end of the
connector.
5. Replace the memory card into the system unit.
6. Push the CEC drawer back into the rack, and replace the CEC card-cage retaining
screws.
Removal
Note: For -48 vdc systems, you must first remove a long jackscrew. See steps 1
through 4 in “CEC Backplane” on page 496.
1. Turn off power as described in “Powering Off the System” on page 469.
2. Remove the card-cage retaining screws in the back of the CEC drawer.
3. Grasp the handle located in the center rear of the CEC card-cage, and pull the CEC
card-cage out from the drawer.
4. Pull up on the release latches on the CEC regulator card (1) and carefully remove
the card from its slot.
Replacement
Replace in reverse order.
Removal
Note: For -48 vdc systems, you must first remove a long jackscrew. See steps 1
through 4 in “CEC Backplane” on page 496.
Attention: The processor cards in this system can only be attached to the backplane
once. If a processor card is removed, it must be replaced with a new processor card.
Before proceeding, make sure you have been directed to replace the processor card(s)
by the action for a valid error code in Chapter 5, “Error Code to FRU Index” on
page 171 or by a second level of support.
1. Turn off power as described in “Powering Off the System” on page 469.
2. Remove the card-cage retaining screws in the back of the CEC drawer.
3. Grasp the handle located in the center rear of the CEC card-cage, and pull the CEC
card-cage out from the drawer.
4. Inspect the area around the outside of the CEC card-cage for excessive dirt or dust.
You may need to vacuum the external surfaces of the CEC card-cage, especially
when excessive dirt or dust is present. Vacuum before opening the cage.
5. Remove the screws from the processor air cover.
2
1
2 1
7. Locate the spring screws (1) attached to the processor card you want to remove.
Using a 3/8″ hex driver, turn the spring screw counterclockwise to loosen and
remove the screw. Do not drop the screw into the CEC card-cage.
8. The processor latches (2), which pivots at the opposite end of the spring screw,
raises slightly after the screw is removed. Grasp the end of the processor latch bar
where the spring screw was located and pull it straight up out of the card-cage.
Rest the latch bar against the sheet metal of the CEC card-cage.
9. Grasp the center of the processor card and remove the card by pulling it straight up
and out of the CEC card-cage. After you have removed the processor card, place
the card top-side down on a work surface.
c. Starting with the brush (1) in the center of the processor contact surface that is
on the CEC backplane, use an outward brushing motion to clear the debris
away from the area where the surfaces of the processor card and the CEC
backplane meet. (Do not brush the processor card or the contact surface on the
bottom of the processor card).
3. Grasp the center of the new processor card and align the square notch with the
processor latch pivot. The processor latch pivot is on your left as you face the rear
of the CEC card-cage. Align the holes in the processor card with the guide pins on
the CEC backplane, and press down until the processor card is fully seated.
4. Pivot the processor card latch down over the processor card.
5. Insert the spring screw into the end of the processor latch opposite the pivot end
(right side, viewed from the rear of the CEC card-cage). Turn the spring screw
clockwise into the threaded hole of the CEC backplane, then tighten the screw until
Removal
1. Loosen the two thumbscrews that attach the front bezel to the drawer.
2. Grasp both sides of the front bezel and pull it off the CEC drawer.
Replacement
Replace in reverse order.
Attention: After hot-plugging a CEC fan, wait at least 5 minutes and then examine
the system error logs to make sure CEC fan errors are not being logged.
Each CEC power supply is hot-pluggable. If only one needs to be removed, you do not
have to power off the system.
Removal
1
1. Carefully remove the end of the ac cord on the CEC power supply you want to
replace. Do not remove both ac cords, as power will be lost to the CEC, forcing the
system to power off.
2. Remove the screw keeping the latch handle in place, as shown in (1) above.
3. After the latch handle swings freely, position the latch handle (2) so that it is
perpendicular to the rear of the CEC drawer.
4. Pull on the ring attached to the power supply (3) and the latch handle (maintaining
its perpendicular position) to move the power supply away from the CEC drawer.
5. Remove the power supply.
Replacement
Replace in reverse order.
Removal
Note: For -48 vdc systems, you must first remove a long jackscrew. See steps 1
through 4 in “CEC Backplane” on page 496.
1. Turn off power as described in “Powering Off the System” on page 469.
2. For easy replacement, record the locations of the RIO cables attached to the rear of
the CEC drawer.
3. Disconnect all of the RIO cables attached to the rear of the CEC drawer.
4. Remove the card-cage retaining screws in the back of the CEC drawer.
5. Grasp the handle located in the center rear of the CEC card-cage and pull it straight
back.
6. Unscrew the two screws attaching the EMC tunnel to the CEC.
7. Pull up on the release latches of the CEC RIO adapter card and carefully pull the
card out of the slot.
1
2
3
Attention: The processor cards in this system can only be attached to the backplane
once. If the CEC backplane is replaced with a new backplane, the processor(s) must
also be replaced with a new processor card(s).
Attention: Before proceeding, make sure you have been directed to replace the
backplane and processor card(s) by the action for a valid error code in Chapter 5, “Error
Code to FRU Index” on page 171 or by instruction from your second level of support.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. Remove the card-cage handle retaining screws (1) and the card-cage handle (2)
located in the rear center of the CEC.
3. Remove the four rear CEC EMC shield retaining screws (3) located in each corner
of the EMC shield.
4. Remove the rear CEC EMC shield (4) by grasping the flanges on each side of the
shield, as shown in the following figure.
3
4
3
4
5. Remove the two card-cage retaining screws in the back of the CEC drawer.
6. Grasp the handle located in the center rear of the CEC card-cage and pull it
straight back.
7. Disconnect all of the RIO cables attached to the rear of the CEC drawer. For easy
replacement, record their locations.
8. Disconnect the JTAG cable.
9. Disconnect the V/S COMM cable.
10. Pull up on the release latches of the CEC RIO adapter card and carefully pull the
card out of the slot.
11. Pull up on the release latches of the memory riser card(s) and carefully pull the
card(s) out of the slot(s).
12. Pull up on the release latches of the CEC regulator card and carefully pull the card
out of the slot.
13. Remove the processor card(s) as shown in “Processor Card” on page 485.
14. Support the bottom of the CEC card-cage with a telescoping table or other suitable
means. Leave the drawer open.
Replacement
Replace in reverse order.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
4. Disconnect and label all cables from the rear of the assembly.
5. Remove boot disk drives if installed. See “Internal Disk Drives” on page 509.
6. Remove and label adapters. See “PCI Adapters” on page 471.
7. Remove two screws from each side of the assembly.
8. Disconnect and label all cables from the front of the assembly.
10. Remove the RIO card assembly, as described in “I/O Drawer RIO Bus Adapter” on
page 501.
11. Remove the bar from the old assembly and install on the new assembly. See “I/O
Drawer” on page 532.
Replacement
1. Replace in reverse order.
2. Set the drawer type using the service processor menus.
a. From the SP Main Menu, select System Information Menu (option 3).
b. Select Power Control Network Utilities Menu (option 9).
c. Select Change I/O Type (option 3).
d. For the I/O drawer address (drawer number), type:
v 1 for primary drawer.
v n for secondary drawers (n=2, 3, or 4).
e. For the I/O type, type:
v 87 for the primary I/O drawer.
v 84 for all secondary drawers.
3. Check firmware level by checking the Version on the title of the Service Processor
Main Menu. If the firmware level is not at the correct level, proceed as follows:
v Reprogram all system EPROMs using the Service Processor Setup Menu.
v Select Reprogram Flash EPROM Menu (option 7).
4. The service processor settings have been set to default values. These settings must
be restored if not previously set to default values.
v Manually reset all settings.
v If the service processor settings had previously been saved, the settings may be
restored using the procedure in “Saving and Restoring Service Processor
Settings” on page 412.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. Place the I/O drawer in the rear service position, as described in “Rear Service
Position” on page 519.
3. Remove the two thumbscrews and remove the rear cover.
Rear
Cover
Replacement
Replace in reverse order.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. Place the I/O drawer in the rear service position, as described in “Rear Service
Position” on page 519.
3. Remove the two thumbscrews and remove the rear cover.
Rear
Cover
Media
Cable
Tray
Replacement
Replace in reverse order.
Attention: After hot-plugging a power supply, wait at least 5 minutes and then
examine the system error logs to make sure power supply errors or power supply fan
errors are not being logged.
1. If you have not already done so, open the front door of the rack unit.
2. Loosen the two thumbscrews that attach the front bezel to the drawer.
3. Grasp both sides of the front bezel and pull it off the drawer.
4. Unplug the power cord from the power supply you are removing.
5. Pull the snap button out on the power supply handle.
6. Grasp the handle of the power supply and pivot it downward.
Snap Buttons
Push to secure
fan assembly,
pull to remove
fan assembly
Replacement
Note: The power supply handle must be in the down position for replacement.
Test
Switch
3. Use a pencil to press the test switch on the side of the power supply.
v If the LED on the front of the power supply is on steady and the fans spin, the
power supply is good and can be returned to the system.
v If the LED on the front of the power supply is not on steady and the fans do not
spin, the power supply is defective. Continue with step 8 in “I/O Drawer Power
Supplies and Power Supply Fan Assemblies” on page 505.
Removal
1. Turn off power as described in “Powering Off the System” on page 469
2. Place the drawer in the rear service position as described in “Rear Service Position”
on page 519.
3. Remove the two thumbscrews and remove the back top cover.
4. Remove the screw from the back of the disk drive carrier.
5. Slide the carrier forward and remove. The carrier has keyed slots (1) that engage
mounting posts (2) on the chassis.
6. Disconnect the internal cables from the drives.
7. If you are replacing a defective boot drive, remove the mounting screws (3) from the
carrier.
Internal DASD 2
SCSI id = 8
To
System
DASD 1
ID 4 jumper To
J5 System
Internal SCSI
Cable Internal SCSI
DASD 2
ID 8 jumper DASD 1 Cable
ID 4 jumper
Term
Term
J5
Term
Notes:
1. The SCSI ID jumpers are integrated in the SCSI cable.
2. Unused SCSI connectors on the SCSI cables are shown for each configuration.
Carefully select the correct SCSI connectors and ID jumpers.
3. When using the FC 3139 internal/external cable, one or more of the following
external cables will also be required:
v FC 2118
v FC 2424
v FC 2425
For descriptions and part numbers of the feature codes, see “External Cables” on
page 542.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. Place the drawer in the front service position as described in “Front Service
Position” on page 520.
3. Remove the two screws from the center top cover and remove the cover.
4. Disconnect the cable from the fan controller card.
5. Pull the snap button out.
6. Remove the fan controller card.
Replacement
Replace in reverse order.
Attention: Do not open the drive; no user adjustments or serviceable parts are inside.
CAUTION:
A class 1 laser is contained in the device. Do not attempt to operate the drive
while it is disassembled. Do not attempt to open the covers of the drive as it is
not serviceable and is replaced as a unit.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. If you have not already done so, open the front door of the rack unit.
3. Loosen the two thumbscrews that attach the front bezel to the I/O drawer.
4. Grasp both sides of the front bezel and pull it off the drawer.
5. Push the release latch to the left and pull out the CD-ROM.
Release Lever
Replacement
Replace in reverse order.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. If you have not already done so, open the front door of the rack unit.
3. Loosen the two thumbscrews that attach the front bezel to the I/O drawer.
4. Grasp both sides of the front bezel and pull it off the drawer.
5. Push up on the release latch and pull the tape drive out.
Release
Lever
Replacement
Replace in reverse order.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. If you have not already done so, open the front door of the rack unit.
3. Loosen the two thumbscrews that attach the front bezel to the I/O drawer.
4. Grasp both sides of the front bezel and pull it off the drawer.
5. Pull out on the snap button and pull the diskette drive out.
Snap Botton
Replacement
Replace in reverse order.
Removal
1. Turn off power as described in “Powering Off the System” on page 469.
2. Place the I/O drawer in the front service position as described in “Front Service
Position” on page 520.
3. Loosen the two retaining screws that attach the front bezel to the drawer.
4. Grasp both sides of the front bezel and pull it off the drawer.
5. Place the drawer in the front service position as described in “Front Service
Position” on page 520.
6. Remove the two screws (1) from the top of the operator panel.
1
Replacement
1. Remove the VPD module from the old operator panel and insert it in the new
operator panel. Make sure you insert the module in the same position as it was in
the old panel.
1 VPD Module
2 Cable Connector
2. To replace, perform removal steps in reverse order.
Rear of Rack
6. Grasp the bar at the rear of the drawer, and pull the drawer to the rear until it is
stopped by the two detents.
Attention: Do not pull the drawer out from the front. The drawer must be pushed to
the front service position from the rear of the system.
1. If you have not already done so, open the front door of the rack unit.
2. Loosen the two thumbscrews that attach the front bezel to the drawer.
3. Grasp both sides of the front bezel and pull it off the drawer.
4. Remove the screws that attach the left and right retaining brackets and remove the
brackets.
5. If you have not already done so, open the rear door of the rack unit.
6. If present, remove two screws securing the drawer to the rack. Refer to “Rear
Service Position” on page 519.
7. Push the drawer forward until it is stopped by the two detents.
Removal
CAUTION:
A lithium battery can cause fire, explosion, or severe burn. Do not recharge,
disassemble, heat above 100°C (212°F), solder directly to the cell, incinerate, or
expose cell contents to water. Keep away from children. Replace only with the
part number specified for your system. Use of another battery may present a risk
of fire or explosion.
Replacement
Tilt the battery so that you can insert it into the battery socket. Slide the battery into the
battery socket, then push it down into the socket.
Notes:
1. Be sure to tell the customer to reset the time and date.
2. Ensure that the battery polarity is correct; place the battery in the socket with the
positive side facing upward.
523
CEC Drawer Card Assembly
3
5
8 1
2
5 3
7
6
4
8
4
5 7
6
9
10
11
1 12
13
14
2 3
11
12 13
14
15
8 9
13
10
17
16 18 19 23
24
22
27
20
21
25
26
See RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems for part
numbers.
5
4
10
See RS/6000 and Eserver pSeries Diagnostic Information for Multiple Bus Systems for part
numbers.
P3 J 12
+12 V G +3.3 V
J3
P 13
J 10 J 11
J 13 J 15
2 3
S 14
S 16
System Board
S 17 Power Power
Supply 1 Supply 2
V/S COMM
1
JTAG
S 15 S 18
12
CD-ROM
PCI Board
Optional
Media 3.3 V
12
Grd
16 Term
13 5V
DASD 1
J6 J7
Media Misc.
Power Power B
DASD 2 u
14 3.3 V
l
Term Ground
k
5V
h
J5 e
DASD I/P a
Power J4 2 J3 d
Fan 3 Fan 4 Base SPCN Op Panel
10
Power 3
Distribution J28
Power Fan 6 Diskette
15 Board
Supply 2 or J1 4
Cooling J2
Unit Fan 9 Base SPCN
15
Power Fan 5 J2 J3
Supply 1 I/O Board
9
Fan 1 Fan 2 7 J9
Fan 5
JTAG
Controller
Card J5 J30
SCSI
Op Panel J4
8 V/S COMM
6
J21
8
Optional
Media
PCI Board
Optional
8 3.3 V
Media
Grd
Term 5V
B
J6 J7 u
Media Misc. l
Power Power
k
3.3 V h
Ground e
5V a
2 d
I/P
J4 J3
Fan 3 Fan 4 Base SPCN Op Panel
Power 3
9 Distribution 7 I/O Board
Power Fan 6
Board
Supply 2 or J1
Cooling J2
Unit Fan 9 Base SPCN
9
Power Fan 5 J2 J3
Supply 1
6 4
Fan 1 Fan 2
Fan
Controller
Card
Op Panel
This product may contain a sealed lead acid, nickel cadmium, nickel metal
hydride, lithium, or lithium ion battery. Consult your user manual or service
manual for specific battery information. The battery must be recycled or disposed
of properly. Recycling facilities may not be available in your area. For information
on disposal of batteries, contact your local waste disposal facility.
In the United States, IBM has established a collection process for reuse, recycling, or
proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride,
and other battery packs from IBM Equipment. For information on proper disposal of
these batteries, please contact IBM at 1-800-426-4333. Have the IBM part number
listed on the battery available prior to your call.
Environmental Design
The environmental efforts that have gone into the design of this system signify IBM’s
commitment to improve the quality of its products and processes. Some of these
accomplishments include the elimination of the use of Class 1 ozone-depleting
chemicals in the manufacturing process and reductions in manufacturing wastes. For
more information, contact an IBM account representative.
547
Declared Acoustical Noise Emissions
Noise Emissions1 Operating Idle
With M80 CEC drawer only
LWAd 6.4 bels 6.4 bels
LpAm N/A N/A
<LpA>m 48 dBA 48 dBA
Impulsive or prominent No No
discrete tones
Noise Emissions1 Operating Idle
With M80 and Primary I/O Drawer
LWAd 6.5 bels 6.5 bels
LpAm N/A N/A
<LpA>m 49 dBA 49 dBA
Impulsive or prominent No No
discrete tones
Noise Emissions2 Operating Idle
LWAd 5.8 bels 5.8 bels
LpAm N/A N/A
<LpA>m 45 dBA 45 dBA
Impulsive or prominent No No
discrete tones
1. Noise emissions data are based on the following configuration: the drawer is mounted in a
7014 Series Model T00 Rack, a power distribution unit is installed in the rack, and the system
is operating in a normal environment of 25 °C (78 °F).
2. Noise emissions data are based on the following configuration: the drawer is mounted in a
7014 Series Model T00 Rack and a power distribution unit is installed in the rack.
The manufacturer may not offer the products, services, or features discussed in this
document in other countries. Consult the manufacturer’s representative for information
on the products and services currently available in your area. Any reference to the
manufacturer’s product, program, or service is not intended to state or imply that only
that product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user’s responsibility to evaluate
and verify the operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject
matter described in this document. The furnishing of this document does not give you
any license to these patents. You can send license inquiries, in writing, to the
manufacturer.
The following paragraph does not apply to the United Kingdom or any country
where such provisions are inconsistent with local law: THIS MANUAL IS
PROVIDED ″AS IS″ WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR
IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR
PURPOSE. Some states do not allow disclaimer of express or implied warranties in
certain transactions; therefore, this statement may not apply to you.
Information concerning products made by other than the manufacturer was obtained
from the suppliers of those products, their published announcements, or other publicly
available sources. The manufacturer has not tested those products and cannot confirm
the accuracy of performance, compatibility or any other claims related to products made
by other than the manufacturer. Questions on the capabilities of products made by other
than the manufacturer should be addressed to the suppliers of those products.
549
550 Service Guide
Appendix C. Service Processor Setup and Test
For your convenience, an example of a basic service processor setup checklist is
included here. Your setup may include more or fewer of the available features, so you
can adjust this checklist for your own application so that it is helpful to you.
Note: This step is strongly recommended to protect the usefulness of the service
processor and the availability of the server.
551
Testing the Setup
This section contains sample testing procedures to help ensure that your setup is
working.
These tests include communicating with the server operating system. Before you start,
ensure that the necessary serial port(s) is configured. If you need assistance, refer to
“Serial Port Configuration” on page 553.
The server should be powered off as a result of the “Service Processor Setup
Checklist” on page 551.
Testing Call-In
1. At your remote terminal, call in to your server. Your server answers and offers you
the Service Processor Main Menu after requesting your privileged access password.
2. Select System Power Control.
3. Select Power-On System.
When you are asked if you wish to continue powering on the system, type Y.
4. After the system firmware and operating system have initialized the server, the login
prompt displays at your remote terminal if you set up Seamless Modem Transfer
(refer to “Transfer of a Modem Session” on page 566 for more information). This
may take several minutes. When the login prompt displays, you have successfully
called the service processor.
5. Type logout to disconnect from the operating system. The message No Carrier
displays on your remote terminal.
6. Call your server again. The operating system answers and offers you the login
prompt. If these tests are successful, call-in is working.
7. Log in and type shutdown to shut down your server.
8. The message No Carrier displays on your remote terminal.
Testing Call-Out
During the setup, you entered your phone numbers for the pager (on page 435) and
customer voice (on page 435). These numbers are used for this test.
1. Your remote terminal is disconnected as a result of the Call-In test.
2. Call your server again.
3. At the service processor main menu, select Call-In/Call-Out Setup menu, then
select Call-Out test. This action causes a simulated error condition for the purposes
of this test.
4. After a few moments, a message displays, regarding an illegal entry. Press Enter to
clear the message and return to the main menu.
5. When your telephone rings, answer the call. You should hear the sound of a
telephone being dialed. Your computer is trying to page you.
If no serial ports are configured, none are listed. If you want to configure serial
ports that are not listed, continue with the remaining steps.
3. Identify the serial port(s) with the modem(s).
4. Type smit tty
5. Select add tty
6. Select RS232
7. Select Baud rate 9600 or higher.
8. Select login enable and set the flow control to RTS.
9. Commit the selections and set up any other needed serial ports.
10. Exit SMIT.
Description
The chcod command manages the Capacity Upgrade on Demand feature, which allows
configuration of more ResourceType, in this instance processors, on the system than
were initially configured. The additional resources may be enabled if they are available,
and if the system supports Capacity Upgrade on Demand for the specified
ResourceType. Only one ResourceType can be managed at a time. The change in the
number of ResourceType takes effect after the next system boot.
Flags
-c CustomerInfo
Specifies the text string to include in the error log. CustomerInfo cannot be
more than 255 characters in length. Blank spaces cannot be included in the
string. After CustomerInfo has been specified, subsequent uses of the chcod
command do not have to specify the -c flag, but you do have the option of
changing it. CustomerInfo can consist of alphanumeric characters and any of
″.″ (decimal point), ″,″ (comma), ″-″ (hyphen), ″(″ (open parenthesis), or ″)″
(closed parenthesis). This flag is optional.
-h Displays the usage message. This flag is optional.
-n NbrResources
Specifies the number of ResourceType to be authorized on the system. The
value forNbrResources should be entered in increments of 2. The number that
is entered represents the total number of processors for the system. If you are
adding the first additional processor board, the number should be 6, 8, 10, or
12. Each time a processor board is added, the total number of desired
processors is entered. If the number is 0, Capacity on Demand will be disabled
for the specified ResourceType. This flag is optional. If -n is specified, then -r
must also be specified.
-r ResourceType
Specifies the ResourceType (proc for processors) to be enabled and monitored
on the system. The system must support Capacity Upgrade on Demand for
ResourceType. If -r is specified, then -n must also be specified.
555
The default for the chcod command (with no flags) displays the current value of
CustomerInfo, a reserved field named MailAddr, the system’s model name and serial
number, and the current values for any ResourceType.
Recovery Procedures
Perform any required service actions to repair the system. After the service action is
complete, use the following procedure to check the Capacity Upgrade on Demand
information and if necessary, return the system to its original configuration.
1. Log in to the system as root user or as a user with system group privileges.
2. Type the following at the command line:
chcod
3. The configured Capacity Upgrade on Demand features are displayed. Compare the
current configuration with the configuration you recorded earlier.
If the system configurations do not match, use the procedures beginning with “Installing
a System with Capacity Upgrade on Demand” on page 405 to reinstall the Capacity
Upgrade on Demand feature and reconfigure the hardware, as appropriate.
Unauthorized devices cannot be tested by AIX diagnostics, nor do they appear when
running the lscfg command on the NEW RESOURCE menu, or any AIX diagnostic
task. For example, a system may have eight processors originally installed, but only six
of the eight have been authorized for use. Diagnostics will only show the six authorized
processors in the diagnostic test list or NEW RESOURCE list. The two unused
processors cannot be tested by standalone and online diagnostics, and will not display
in the diagnostic test list. However, they are tested by the power-on self-test (POST),
and if a processor has failed, the failure is:
557
v Logged in the POST error log
v Logged in the AIX error log
v Reported at boot time by automatic error log analysis
v Reported if the sysplanar0 diagnostics are run
System Management Services and service processor menus are not affected by
Capacity Upgrade on Demand and all processors are tested at boot time. If one
processor failed at IPL, Capacity Upgrade on Demand still enables the number of
authorized processors, having the remaining good processors marked as available for
future capacity and the failed processors marked as failed. The customer can continue
to run the system without degradation. However, in such a case, the failing processor
card should be replaced at the customer’s convenience to allow the customer to add
capacity as needed.
where (xx) is the number processors installed in the system, (yy) is the number of
Capacity Upgrade on Demand-enabled processors.
The sample modem configuration files are located in your service processor firmware in
the /usr/share/modems subdirectory (if your server is using AIX) with the following
names. A listing of each specific file is included in this appendix.
Use the following selection procedures and your modem manual to determine which of
the configuration files is suitable for your use.
561
Configuration File Selection
Use the following steps to select a configuration file:
1. Is your modem an IBM 7852-400?
If Yes, use modem configuration file modem_m0.cfg and go to step 7 on page 563.
Note: The IBM 7852-400 modem has DIP switches on the right side of the unit.
See “IBM 7852-400 DIP Switch Settings” on page 565 for the correct switch
settings.
Note: The IBM 7857-017 modem has two telephone line connections on the back
of the unit. One is marked LL (for Leased Line), and the other is marked
PTSN (for Public Telephone Switched Network). The service processor
expects to use the modem on the public network, so the telephone line
should attach to the PTSN connector.
Note: Some older modems do not respond to the X0 or &R1 commands. Edit out
these commands from the modem configuration file if yours is such a
modem. See your modem manual for more information.
Some modems, such as the IBM 7857-017, are not designed for the paging function.
Although they can be used for paging, they return an error message when they do not
get the expected response from another modem. Therefore, even though the paging
was successful, the error message causes the service processor to retry, continuing to
place pager calls for the number of retries specified in the Call-Out Policy Setup Menu.
These retries result in redundant pages.
Note: If you have already set up your serial ports, line speeds, authorizations, and
telephone numbers from the service processor menus, use the service aid to
specify your customized modem configuration files.
If you have not already set up your serial ports, line speeds, authorizations, and
telephone numbers from the service processor menus, use the service aids to
set them while you specify your customized modem configuration files.
To disable Auto-Reliable Mode testing of the remote modem, use the sample modem
configuration file /usr/share/modems/modem_f.cfg as a model that you can modify, as
follows:
1. Find the necessary command in your modem manual.
2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name
(for example, modem_fx.cfg).
3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send
"ATcccE0T\r" where ccc is the added command as specified in your modem
manual, as follows:
Change the third line of each of the following stanzas:
v condout
1 Up Force DTR
2 Up Flow Control &E4
3 Down Result Codes Enabled
4 Down Modem Emulation Disabled
5 Up Auto Answer Enabled
6 Up Maximum Throughput Enabled
7 Up RTS Normal Functions
8 Down Enable Command Mode
9 Down Remote Digital Loopback Test Enabled
10 Up Dial-Up Line Enabled
11 *Up AT Responses Enabled (Extended Responses Disabled)
12 *Down Asynchronous Operation
Xon/Xoff Modems
Some early modems assume software flow control (Xon/Xoff) between the computer
and the modem. Modems with this design send extra characters during and after the
transmitted data. The service processor cannot accept these extra characters. If your
configuration includes such a modem, your functional results may be unpredictable.
The sample modem configuration files included in this appendix do not support these
modems, so custom configuration files are necessary. Anchor Automation 2400E is an
example of such a modem.
Ring Detection
Most modems produce an interrupt request each time they detect a ring signal. Some
modems generate an interrupt only on the first ring signal that they receive. AT&T
DataPort 2001 is an example of such a modem.
Terminal Emulators
The service processor is compatible with simple ASCII terminals, and therefore
compatible with most emulators. When a remote session is handed off from the service
processor to the operating system, agreement between terminal emulators becomes
important.
The server’s operating system will have some built-in terminal emulators. You may also
have a commercially available terminal emulation. It is important that the local and host
computers select the same or compatible terminal emulators so that the key
assignments and responses match, ensuring successful communications and control.
For best formatting, choose line wrap in your terminal emulator setup.
Recovery Procedures
Situations such as line noises and power surges can sometimes cause your modem to
enter an undefined state. When it is being used for dial-in, dial-out or ring indicate
power-on, your modem is initialized each time one of these actions is expected. If one
of these environmental conditions occur after your modem has been initialized, it might
be necessary to recover your modem to a known state.
In case recovery becomes necessary, shut down your system using established
procedures. Disconnect the power cable and press the power button to drain
capacitance while power is disconnected. Disconnect and reconnect modem power, and
then reconnect system power to completely reinitialize your system.
The modem command &Dn (where n is a number) generally sets the modem response
to the Data Terminal Ready (DTR) signal from the server’s serial port. The desired
response is that the modem will hold a connection while DTR is enabled, and drop the
connection when DTR is released. Using this mechanism, the server hangs up on a
connection under normal conditions.
Two strategies are available for dealing with the modem’s response to DTR:
v Recovery
v Prevention
Before you use one of these strategies, determine if your server’s modem is set up
correctly to respond to DTR.
With the remote terminal connected to serial port 1 and defined as the Primary Console
Device, there are two tests you can perform:
1. Will the modem drop the connection after the System initialization complete
message displays at the remote terminal?
If Yes, the modem is set up correctly.
If No, try another &Dn setting for your server’s modem. See your modem manual
for this information. The &Dn command appears in three places each in three of the
sample modem configuration files, as follows:
2. Will the server’s modem disconnect when the power drops? You can make this
observation at the remote terminal by commanding your server to shut down and
power off. (Use the AIX command shutdown -F.) Watch for the message NO
CARRIER on your remote terminal.
If Yes, this is the correct response. The modem is set up correctly.
If No, try another &Dn setting for your server’s modem. See your model manual for
this information. The &Dn command appears in three places each in three of the
sample modem configuration files. Only the following sample modem configuration
files contain the &Dn command (in three places each):
v modem_f.cfg
v modem_f0.cfg
v modem_f1.cfg
If you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response.
If your remote terminal does not disconnect after logging off, you must command
the remote terminal emulator to hang up. This then breaks the connection.
Recovery Strategy
The recovery strategy consists of making two calls to establish a remote session. This
solution is the easiest to implement and allows more freedom for configuring your
server’s serial ports.
To set up a remote terminal session, dial into the service processor and start the
system. After the operating system is loaded and initialized, the connection will be
dropped. At this point, call the server back and the operating system will answer and
offer you the login prompt.
ICDelay 1
DefaultTO 10
CallDelay 120
#
# %N Call-Out phone number %R Return phone number
#
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
ICDelay 1
DefaultTO 10
CallDelay 120
#
# %N Call-Out phone number %R Return phone number
#
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# Z Reset to factory defaults Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# Z0 Reset. Restore Profile 0 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F Reset to default profile Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F0 Reset. Restore profile 0 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F1 Reset. Restore profile 1 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
Read the safety information in “Safety Notices” on page xi before using these
procedures.
Note: To put the drawer in the front-service position, the drawer stops have to be
removed temporarily.
1 2
587
Drawer Rail Assembly and Installation
The drawer rails require assembly before you put them in your rack.
Use two screws, part number 00G1268, to attach an anti-tip guide (2), part number
09P0327, to each rail. The left rail (1) is part number 03N3845 and the right rail (5) is
part number 03N3847.
Rail Installation
As you install the right rail into the rack, install the drawer latch reset bracket, part
number 11K0851, above the right rail. Refer to the Installation Guide for your system
drawers to install the rails in your rack.
1
2
I/O Drawer Cables: If present, connect the RIO drawer cables to the I/O drawer and
secure them with the RIO cable retainer (3), part number 11K0584. Use two screws (2),
part number 00G1268.
To hold the cables, use the cable hanging brackets (1), part number 11K1013, as
needed.
Route the cables over the cable guides, and secure the cables with Velcro strips, part
number 07H6855 or 51H9502.
Route the cables over the cable guides, and secure the cables with Velcro strips (2),
part number 07H6855 or 51H9502.
593
F LCD progress indicator log 451
LED status 22
fan controller card removal 512
loading the system diagnostics 401
firmware 443
location codes 29, 33
level 443
AIX 32
update 443
format 29
firmware checkpoints 156
physical 29
firmware updates
locations 9
system 442
AIX 33
firmware/POST error codes 212
bays 23
flow, data 4
cable diagram 64, 65
flow, IPL 147
CEC card cage 12
front service position 520
CEC drawer front 9
FRU
CEC drawer front, bezel removed 10
hot-pluggable 470
CEC drawer rear 11
index 171
location code reference tables 33
isolation 77
logical 29
G memory DIMM 20
memory riser card 20
general user menu 413
PCI board 18
H physical 29, 33
handling static-sensitive devices 468 primary I/O drawer bay 16
hot-pluggable FRUs 470 primary I/O drawer front 13
primary I/O drawer operator panel 22
I primary I/O drawer rear 15
I/O drawer SCSI IDs 23
backplane assembly 498 secondary I/O drawer bay 17
features 25 secondary I/O drawer front 14
PCI slot LED definitions 470 secondary I/O drawer operator panel 22
power distribution board 503 system board 19
RIO bus adapter 501
indicators, POST 7 M
information, reference 1 maintenance analysis procedures 75
installing cables 53 maintenance analysis procedures (MAPs) 81, 82
internal disk drives removal 509 entry MAP 81
IPL minimum configuration MAP 122
flow 147 power MAP 94
phases 147 problem determination MAP 89
isolation, FRU 77 quick entry MAP 82
memory configuration/deconfiguration menu 426, 427
J memory riser cards and memory modules 481
JTAG, cable connection 53 memory, description 24
messages, service processor checkpoints 150
K minimum configuration MAP 75, 122
keys minus 48 vdc power
numeric 1 key 7 CEC backplane 497
numeric 5 key 7 CEC fans 490
numeric 6 key 8 CEC power supplies 492
numeric 8 key 8 CEC regulator card 484
CEC RIO Bus Adapter 493
L memory riser cards and memory modules 481
language selection menu 431 processor card 485
language, select 465 modem
laser compliance statement xiii configuration file selection 562
laser safety information xiii configurations 561
Index 595
replacement procedure (continued) service processor (continued)
memory modules 482 processor configuration/deconfiguration menu 425
restart recovery 440 reboot/restart policy setup menu 421
RETAIN 437 ring-indicate power-on menu 420
retries 436 serial port selection menu 434
ring-indicate power-on menu 420 serial port snoop setup menu 419
RIO cabling 54 serial port speed setup menu 434
riser card memory placement rules 24 service processor reboot/restart recovery 440
setup 551
S setup checklist 551
safety xi setup menu 416
electrical xii system information menu 424
laser compliance statement xiii system power control menu 420
notices xi telephone setup menu 435
rack xi test 551
saving service processor settings 551 service processor checkpoints 150
SCSI IDs 23 service processor error codes 233
secondary I/O drawer cable diagram 65 service processor feature 78
secondary I/O drawer operator panel locations 22 setup menu 416
self-test, power-on 6 SPCN cabling 59
serial port SPCN error codes 177
selection menu 434 specifications 67
snoop setup menu 419 7014 Series Model T00 Rackrack 68
speed setup menu 434 cec drawer 67
service clearance 70 I/O drawer 69
service inspection guide 73 noise emission notes 71, 548
service mode 401 standalone diagnostics 401
service processor strategy, console 6
backup settings 551 system diagnostics, loading 401
boot mode menu 423 system firmware update messages 388
call-in/call-out setup menu 432 system firmware updates 442
call-out call-home 447 system information menu 424
call-out policy setup menu 436 system management services 455
checklist 551 display error log 457
checkpoints 150 multiboot 462
console mirroring 449 OK prompt 465
customer account setup menu 437 password utilities 456
error log 450 remote initial program load setup 457
functions 438 SCSI utilities 461
general user menu 413 select console 461
language selection menu 431 select language 465
main menu 414 system memory 24
memory configuration/deconfiguration menu 426, system power-on methods 439
427 system specifications 67
menus 412 system, powering on and off 5
menus list 411
modem configuration menu 433 T
monitoring - surveillance 446 tape drive removal 515
operational phases 452 telephone setup menu 435
OS surveillance setup menu 418 testing the setup
passwords 417 call-in 552
power control network utilities menu 430 call-out 552
power-on methods 439 trademarks xviii
privileged user menus 414 transfer of a modem session 566
procedures in service mode 437
V
V/S COMM, cable connection 53
W
Web sites
AIX library 426, 462
firmware updates 442, 443
microcode updates 442, 443
RS/6000 and pSeries publications xvii
Index 597
598 Service Guide
Readers’ Comments — We’d Like to Hear from You
RS/6000 Enterprise Server Model M80
Eserver pSeries 660 Model 6M1
Service Guide
Overall, how satisfied are you with the information in this book?
How satisfied are you that the information in this book is:
When you send comments to IBM, you grant IBM a nonexclusive right to use or distribute your comments in any
way it believes appropriate without incurring any obligation to you.
Name Address
Company or Organization
Phone No.
_________________________________________________________________________________
Readers’ Comments — We’d Like to Hear from You Cut or Fold
SA38-0571-01 IBMR Along Line
NO POSTAGE
NECESSARY
IF MAILED IN THE
UNITED STATES
Information Development
Department H6DS-905-6C006
11400 Burnet Road
Austin, TX 78758-3493
_____________________________________________________________________________
Fold and Tape Please do not staple Fold and Tape
Cut or Fold
SA38-0571-01 Along Line
IBMR
Printed in U.S.A.
September 2001
SA38-0571-01
Spine information: