RS/6000 SP IBM SP Switch Service Guide GA22-7443-04
RS/6000 SP IBM SP Switch Service Guide GA22-7443-04
Note! Before using this information and the product it supports, read the information in “Safety and environmental notices” on page ix and “Notices” on page A-1. Fifth Edition (April 2002) This book replaces GA22-7443-03. IBM welcomes your comments.
Contents Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii Safety and environmental notices . Safety notices . . . . . . . . . Danger notices. . . . . . . . Caution notices . . . . . . . Laser safety information . . . . Environmental notices . . . . . . Product recycling and disposal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Selecting the switch clock source . . . . . . . . . . Determining the correct switch clock source . . . . . . Removing and restoring switch resources . . . . . . . . Removing a switch assembly from the active configuration . Restoring a switch assembly to the active configuration . . Viewing switch partitions . . . . . . . . . . . . . Service position procedures . . . . . . . . . . . . . Placing a switch assembly into service position . . . . . Replacing a switch assembly from service position . . . .
Figures 1-1. 1-2. 1-3. 2-1. 2-2. 2-3. 2-4. 2-5. 2-6. 2-7. 2-8. 2-9. 2-10. 4-1. 4-2. 4-3. 4-4. 4-5. 4-6. SP Switch high-level diagram . . . . . . . . . . . . . . . . . . . . . . . . SP Switch wrap plugs . . . . . . . . . . . . . . . . . . . . . . . . . . SP Switch inner chassis and front chassis cables . . . . . . . . . . . . . . . . Front view of frame locations . . . . . . . . . . . . . . . . . . . . . . . . Front view of multi-switch frame locations . . . . . . . . . . . . . . . . . . .
vi RS/6000 SP: SP Switch Service Guide
Tables 1-1. 1-2. 1-3. 1-4. 1-5. 1-6. 1-7. 1-8. 1-9. 1-10. 2-1. 2-2. 2-3. 3-1. 3-2. 3-3. 3-4. 5-1. 5-2. 5-3. 5-4. 5-5. Switch problem diagnostics . . . . . . . . . . . . . Environmental messages for switches . . . . . . . . . Switch connector resistance values . . . . . . . . . . Fan failure diagnostics . . . . . . . . . . . . . . . Inner chassis cable continuity . . . . . . . . . . . . Front chassis cable continuity . . . . . . . . . . . . SP Switch error conditions . . . . . . . . . . . . .
viii RS/6000 SP: SP Switch Service Guide
Safety and environmental notices For general information concerning safety, refer to Electrical Safety for IBM Customer Engineers, S229-8124. For a copy of the publication, contact your IBM account representative or the IBM branch office serving your locality. Safety notices The following is a list of all safety notices (in English only) pertaining to SP hardware maintenance tasks from this and other RS/6000 SP hardware publications.
DANGER Before you connect the power cable of this product to ac power, verify that the power receptacle is correctly grounded and has the correct voltage. (SPSFD004) DANGER During an electrical storm, do not connect or disconnect any cable that has a conductive outer surface or a conductive connector. (SPSFD005) DANGER Switch off power and unplug the machine power cable from the power receptacle, before removing or installing any part that is connected to primary power.
DANGER The remaining steps of the procedure contain measurements that are taken with power on. Remember that hazardous voltages are present. (SPSFD013) DANGER The frame main circuit breaker and the controller must not be switched on again now. Before disconnecting the power cables from the power receptacles, ensure that the customer’s branch distribution circuit breakers (customer power source circuit breakers) are Off and tagged with DO NOT OPERATE tags, S229-0237.
CAUTION: The covers are to be closed at all times except for service by trained service personnel. (SPSFC003) CAUTION: When the unit is being serviced, the covers should not be left off or opened while the machine is running unattended. (SPSFC004) CAUTION: Due to weight of each thin node (under 18 Kg [40 lbs]), use care when removing and replacing thin nodes above shoulder height. (SPSFC005) CAUTION: The wide node weight may exceed 32 Kg (70.5 lbs).
CAUTION: v When moving frames into position, team members should work together. Using one person on each corner of the frame can prevent strain. v In raised floor installations, mechanically safe moldings should be installed around floor cutouts. Extreme caution should be used when moving frames during installation or removal because of the proximity of floor cutouts to casters.
This product might contain nickel-cadmium or lithium batteries in communication adapters. The batteries must be recycled or disposed of properly. Recycling facilities might not be available in your area. In the United States, IBM has established a collection process for reuse, recycling, or proper disposal of used sealed lead-acid, nickel-cadmium and nickel metal hydride batteries and battery packs from IBM equipment.
About this book This book is part of the RS/6000® SP™ hardware service library and applies to the RS/6000 SP Switch.
How to send your comments Your feedback is important in helping to provide the most accurate and highest quality information. If you have any comments about this book or any other RS/6000 SP documentation: v Send your comments by e-mail to mhvrcfs@us.ibm.com. Be sure to include the name of the book, the order number of the book, and, if applicable, the specific location of the text you are commenting on (for example, a page number or table number).
Summary of changes GA22-7443-04 This edition contains replaces GA22-7443-03 and any update versions made to that level and makes them obsolete. This edition contains minor changes and fixes to softcopy cross-book links. GA22-7443-03 This edition contains replaces GA22-7443-02 and any update versions made to that level and makes them obsolete. This edition contains minor changes and fixes to softcopy cross-book links.
xviii RS/6000 SP: SP Switch Service Guide
Chapter 1. Maintenance Analysis Procedures (MAPs) This chapter provides information for identifying problems and guides you to the most likely failed Field Replaceable Unit (FRU). The MAPs then refer you to the FRU Removal/Replacement procedures for the corrective action.
Switch description and problem determination (MAP 0590) FRUs include: Fans, circuit breaker/LED card, switch supervisor card, switch power card(s), inner chassis cable, front chassis cable, complete assembly. SPS-8 There are only 8 ports. All clocks are distributed through data cables. FRUs include: Fans, circuit breaker/LED card, switch supervisor card, switch power card(s), inner chassis cable, front chassis cable, complete assembly.
Switch description and problem determination (MAP 0590) There are two LEDs on the front of each SP Switch. For quick reference, their definitions are as follows: Environment (Yellow) Off No environmental problems detected by switch supervisor card. On Warning of environmental condition out of nominal range. Preventative Maintenance should be scheduled for this switch. Flashing Serious environmental condition detected; power shut off. Power (Green) Off No 48 V dc power available at the SP Switch.
Switch description and problem determination (MAP 0590) Table 1-1. Switch problem diagnostics Priority 1 (1 of 3) 2 (2 of 3) 3 (3 of 3) Message or condition Action Environmental problems v Errpt: “Failure...” v Log: “Shutdown: Voltage...” v Log: “Shutdown: Fan...” v Log: “Shutdown: Temperature...” v Errpt: “Warning...” v Log: “Warning:...” v Environment (Yellow) LED- on or flashing Go to “SP Switch environment (MAP 0600)” on page 1-5.
SP Switch environment (MAP 0600) SP Switch environment (MAP 0600) Purpose of this MAP This MAP provides diagnostic information for switch problems that are related to the operating environment. Note: Refer to “Service position procedures” on page 3-9 for placing a switch into the service position or for removing the switch from the service position. Step 0600-001 You have arrived at this MAP from Table 1-1 on page 1-4. 1.
SP Switch environment (MAP 0600) Step 0600-004 Perspectives indicated a shutdown condition and Table 1-2 on page 1-5 directed you to this step. 1. One or more of the following conditions exist: v Voltage out of range: +5 V “shutdownP5” v Voltage out of range: +12 V “shutdownP12” v Voltage out of range: −5 V “shutdownN5” 2. Have the customer remove the SP Switch from the active configuration and power off the SP Switch. 3. Put the switch into service position.
SP Switch environment (MAP 0600) 3. Is the measured resistance now within the acceptable range? v If yes, go to “Step 0600-017” on page 1-10 to verify fix. v If no, go to “Step 0600-008”. Step 0600-008 You replaced the inner chassis cable and the front chassis cable but the measured resistance is still outside of the acceptable range. 1. This indicates that there is still a problem. 2. Replace the switch supervisor card. 3. Go to “Step 0600-017” on page 1-10 to verify fix.
SP Switch environment (MAP 0600) Table 1-4. Fan failure diagnostics Priority 1 Component Action Fan 1, 2, 3, 4 or 5 a. Check specified fans for blockages or loose cable connections. (1 of 5) b. Fix any obvious problems and continue at “Step 0600-012”. c. If you do not find any problems, continue at Priority 2. 2 Fan 1, 2, 3, 4 or 5 a. Replace fans as described in Chapter 4, “FRU removals and replacements” on page 4-1. (2 of 5) 3 b. Continue at “Step 0600-012”. Switch supervisor card a.
SP Switch environment (MAP 0600) 6. Check the Environmental (yellow) LED for an ON or FLASHING condition. 7. Is the Environmental (yellow) LED ON or FLASHING? v If the Environmental LED is on or flashing: a. Put the SP Switch’s circuit breaker into the Off (‘0’) position. b. Return to “Step 0600-011” on page 1-7 and continue service with the next highest priority. v If the Environmental LED is not on or flashing: a. You have resolved the problem. b. Go to “Step 0600-017” on page 1-10 to verify fix.
SP Switch environment (MAP 0600) 2. 3. 4. 5. Remove blockage. If required, put the switch into service position (refer to “Service position procedures” on page 3-9). With all cables replugged and Environmental (yellow) LED OFF, power on the SP Switch. Go to “Step 0600-017” to verify fix. Step 0600-017 You have repaired or replaced a component. 1. If necessary, reinstall SP Switch cover. 2. If required, take the switch out of the service position (refer to “Service position procedures” on page 3-9). 3.
SP Switch environment (MAP 0600) 2. Remove the switch supervisor card (refer to “Removing the switch supervisor card” on page 4-4). 3. Using a digital multimeter, measure resistance at the planar connection for the supervisor card, between pins 12A and 12B. v The resistance should be in a range of 4 to 20 ohms. 4. Is the measured resistance in the acceptable range? v If yes: a. Replace supervisor card as described in “Replacing the switch supervisor card” on page 4-5. b. Go to “Step 0600-017” on page 1-10.
SP Switch environment (MAP 0600) Step 0600-025 You have a PS1Fail problem 1. Replace power supply card PS1. 2. Reinstall the SPS front panel assembly, being careful to align the guide pins on the P1 to the inner chassis cup guide. 3. Replug the power cable (J1) and supervisor cable (J2) to the rear of the assembly. 4. Put the circuit breaker in the On (‘1’) position. 5. Does the PS1Fail condition still exist? v If yes, go to “Step 0600-026”. v If no, go to “Step 0600-017” on page 1-10.
SP Switch environment (MAP 0600) Step 0600-029 You have a PSFuseGood problem that did not go away when you removed power supply PS1. 1. Put the circuit breaker in the Off (‘0’) position. 2. Remove power cable (J1) and supervisor cable (J2) from the rear of the SP Switch. 3. Remove the front panel assembly. 4. Replug power supply PS1 and remove power supply PS2. 5. Reinstall the SPS front panel assembly, being careful to align the guide pins on the P1 connector to the inner chassis. 6.
SP Switch power (MAP 0610) Step 0610-001 A message in Perspectives indicated that you have a power problem and Table 1-1 on page 1-4 or Table 1-2 on page 1-5 directed you to this MAP. 1. From a Perspectives window on the control workstation or by looking at the SP Switch, check the Power (green) LED for this SP Switch. 2. The definition of the Power (green) LED is as follows: Power (Green) Off No 48 V dc power available at the SP Switch.
SP Switch power (MAP 0610) Step 0610-005 When you put the circuit breaker into the On (‘1’) position, the circuit breaker tripped into the Off (‘0’) position. 1. Have the customer remove the SP Switch from the active configuration and power off the SP Switch. 2. Put the switch into service position (refer to “Service position procedures” on page 3-9). 3.
SP Switch power (MAP 0610) v If yes: a. Have the customer remove the SP Switch from the active configuration and power off the SP Switch. b. Go to “Step 0610-010”. v If no: a. Fix any cable connection problems. b. Return to “Step 0610-006” on page 1-15. Step 0610-010 You have an SP type switch that is either tripping the circuit breaker to the Off position or the circuit breaker is on but the Power (green) LED is not lit. 1. Turn off the circuit breaker. 2.
SP Switch power (MAP 0610) a. Replace the front chassis cable. b. Return to “Step 0610-004” on page 1-14 to verify the replacement cable. v If no: a. Replace the circuit breaker. b. Return to “Step 0610-004” on page 1-14 to verify the replacement breaker. Step 0610-013 You have continuity on all cables. 1. If this is the first time through this step, replace the supervisor card. v Return to “Step 0610-004” on page 1-14 to verify the replacement breaker.
SP Switch function (MAP 0620) splstdata -n | pg 5. From the system file server, log into “primary” processor node as root using the telnet command: telnet PrimaryNodename 6. Check errpt -a -N Worm | pg for any switch related problems. If any errors are listed, use the error information, with this MAP, to help isolate the problem. 7. Check the functional state of the switch in the out.top file by issuing the following command on the “primary” processor node: pg /var/adm/SPlogs/css/out.top Notes: a.
SP Switch function (MAP 0620) Table 1-7. SP Switch error conditions Error # 2 (SPS) Message/condition Description and action Initialized Description: Initialization detected a wrapped port where a processor node or dependent node was expected (this may result from isolation procedures), or else a disconnected cable. Action: 1. If this is an unexpected condition, check cabling against the configuration file var/adm/SPlogs/css/out.top. 2.
SP Switch function (MAP 0620) Table 1-7. SP Switch error conditions (continued) Error # −3 (SPS) Message/condition Description and action Device status: Device has been Description: Possible hardware problem. removed from network because of a Action: Go to “Step 0620-004” on page 1-21.
SP Switch function (MAP 0620) Table 1-7. SP Switch error conditions (continued) Error # −8 (SPS) Message/condition Description and action Device status: Device has been Description: Initialization of this link detected a different switch removed from network because of a node number than the one expected. miswire Action: Link status: Link has been removed 1. Check this connection against the actual hardware cabling. from network, probable miswire 2.
SP Switch function (MAP 0620) Refer to notes at beginning of “SP Switch function (MAP 0620)” on page 1-17 for more information on cable connections and logical-to-physical frame translations. c. Be aware that for logical frames consisting of two physical frames, all out.top even-numbered nodes represent nodes physically located in the expansion frame (F/C 1010) but at the next lower slot number. For example, L01-N04 (logical frame 1, node 4) could be physically located at E02-N03 (physical frame 2, slot 3).
SP Switch function (MAP 0620) 1. Make sure a wrap plug is properly installed on the connector. Refer to Figure 1-2 on page 1-4 for views of the switch wrap plugs. 2. Have customer check (and update if necessary) the switch configuration file appropriately before continuing: v For code level 1.02 and higher, use the Etopology command. 3. Did you just correct a problem with a wrap plug or correct the switch configuration? v If yes, go to “Step 0620-043” on page 1-36.
SP Switch function (MAP 0620) v If yes: a. An SRN was obtained from diagnostics. b. Use the following SRN table (Table 1-9) to continue service. c. After using Table 1-9, go to “Step 0620-012” on page 1-26. v If no: a. The diagnostics did not detect a problem, therefore the problem is in SP Switch. b. Go to “Step 0620-041” on page 1-35. Table 1-9.
SP Switch function (MAP 0620) Table 1-9. Service Request Number (SRN) table for SP Switch adapters (continued) Service Request Number SRN source Failing component(s) Description 763-200 through 763-299 (except 763-282) D ext clock SPS MX adapter wrap plug Problem detected with the external clock (SP switch). 763-282 D switch cable Adapter is good but switch cable is not attached. 763-2A0 through 763-2A9 D SPS MX adapter Problem detected with the internal clock.
SP Switch function (MAP 0620) Table 1-9. Service Request Number (SRN) table for SP Switch adapters (continued) Service Request Number SRN source Failing component(s) Description 764-2A0 through 764-2A9 D SP System Attachment Adapter Problem detected with the internal clock. 764-3xx (See note) D SP System Attachment Adapter system or I/O planar Problem detected with POS registers. 764-4xx (See note) D SP System Attachment Adapter software TBIC test failed.
SP Switch function (MAP 0620) Table 1-10. Switch problem priority listing Priority 1 Failing component Action Software a. Have customer verify that the software is configured and operating correctly for this processor node/system. b. If no problem is found, continue with next highest priority item in the list for this SRN. c. Otherwise, power off this processor node and continue service at “Step 0620-017” on page 1-28. External clock a.
SP Switch function (MAP 0620) Step 0620-013 An SRN listed in Table 1-9 on page 1-24 indicated that you have a switch clock problem and Priority 2 in Table 1-10 on page 1-27 directed you to this step. 1. Check the switch clock selections on the “problem” switches . 2. Do the clock settings on the “problem” SP Switches appear to be okay? v If yes, go to “Step 0620-014”. v If no a. Change the clock selections on the appropriate switches (refer to “Selecting appropriate switch clocks” on page 3-6). b.
SP Switch function (MAP 0620) Step 0620-019 Since the diagnostics did not return an SRN, no problem was detected. 1. Have you just reseated, repaired, or replaced a component? v If yes, go to “Step 0620-043” on page 1-36 to verify fix. v If no: a. Problem is in the SP Switch. b. Go to “Step 0620-041” on page 1-35 to service or replace switch assembly. Step 0620-020 The advanced diagnostics returned an SRN indicating that the diagnostics detected a problem. 1.
SP Switch function (MAP 0620) 2. 3. 4. 5. 6. Note: When unplugging the two ends of the suspect cable, check /var/adm/SPlogs/css/out.top to verify only two (2) ports were lost. If four (4) ports were lost, then two cables were swapped across that switch to switch connection. Connect the cables correctly and check /var/adm/SPlogs/css/out.top for any other errors.
SP Switch function (MAP 0620) v If yes, go to “Step 0620-027”. v If no: a. The problem switch is providing the master clock for the system. b. Have customer use the Eclock command to select a different master clock for the system. c. Go to “Step 0620-043” on page 1-36 to verify fix. Step 0620-027 The clock selection is set to one of the External clocks. 1. Select a different external clock for this SP Switch. 2. From the control workstation of the primary node reinitialize the switch by entering: Estart 3.
SP Switch function (MAP 0620) v S00-BH-J9 5. Disconnect the switch data cable at the tailgate of the frame containing the processor node. 6. Connect the end of the processor node data cable to the jack. 7. Run advanced diagnostics on “css0” on the “test” processor node and its associated switch port. v Do not perform the cable wrap test. v Refer to ″Running diagnostics in a processor node″ in RS/6000 SP: System Service Guide. 8. Look for an SRN indicating a clock problem (such as “External clock”). 9.
SP Switch function (MAP 0620) Step 0620-033 All ground straps connecting the frames make adequate contact at both ends. 1. Disconnect clock source data cable at S00-BH-J3, J5, J7, or J9, then reconnect to SP Switch. (This is done to eliminate clocking noise from cable.) 2. Find a processor node in the “problem” frame which is usable for service. This will be used as the “test” processor node. 3. Run advanced diagnostics on “css0” on the “test” processor node and its associated switch port.
SP Switch function (MAP 0620) 5. Set the switch clock selections on the “problem” switches (refer to “Selecting appropriate switch clocks” on page 3-6). 6. Run advanced diagnostics on “css0” on the “test” processor node and its associated switch port. v Do not perform cable wrap test. v Refer to ″Running diagnostics in a processor node″ in RS/6000 SP: System Service Guide. 7. Look for an SRN indicating a clock problem, such as “External clock” being listed as one of the failing components.
SP Switch function (MAP 0620) 5. Look for an SRN indicating a clock problem, such as “External clock” being listed as one of the failing components. 6. Do the “css0” diagnostics fail with indication of a clock problem? v If yes: a. Problem is the clock selection in this SP Switch. b. Return to “Step 0620-035” on page 1-33. v If no, go to “Step 0620-039”. Step 0620-039 The “css0” diagnostics did not fail. This indicates that the problem is not in this SP Switch. 1.
SP Switch function (MAP 0620) 2. Put the SP Switch’s circuit breaker into the On (‘1’) position. 3. Go to “Step 0620-043” to verify fix. Step 0620-043 You have replaced switch components and need to verify that the problem has been fixed. 1. Make sure any processor node(s) that was put in SERVICE mode is returned to NORMAL mode. Make sure all processor node(s) are IPLed completely in NORMAL mode. 2.
Chapter 2. Locations Naming standard for RS/6000 SP components . . . . . Format structure . . . . . . . . . . . . . . . Example of format structure . . . . . . . . . . Frame (WWW) . . . . . . . . . . . . . . . Major assembly (XXX) . . . . . . . . . . . . Sub-assembly (YY) . . . . . . . . . . . . . Connection location (ZZZZ) . . . . . . . . . . Examples for using complete levels of nomenclature . Location diagrams of the RS/6000 SP components . . . . Front and rear views of RS/6000 SP frame . . . . . .
– 01 - 99 for frames 1-99 (specific to that frame) Notes: 1. E01 designates RS/6000 SP physical frame 1 2. L00 designates any/all RS/6000 SP logical frames 3. S00 designates any/all RS/6000 SP multi-switch frames 4.
Front and rear views of RS/6000 SP frame Figure 2-1 shows a front view of the RS/6000 SP frame locations. “Frame (FRA)” on page 2-6 describes the assembly designations for the RS/6000 SP frame. Figure 2-1. Front view of frame locations. See notes below. Figure notes: 1. Wide processor nodes take up an entire shelf position (two thin processor node slots). They are identified by the odd numbered position. 2. In a F/C 2030/1 frame, switch assemblies take up an entire shelf partition.
Figure 2-2 shows a front view of the RS/6000 SP multi-switch frame. SP Switch Frame Locations - Front View Switch Asm S16 Switch Asm S14 Switch Asm S12 Switch Asm S10 Switch Asm S8 Switch Asm S6 Switch Asm S4 Switch Asm S2 Main Power Switch with LED Left Skirt Right Skirt A B C 48 V Power Modules (Front Cover and Skirt, and Air Filter Removed) Figure 2-2. Front view of multi-switch frame locations Figure 2-3 on page 2-5 shows a front view of the Model 3AX (49-inch) frame.
Figure 2-3. Front view of 49-inch frame locations. See notes below. Figure notes: 1. Wide processor nodes take up an entire shelf position (two thin processor node slots). They are identified by the odd numbered position. 2. In a F/C 2030/1 frame, switch assemblies take up an entire shelf partition. (They are identified by the even-numbered position.) 3. Processor node slots are numbered up to N8. 4. The single-phase SEPBU power unit must have a power module in position “D” (right-most slot).
Figure 2-4. Rear view of frame locations Note: See notes under Figure 2-1 on page 2-3 for processor node/switch assembly numbering. Frame locations Figure 2-1 on page 2-3 shows a front view of the RS/6000 SP frame locations, with numbered processor nodes, and the three phase SEPBU.
G6: Front door ground G7: Rear door ground G8: Ground SW: Power-on switch LD: LED card FC: Front cover RC: Rear cover Example: E01-FRA-G1 Chapter 2.
Switch assembly locations J1 J2 Supervisor Bus Card Switch Planar Air Baffle Power Supply 2 Supervisor Card P5 P6 Fan 5 Fan 4 CB & LED Card Figure 2-5.
Connector details Figure 2-6 shows RS/6000 SP component connector details. Figure 2-6. RS/6000 SP connector details (as seen at receiving ends, not at cable ends) Cable routing Figure 2-7 on page 2-10 and Figure 2-8 on page 2-10 show back views of the RS/6000 SP frame, showing the horizontal and vertical paths of cable routing from connector-to-connector, with the depth amplified on the drawing. Chapter 2.
Note: When attaching exterior and interior cables to a POWER3 SMP High Node allow for enough cable for a 2-foot service loop for node movement into service position. Figure 2-7. Frame cabling routing path in rear of RS/6000 SP frame — 1.93 m frame Figure 2-8. Frame cabling routing path in rear of RS/6000 SP frame — 2.01 m frame Note: For a multi-switch frame (F/C 2030/1), refer to Figure 2-7.
Table 2-1 shows external cable routing in a RS/6000 SP frame populated with 16 processor nodes. (Refer to “Cable routing” on page 2-9 to see the routing paths.) Table 2-1.
Figure 2-9. Frame cable routing paths in rear of RS/6000 SP multi-switch frame (F/C 2030/1) — 1.93 m frame Figure 2-10. Frame cable routing paths in rear of RS/6000 SP multi-switch frame (F/C 2030/1) — 2.
Switch data cables SPS data cables Table 2-2 describes the attachment locations and routing for the internal SPS Switch data cables: Table 2-2.
2-14 RS/6000 SP: SP Switch Service Guide
Chapter 3. Service procedures Personal ESD requirements . . . . . . . . . . . . . Tools and files overview . . . . . . . . . . . . . . . Using the css.snap script . . . . . . . . . . . . . . Switch supervisor self-test . . . . . . . . . . . . . . Verification tests using Perspectives . . . . . . . . . . Node supervisor verification . . . . . . . . . . . . Frame supervisor verification . . . . . . . . . . . . Switch supervisor verification . . . . . . . . . . . . Selecting appropriate switch clocks. . . . .
Table 3-1. Service procedure tools Utility (see note) Runs on Description fault_service_Worm_RTG All nodes Monitors the switch for faults. It restarts the switch if a fault is detected. fs_monitor All nodes Monitors the adapter for interrupts that have not been serviced. This is required because of a hardware oversight in MSMU, in which there is a window in time that causes it to hold interrupts and then forget to introduce them. rc.switch All nodes Starts the daemons and IP configuration tools.
Table 3-3. Tuning output files (continued) File (see note) Location Description daemon.stderr Primary A record of which nodes were not initialized. out.top Primary Reports errors from the last tuning procedure. It begins as a copy of the topology file and errors are indicated to the right of each entry. out.top.old Primary A copy of out.top from the previous run. router.log Primary The router log file generated by the route table generator after initialization. router.log.
The files ending in .out are produced by running the appropriate command to dump internal (in memory) trace information or dump data to a file. The completed output file will be found in /var/adm/SPlogs/css/css.snap.[date-time]tar.Z. css.snap avoids flooding /var by following these rules: v If less than 10% of /var is free, css.snap exits. v If the CSS portion of /var is more than 30% of the total space in /var, css.snap erases old snap files until the CSS share sinks below 30%. If successful, css.
2. In the Node pane, click the icon of the node you want to verify 3. Click the ″Notebook″ icon on the tool bar v When the Notebook window opens, make certain that the ″Node Status″ tab is selected 4.
v ″Yes″ displayed in a red box indicates that the switch supervisor has failed and it is not responding to the frame supervisor. Note: Clicking ″Help″ in the Notebook window’s lower right corner displays attribute descriptions. Selecting appropriate switch clocks The following procedure describes how to select a clock source for a switch assembly, followed by an explanation on how to determine which clock source to use on each assembly.
1 Input 1 (BH-J3 for SPS) 2 Input 2 (BH-J4 for SPS) 3 Input 3 (BH-J5 for SPS) Table 3-4.
6. The customer can re-initialize the switch using the Estart command. The frame and processor nodes which were removed in this procedure will appear in the out.top file with error messages; however, the remainder of the switch resources are now available for customer use. 7. If switch re-initialization was successful, the customer can start running parallel jobs again. 8. Return to MAPs to continue service action(s). Restoring a switch assembly to the active configuration 1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Efence of primary and primary backup nodes By design, Efence of primary and primary backup nodes is not allowed. If you attempt to fence either of these nodes, you will get the following responses: Efence: 0028-147 Node number designates the Primary Node. Efence: 0028-166 Node number designates the Primary Backup Node. You should assign a new primary or primary backup node and initiate a restart to be able to remove these nodes from the network for service.
d. Choose “Normal Mode” e. Choose “Display Current Bootlist” This will display the current bootlist. 2. Power down the node, service it, and hook it back into the frame. 3. On the control workstation, run spbootins to set the node to boot in maintenance mode. For example, if it is node 12 of frame 2, enter: spbootins -r maintenance 2 12 1 4. On the control workstation, netboot the node: a. From the SP Perspectives Launch Pad, select ″Hardware Perspectives″ b.
The following 3 adapters require functional microcode to be installed: Adapter Package ® ESCON Control Unit Adapters Feature 2756 ™ BLKMUX S/370 Control Unit Feature 2755 FDDI Adapters Features 2723, 2724, 2725, 2726 ESCON BLKMUX FDDI These adapters might need updating to the latest level in their FLASH EPROM: Adapter Package SSA Adapters Features 6214, 6216, 6217, 7133 Drives SSAFLASH SCSI Adapters Features 2412, 2415, 2416 ECA192 Note: The ECA192 instructions differ from the above and are in
To complete the microcode update, it is usually necessary to remove and then replace the device from the configuration. The most reliable method to do this is to reboot the node. Some adapters can actually require a power off cycle to complete the microcode update. Others can be updated simply by running cfgmgr. Note: During microcode download for SSA adapters, there is a possibility that the download process could result in an error.
Chapter 4. FRU removals and replacements Handling static-sensitive devices . . . . Procedures for switch assemblies . . . . Removing the switch assembly . . . . Replacing the switch assembly . . . . Removing the switch fans . . . . . . Replacing the switch fans . . . . . . Removing the switch front chassis cable. Replacing the switch front chassis cable . Removing the switch supervisor card . . Replacing the switch supervisor card . .
Procedures for switch assemblies CAUTION: The unit weight exceeds 18 Kg (40 lbs) and requires two service personnel to lift. (SPSFC002) Attention: Components in the frame are susceptible to damage from static discharge. Always use an ESD wristband when working inside frame covers. (See “Personal ESD requirements” on page 3-1 for more details.) Do not touch the pins or circuitry on these components. Note There are different levels of the switch assembly hardware.
Removing the switch fans Note: Refer to “Handling static-sensitive devices” on page 4-1 before removing or installing ESD sensitive devices. Perform these procedures to remove a fan from an SP Switch assembly: 1. Perform “Placing a switch assembly into service position” on page 3-9 to place the switch assembly into the service position. 2. Remove the top cover of the switch assembly by removing the screws. 3. Locate the fan. 4. Push shock mounts toward center of fan. 5. Pull fan out of the fan bracket.
Perform these procedures to remove the fan control cable from an SP Switch assembly: 1. Perform “Placing a switch assembly into service position” on page 3-9 to place the switch assembly into the service position. 2. Unplug connectors P7, P8 and P9. Remove cable by unhooking retaining material along raceway, and unplugging the Fan connectors P2, P3, P4, P5 and P6. Using fingertip pressure, remove connector P1 by removing screws. Retain these screws for new cable installation. 3.
Replacing the switch supervisor card Perform these procedures to replace the supervisor card in an SP Switch assembly: 1. Insert supervisor card. 2. Rotate card thumb locks inward to seat card. 3. Plug connector P5. 4. Perform “Replacing a switch assembly from service position” on page 3-9 to remove the switch assembly from the service position. Switch Supervisor Card Figure 4-4.
Figure 4-5. Removing the SP Switch inner chassis cable Replacing the switch inner chassis cable Perform these procedures to replace the supervisor power cable from an SP Switch assembly: 1. Plug connectors P3, P4, P5 and P6. Route cable along the raceway, hooking retaining material where needed. Attach P1 connector to the rear of the switch assembly with screws retained in the removal procedure. 2.
4. Remove power supply card. Replacing the switch power cards Perform these procedures to replace the switch power cards) in an SP Switch assembly: 1. Insert power supply card. 2. Rotate card thumb locks inward to seat card. 3. Plug connector P4 (PS1) or P6 (PS2). 4. Perform “Replacing a switch assembly from service position” on page 3-9 to remove the switch assembly from the service position. 2 1 Switch Power Supply Cards Figure 4-6. Removing the SP Switch power supply cards Chapter 4.
4-8 RS/6000 SP: SP Switch Service Guide
Chapter 5. Parts catalog SPS, SPS-8 Switch assembly (feature) (view 1) . SPS, SPS-8 Switch assembly (feature) (view 2) . SPS, SPS-8 Switch assembly (feature) (view 3) . Switch cables (feature) . . . . . . . . . . Multi-switch frame (F/C 2030/1) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-2 . 5-4 . 5-6 . 5-8 . . . . . . . . . . . . . . . . . . . .
SPS, SPS-8 Switch assembly (feature) (view 1) 5-2 RS/6000 SP: SP Switch Service Guide
Table 5-1.
SPS, SPS-8 Switch assembly (feature) (view 2) 5-4 RS/6000 SP: SP Switch Service Guide
Table 5-2. SPS, SPS-8 Switch assembly (feature) (view 2) Assembly index Part number Units Description SPS Switch assembly (reference only) SPS-8 Switch assembly (reference only) 1 11P0655 1 Inner chassis assembly SPS (includes all assembly in shaded area) 1 11P0656 1 Inner chassis assembly SPS-8 (includes all assembly in shaded area) 2 1624766 1 Screw, 6 mm 3 26H7389 1 Cable, inner chassis 4 46H9305 1 Card, switch supervisor 5 77G0961 1 Card, power Chapter 5.
SPS, SPS-8 Switch assembly (feature) (view 3) Air flow 3 1 2 8 5 4 7 6 5-6 RS/6000 SP: SP Switch Service Guide
Table 5-3. SPS, SPS-8 Switch assembly (feature) (view 3) Assembly index Part number Units Description SPS switch assembly (reference only) SPS-8 switch assembly (reference only) 1 26H7391 1 Cable, front chassis 2 32G1547 1 Screw, hex M4 x 5 3 46H9778 2 Cup guide (2.01 m, order kit) 3 77G1079 2 Pin guide (2.
Switch cables (feature) Table 5-4.
Chapter 5.
Multi-switch frame (F/C 2030/1) 5-10 RS/6000 SP: SP Switch Service Guide
Table 5-5.
5-12 RS/6000 SP: SP Switch Service Guide
Notices This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used.
Electronic emissions notices Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment.
For installations in Japan: The following is a summary of the VCCI Japanese statement in the box above. This is a Class A product based on the standard of the Voluntary Control Council for Interference by Information Technology Equipment (VCCI). If this equipment is used in a domestic environment, radio disturbance may arise. When such trouble occurs, the user may be required to take corrective actions.
A-4 RS/6000 SP: SP Switch Service Guide
Index Numerics 0375867 5-11 04H9469 5-11 08J5557 5-11 11J3975 5-8 11J3976 5-8 11J3977 5-8 11J3978 5-8 11J3979 5-8 11J3980 5-8 11J3981 5-8 11J3982 5-8 11J5189 5-11 11J5191 5-11 11J5193 5-11 11J5195 5-11 11J6091 5-8 11J6092 5-8 11J6093 5-8 11J6094 5-8 11J6095 5-8 11J6096 5-8 11J6097 5-8 11J6098 5-8 11J6099 5-8 11J6100 5-8 11J6101 5-8 11J6102 5-8 11J6103 5-8 11J6104 5-8 11J6105 5-8 11J6106 5-8 11P0006 5-8 11P0655 5-5 11P0656 5-5 1624766 5-5 1624776 5-11 17H5026 5-11 21L3932 5-11 26H7198 5-11 26H7212 5-3 26H725
frame locations 2-3, 2-5, 2-6 frame naming standard 2-1 frame supervisor verification 3-5 front chassis cable, SPS 1-8 front view of 49-inch frame locations 2-4 front view of frame locations 2-3 front view of multi-switch frame locations 2-4 H handling static-sensitive devices P partitioning, SPS 3-8 partitioning, switch 3-8 Perspectives, verification tests using 3-4 placing a SPS assembly into service position Procedures ESD 3-1 purpose of book xv task procedures overview xv 3-9 4-1 R I installing ada
SPS assembly, placing into service position 3-9 SPS assembly, removing 4-2 SPS assembly, replacing 4-2 SPS assembly, replacing from service position 3-9 SPS fan control cable, removing 4-3 SPS fan control cable, replacing 4-4 SPS fans, removing 4-3 SPS fans, replacing 4-3 SPS front chassis cable 1-8 SPS partitioning 3-8 SPS power card, removing 4-6 SPS power card, replacing 4-7 SPS resources, removing 3-7 SPS resources, restoring 3-7 SPS supervisor card, removing 4-4 SPS supervisor card, replacing 4-5 SPS s
X-4 RS/6000 SP: SP Switch Service Guide
Reader’s comments – We’d like to hear from you RS/6000 SP SP Switch Service Guide Publication No.
GA22-7443-04 IBM ___________________________________________________________________________________________________ Readers’ Comments — We’d Like to Hear from You Cut or Fold Along Line _ _ _ _ _ _ _Fold _ _ _ and _ _ _Tape _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _Please _ _ _ _ do _ _ not _ _ _staple _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _Fold _ _ _and _ _ Tape ______ PLACE POSTAGE STAMP HERE IBM Corporation Department 55JA, Mail Station P384 2455 South Road Poughkeepsie
IBM GA22-7443-04