HP XC System Software Administration Guide Version 3.2

21.5.1 How To Start HP Serviceguard When Only the Head Node is Running...........................260
21.5.2 Restart Serviceguard Quorum Server if Quorum Server Node is Re-imaged....................260
21.5.3 Known Limitation if Nagios is Configured for Improved Availability..............................260
21.5.4 Network Restart Command Negatively Affects Serviceguard...........................................261
21.5.5 Problem Failing Over Database Package Under Serviceguard...........................................261
21.6 SLURM Troubleshooting.............................................................................................................261
21.6.1 SLURM Configuration Issues..............................................................................................261
21.6.2 SLURM Run-Time Troubleshooting....................................................................................262
21.7 LSF-HPC Troubleshooting...........................................................................................................263
22 Servicing the HP XC System...................................................................................267
22.1 Adding a Node............................................................................................................................267
22.2 Replacing a Client Node..............................................................................................................269
22.3 Replacing a Server Blade OnBoard Administrator.....................................................................270
22.4 Replacing a System Interconnect Board in an HP CP6000 System.............................................272
22.5 Software RAID Disk Replacement...............................................................................................272
22.5.1 Replacing a RAID Disk........................................................................................................272
22.5.2 Writing a Boot Block to the RAID Disk...............................................................................274
22.6 Incorporating External Network Interface Cards........................................................................275
22.6.1 Gathering Information.........................................................................................................276
22.6.1.1 Gathering Node-Specific Information.........................................................................276
22.6.1.2 Determining NIC-Specific Information.......................................................................277
22.6.1.3 Gathering Networking Information............................................................................279
22.6.1.4 Consolidating Information in the NIC Data Worksheet.............................................279
22.6.2 Editing the platform_vars.ini File........................................................................................279
22.6.3 Using the device_config Command....................................................................................283
22.6.4 Updating the Database for the External Network Card......................................................283
22.6.5 Updating the Firewall Custom Configuration....................................................................284
22.6.5.1 Verifying the Updated CMDB.....................................................................................286
22.6.6 Reconfiguring the Nodes.....................................................................................................287
22.6.7 Verifying Success.................................................................................................................287
22.6.7.1 Verifying the Ethernet Port..........................................................................................288
22.6.7.2 Verifying the Ethernet Device.....................................................................................288
22.6.7.3 Testing the Network Connection.................................................................................288
22.6.8 Updating the Golden Image................................................................................................288
A Installing LSF-HPC with SLURM into an Existing Standard LSF Cluster...............289
A.1 Assumptions.................................................................................................................................289
A.2 Requirement.................................................................................................................................290
A.3 Sample Case..................................................................................................................................290
A.4 HP XC Preparation.......................................................................................................................290
A.5 Installing LSF-HPC with SLURM.................................................................................................295
A.6 Perform Post Installation Tasks....................................................................................................298
A.7 Configuring the LSF Alias............................................................................................................299
A.8 Starting LSF on the HP XC System...............................................................................................300
A.9 Sample Running Jobs....................................................................................................................301
A.10 Troubleshooting..........................................................................................................................301
B Installing Standard LSF on a Subset of Nodes.......................................................303
B.1 Requirements................................................................................................................................303
B.2 Assumptions.................................................................................................................................304
B.3 Sample Case..................................................................................................................................304
B.4 Instructions....................................................................................................................................304