HP XC System Software Administration Guide Version 3.2

18.3.1 Understanding the csys Utility in the Mounting Instructions............................................218
18.3.2 Mounting Internal File Systems...........................................................................................219
18.4 Mounting Remote File Systems...................................................................................................222
18.4.1 Understanding the Mounting Instructions.........................................................................223
18.4.2 Mounting a Remote File System..........................................................................................224
19 Managing Software RAID Arrays..........................................................................227
19.1 Overview of Software RAID........................................................................................................227
19.1.1 Software RAID-0..................................................................................................................227
19.1.2 Software RAID-1..................................................................................................................227
19.2 Installing Software RAID on the Head Node..............................................................................227
19.3 Installing Software RAID on Client Nodes.................................................................................227
19.4 Examining a Software RAID Array.............................................................................................228
19.5 Error Reporting............................................................................................................................229
19.6 Removing Software RAID from Client Nodes............................................................................229
20 Using Diagnostic Tools...........................................................................................231
20.1 Using the sys_check Utility.........................................................................................................231
20.2 Using the ovp Utility for System Verification.............................................................................231
20.3 Using the dgemm Utility to Analyze Performance.....................................................................237
20.4 Using the System Interconnect Diagnostic Tools........................................................................238
20.4.1 HP XC Diagnostic Tools for the Myrinet System Interconnect...........................................238
20.4.1.1 The gm_prodmode_mon Diagnostic Tool...................................................................238
20.4.1.2 The gm_drain_test Diagnostic Tool.............................................................................239
20.4.2 Using Diagnostic Tools for the Quadrics System Interconnect...........................................239
20.4.2.1 The swmlogger Daemon.............................................................................................239
20.4.2.2 The qselantestp Diagnostic Tool..................................................................................240
20.4.2.3 The qsnet2_level_test Diagnostic Tool.........................................................................241
20.4.2.4 The qsnet2_drain_test Diagnostic Tool........................................................................243
20.4.3 Using Diagnostic Tools for the InfiniBand Interconnect.....................................................243
20.4.4 Using Diagnostic Tools for the Gigabit Ethernet System Interconnect...............................244
21 Troubleshooting........................................................................................................245
21.1 General Troubleshooting.............................................................................................................245
21.1.1 Cannot Connect to Database During Configuration...........................................................245
21.1.2 Mismatched Secure Shell Keys............................................................................................246
21.1.3 NFS Mount Failure (Permission Denied)............................................................................246
21.1.4 NFS Attribute Caching on Large-Scale Systems.................................................................246
21.1.5 Stale Metrics Data................................................................................................................246
21.2 Nagios Troubleshooting...............................................................................................................247
21.2.1 Determining the Status of the Nagios Service.....................................................................247
21.2.2 Nagios Fails to Start.............................................................................................................247
21.2.3 Nagios Log Files..................................................................................................................248
21.2.4 Running Nagios Plug-Ins Manually....................................................................................248
21.2.5 Using the nrg Command's Analyze Mode..........................................................................248
21.2.6 Multiple %EXPR% Expressions Are Not Accepted in the nagios_vars.ini File..................249
21.3 Messages Reported by Nagios.....................................................................................................249
21.4 System Interconnect Troubleshooting.........................................................................................252
21.4.1 Myrinet System Interconnect Troubleshooting...................................................................252
21.4.2 Quadrics System Interconnect Troubleshooting.................................................................253
21.4.3 InfiniBand System Interconnect Troubleshooting...............................................................255
21.4.4 OFED Troubleshooting Procedures.....................................................................................257
21.5 Improved Availability Issues.......................................................................................................260
Table of Contents 9