Quadrics QsNetII Interconnect
These errors are filtered out unless you use jtest with the verbose option.
12.9 Performing Drain Time Testing
Unlike the production mode testing described in Section 12.7.2 and the log file
monitoring described in Section 12.7, drain time testing does have an impact
on cluster availability. Users are requested to quit the system so that you can
run a series of planned diagnostics, To ensure the stability of the interconnect,
HP recommends that you run this series of tests at least once every two weeks
for preventative maintenance purposes. You should archive the results of these
tests for reference purposes and for verification of instances when the system
was known to be operating correctly.
The following tests are recommended during drain time. You can run all tests as
scheduled jobs on all nodes. You must run the test in the specified sequence:
1. Environmental monitoring with qsnetstat, as described in Section 12.2.1.
2. The qsnetelantest, as described in Section 12.3 .
3. For systems running HP XC, run the qsnet2_level_test at level1,as
described in Section 12.5.
_______________________ Note _______________________
For systems not running HP XC, run the qsnet2_dmatest
command on all nodes, as described in Section 12.4. Specify the -Un
option argument where n is the number of levels in the cluster,
as defined in the cluster’s position file. (See Section 12.2.4 for
information on the position file).
_______________________ Note _______________________
Clusters might be configured with full bandwidth or reduced
bandwidth, as defined in the configuration rules and cabling tables
for your cluster. If your cluster has a reduced bandwidth you
must also apply the test constraints defined in Section 12.11. This
condition applies to both the qsnet2_level_test command
under HP XC and the qsnet2_dmatest command for systems not
running HP XC.
4. qsportmap, as described in Section 12.16.
5. qsnet2_level_test at level2, as described in Section 12.5.
6. qsnet2_level_test at level3, as described in Section 12.5.
7. qsnet2_level_test at level4, as described in Section 12.5.
As an alternative (and if it is supported by your configuration) you can run
the qsnetsoaktest with its duration time set to 10 minutes as described in
Section 12.6.
12.10 Performing Drain Time Testing under HP XC
Clusters running HP XC can run the qsnet2_drain_test script to perform drain
testing. This script runs the following tests in sequence:
• /usr/bin/qsctrl (to check environmental conditions)
• qsnet2_level_test level1
Maintenance and Diagnostic Procedures 12-19