HP XC System Software Installation Guide Version 4.0

Virtual hostname is lsfhost.localdomain
Comparing ncpus from Lsf lshosts to Slurm cpu count.
The Lsf and Slurm cpu count are NOT in sync.
The lshosts 'ncpus' value of 1560 differs from the cpu
total of 2040 calculated from the sinfo output.
Suggest running 'lshosts -w' manually and compare the ncpus
value with the output from sinfo
--- FAILED ---
Testing hosts_status ...
Running 'bhosts -w'.
Checking output from bhosts.
Running 'controllsf show' to determine virtual hostname.
Checking output from controllsf.
Virtual hostname is lsfhost.localdomain
Comparing MAX job slots from Lsf bhosts to Slurm cpu count.
The Lsf MAX job slots and Slurm cpu count are NOT in sync.
The bhosts 'MAX' value of 1560 differs from the cpu
total of 2040 calculated from the sinfo output.
Suggest running 'bhosts -w' manually and compare the MAX job
slots value with the output from sinfo.
--- FAILED ---
Follow this procedure to resolve the discrepancy in available CPU resources:
1. Restart the LIM daemon and update licensing information:
# lsadmin limrestart
2. Wait a few seconds and run the following command to confirm that the number of
CPUs is correct:
# lshosts -w
3. When the output of the lshosts command is correct, update LSF with static resources
(CPUs and memory) to match what SLURM is reporting:
# badmin reconfig
The ncpus value that LSF reports must match the total number of CPUs reported by SLURM.
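The comparison that the OVP makes can also be approximated by hand. The following is a minimal sketch, not part of the XC software: the function names are hypothetical, and it assumes that ncpus is the fifth column of the lshosts -w output and that sinfo -N -h -o "%c" prints one CPU count per node line. Verify both assumptions on your system before relying on the result; for example, a node that belongs to more than one partition might be listed more than once.

```shell
#!/bin/sh
# Hypothetical helpers for comparing the LSF and SLURM CPU counts by hand.

# Sum per-node CPU counts (one number per line) read from standard input.
sum_cpus() {
    awk '{ total += $1 } END { print total + 0 }'
}

# Print the ncpus value for one host from 'lshosts -w' output on standard input.
# Assumption: ncpus is the fifth whitespace-separated column.
lsf_ncpus() {
    awk -v host="$1" '$1 == host { print $5 }'
}

# Succeed (exit status 0) only when the two counts are equal.
cpus_in_sync() {
    [ "$1" -eq "$2" ]
}

# Example use on a running system (left commented out here):
#   slurm_total=$(sinfo -N -h -o "%c" | sum_cpus)
#   lsf_total=$(lshosts -w | lsf_ncpus lsfhost.localdomain)
#   cpus_in_sync "$lsf_total" "$slurm_total" && echo "in sync" || echo "NOT in sync"
```

The helpers read from standard input so that they can be checked against saved command output before they are used on a live system.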
14.5.1 OVP network_bidirectional Test Might Report False Error on HP Server Blades
The OVP network_bidirectional test might report a false failure at enclosure boundaries.
If these errors occur, rerun the OVP with the double verbose option (--verbose --verbose)
and compare the actual results with the mean result. If the difference is less than 5%, you can safely
ignore the errors.
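The 5% check can be done with a short calculation. The following is a minimal sketch with hypothetical function names; it assumes that "difference" means the absolute difference between one measured value and the mean, expressed as a percentage of the mean. The OVP itself does not provide these functions.

```shell
#!/bin/sh
# Hypothetical helpers for the 5% check.

# Print the percent difference between a measured value ($1) and the mean ($2).
pct_from_mean() {
    awk -v v="$1" -v m="$2" 'BEGIN {
        d = v - m
        if (d < 0) d = -d
        printf "%.2f\n", d / m * 100
    }'
}

# Succeed (exit status 0) when value $1 differs from mean $2 by less than $3 percent.
within_pct() {
    awk -v v="$1" -v m="$2" -v limit="$3" 'BEGIN {
        d = v - m
        if (d < 0) d = -d
        exit !(d / m * 100 < limit)
    }'
}

# Example, using the min and mean values from the sample summary in this section:
#   pct_from_mean 2077.79 2107.259747     # prints 1.40
#   within_pct 2077.79 2107.259747 5 && echo "safe to ignore"
```

With the sample values in this section, the minimum and maximum results differ from the mean by about 1.40% and 1.74%, so both are well inside the 5% limit even though they are more than 3 standard deviations from the mean.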
The following is an example of a false failure. Nodes ibblc64 and ibblc65 are on enclosure
boundaries.
Exchange results summary (all values in mBytes/sec):
min: 2077.790000
max: 2143.940000
median: 2107.490000
mean: 2107.259747
range: 66.150000
variance: 76.098854
std_dev: 8.723466
The following node pairs have values more than
3 standard deviations from the mean: