HP XC System Software Installation Guide Version 3.1

Table N-1 Diagnosing System Imaging Problems (continued)
Possible SolutionHow To DiagnoseSymptom
Ensure that startsys is was used to image the
nodes.
Check for instances of flamethrower running on
the head node.
# ps -aef | fgrep flamethrower
Verify that the node has started
imaging by looking for
“imaging_started” messages in the
rsyncd log file. Verify that no
“finished” messages are in the
imaging.log file.
An imaged node boots correctly, but
the node hangs in the autoinstall
script waiting for the first multicast
operation.
Verify that the network is quiet. A very busy
network can cause dropped multicast UDP
packets.
Try this:
1. Stop the imaging operation.
2. Verify that no flamethrower daemons are
running.
3. Open the
/etc/systemimager/flamethrower.conf
file.
4. Comment out the line with FEC =
5. Save the file.
6. Retry the imaging operation.
Verify that the imaging operation has
failed by examining the
imaging.log file and look for
multiple retries of flamethrower.
Multicast operation fails.
N.3.1 Monitor an Imaging Session
To monitor an imaging operation, use the tail -f command in another terminal window to view the
imaging log files.
It is possible to actually view an installation through the remote serial console, but to do so, you must edit
the /tftpboot/pxelinux.cfg/default file before the installation begins and add the correct serial
console device to the APPEND line. If this is done, disable the cmfd services and image a smaller group of
nodes sat any one time. The network traffic caused by the serial console can adversely affect the imaging
operation.
N.4 Troubleshoot Licenses
This procedure describes what to do if you did not have the XC.lic license file in place before you ran
the cluster_config utility, which means that LSF does not function.
1. Begin this procedure as the root user on the head node.
2. Use the file copy utility of your choice (such as scp) to copy the XC.lic file to the
/opt/hptc/etc/license directory on the head node.
3. Make sure the file permissions are set to only allow root to have read and write access:
# chmod 600 /opt/hptc/etc/license/XC.lic
# ls -l /opt/hptc/etc/license/XC.lic
-rw------- 1 root root 941 Dec 20 10:34 XC.lic
4. Stop and start the license daemon:
# service hptc-lm stop
# service hptc-lm start
5. Determine the node on which LSF is running:
# controllsf show current
LSF is currently running on n16 and assigned to node n16
6. Log in to the node where LSF is running.
7. Restart the LSF LIM daemon:
# pkill lim
#lsadmin limstartup
Checking configuration files ...
194 Troubleshooting