HP XC System Software Installation Guide Version 4.0

Table 14-1 Diagnosing System Imaging Problems (continued)
Symptom: The network boot times out.
How To Diagnose: The system boots from local disk and runs nconfigure. You can verify this by checking messages written to the imaging.log file.
Possible Solution:
  Verify the DHCP settings and the status of the DHCP daemon.
  Verify network status and connections.
  Monitor the /var/log/dhcpd.log file for DHCPREQUEST messages from the client node MAC address.
  Check boot order and BIOS settings.
  Rerun imaging/booting operations with fewer nodes.

Symptom: A node configuration (nconfigure) operation fails while attempting to access the configuration and management database on the head node.
How To Diagnose: The system is placed in single-user mode.
Possible Solution:
  Ensure that the mysqld daemon is running on the head node.
  # service mysqld status
  Verify network connections.
  Boot fewer nodes in a single operation.

Symptom: An imaged node boots correctly, but the node hangs in the autoinstall script waiting for the first multicast operation.
How To Diagnose: Verify that the node has started imaging by looking for “imaging_started” messages in the rsyncd log file. Verify that no “finished” messages are in the imaging.log file.
Possible Solution:
  Ensure that startsys was used to image the nodes.
  Check for instances of flamethrower running on the head node.
  # ps -aef | fgrep flamethrower

Symptom: Multicast operation fails.
How To Diagnose: Verify that the imaging operation has failed by examining the imaging.log file and looking for multiple retries of flamethrower.
Possible Solution:
  Verify that the network is quiet. A very busy network can cause dropped multicast UDP packets.
  Try this:
  1. Stop the imaging operation.
  2. Verify that no flamethrower daemons are running.
  3. Open the /etc/systemimager/flamethrower.conf file.
  4. Comment out the line with FEC =
  5. Save the changes to the file and exit the text editor.
  6. Retry the imaging operation.
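
Two of the checks in Table 14-1 can be scripted: watching the DHCP log for DHCPREQUEST messages from one client MAC address, and commenting out the FEC = line in the flamethrower configuration file (steps 3 through 5 of the multicast solution). The following is a minimal sketch, not part of the HP XC software. The function names dhcp_requests_for_mac and comment_out_fec are hypothetical, and the sketch assumes that the FEC setting begins its line (optionally after white space) and that the MAC address appears on the same log line as DHCPREQUEST.

```shell
#!/bin/sh
# Sketch only: both helpers take file paths as arguments so that they can
# be tried on copies before they are used on the real files.

# Print the DHCPREQUEST lines that mention the given client MAC address.
# $1 = MAC address, $2 = DHCP log file.
# The MAC address is matched as a fixed string, ignoring case.
dhcp_requests_for_mac() {
    grep -i -F -- "$1" "$2" | grep -F 'DHCPREQUEST'
}

# Comment out every active "FEC =" line in a flamethrower configuration file.
# The original file is first copied to <file>.bak.
# $1 = configuration file.
comment_out_fec() {
    cp -p -- "$1" "$1.bak" &&
    sed 's/^\([[:space:]]*\)\(FEC[[:space:]]*=\)/\1#\2/' "$1.bak" > "$1"
}
```

On the head node, the helpers might be called as dhcp_requests_for_mac followed by the client MAC address and /var/log/dhcpd.log, and as comment_out_fec /etc/systemimager/flamethrower.conf. Stop the imaging operation and verify that no flamethrower daemons are running before you change the configuration file.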
14.3.1 /hptc_cluster File System Does Not Mount
A mount failure can occur when nodes are imaged, booted, and then attempt to NFS mount
the /hptc_cluster file system. The nodes receive a Permission denied error message from
the attempt to mount the /hptc_cluster file system.
Run the following commands on the head node to restart NFS, rerun the node configuration
scripts, and restart services on all other nodes:
# service nfs restart
# pdsh -a touch /var/hptc/nconfig.1st
# stopsys
# startsys
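
After the nodes restart, you might want to confirm that the file system is mounted. The following is a minimal sketch of such a check, not an HP XC command. The function name is_mounted is hypothetical, and the sketch assumes a Linux mount table in /proc/mounts format, where the second field of each line is the mount point.

```shell
#!/bin/sh
# Return success (0) if the given mount point is listed in a mount table.
# $1 = mount point
# $2 = mount table in /proc/mounts format (defaults to /proc/mounts)
is_mounted() {
    awk -v mp="$1" '$2 == mp { found = 1 } END { exit found ? 0 : 1 }' \
        "${2:-/proc/mounts}"
}
```

On a node, is_mounted /hptc_cluster returns zero if the mount point is listed and nonzero otherwise.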
14.3.2 Client Node or Nodes Do Not Network Boot
The following message displays on a client node if the per-node symbolic links to the elilo.efi
file are lost:
TSize..
PXE-E23: Client received TFTP error from server.
PXE-E98: Code: 1h File not found
Load of Netboot failed: Not Found