Fabric OS Troubleshooting and Diagnostics Guide v6.4.0 (53-1001769-01, June 2010)

Switch boot issues
reboot
haFailover
fastBoot
firmwareDownload
The RRD feature activates and halts rebooting when unexpected reboot reasons appear
consecutively in the reboot history within a certain period of time. The length of that period
depends on the switch. The following are considered unexpected reboots:
Reset
A reset reboot may be caused by one of the following:
- Power-cycle of the switch or CP.
- Linux reboot command.
- Hardware watchdog timeout.
- Heartbeat loss related reboot.
Software Fault:Kernel Panic
- When the system detects an internal fatal error from which it cannot safely recover, it
generally outputs an error message to the console, dumps a stack trace for debugging,
and then performs an automatic reboot.
- After a kernel panic, the system may not have enough time to write the reboot reason,
which leaves the reboot reason empty. This is treated as an Unknown/reset case.
Software fault
- Software Fault:Software Watchdog
- Software Fault:ASSERT
Software recovery failure
This is an HA bootup related issue and happens when a switch is unable to recover to a stable
state. The HASM log contains more detailed and specific information on this type of failure,
such as one of the following:
- Failover recovery failed: Occurs when failover recovery fails and the CP has to be rebooted.
- Failover when standby CP unready: Occurs when the active CP has to fail over, but the
standby CP is not ready to take over mastership.
- Failover when LS trans incomplete: Occurs when a logical switch transaction is
incomplete.
Software bootup failure
This is an HA bootup related issue and happens when a switch is unable to load the firmware
to a usable state. The HASM log contains more detailed and specific information on this type
of failure, such as one of the following:
- System bring up timed out: The CP failed to come up within the time allotted.
- LS configuration timed out and failed: Logical switch configuration failed and timed out.
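The classification above can be illustrated with a minimal Python sketch. It is not the Fabric OS implementation. It assumes the reboot history is available as (timestamp, reason) pairs and that reason strings match the list above exactly. The `threshold` and `window` parameters are hypothetical, because the guide states only that the period is switch dependent.

```python
from datetime import datetime, timedelta

# Reason strings copied from the list above. Assumption: the reboot
# history records them in exactly this form.
UNEXPECTED_REASONS = {
    "Reset",
    "Software Fault:Kernel Panic",
    "Software Fault:Software Watchdog",
    "Software Fault:ASSERT",
    "Software recovery failure",
    "Software bootup failure",
}


def is_unexpected(reason):
    """Return True if a reboot reason counts as unexpected."""
    reason = reason.strip()
    # An empty reason (for example, after a kernel panic) is treated
    # as an Unknown/reset case, which is unexpected.
    if not reason:
        return True
    return reason in UNEXPECTED_REASONS


def rolling_reboot_detected(history, window, threshold):
    """Check the most recent entries of the reboot history.

    history   -- list of (datetime, reason) tuples, oldest first
    window    -- timedelta; hypothetical stand-in for the switch-dependent period
    threshold -- hypothetical number of consecutive unexpected reboots
    """
    if threshold < 1 or len(history) < threshold:
        return False
    recent = history[-threshold:]
    # Every one of the most recent reboots must be unexpected.
    if not all(is_unexpected(reason) for _, reason in recent):
        return False
    # They must also fall within the time window.
    return recent[-1][0] - recent[0][0] <= window


start = datetime(2010, 6, 1, 12, 0, 0)
sample_history = [
    (start, "Reset"),
    (start + timedelta(minutes=5), ""),
    (start + timedelta(minutes=10), "Software Fault:ASSERT"),
]
print(rolling_reboot_detected(sample_history, timedelta(minutes=30), 3))
```

In this sketch, a command-initiated entry such as firmwareDownload among the most recent entries breaks the run of unexpected reboots, so detection does not trigger.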
After RRD is activated, admin-level permission is required to log in and enter the supportShow
or supportSave command to collect a limited amount of data to resolve the issue.
ATTENTION
The limited supportSave used with the RRD feature does not support USB.