NonStop NS-Series Operations Guide (H06.12+)

When the operating system detects a software error that it cannot correct, it can execute a
freeze instruction to suspend all application and system processes running in the associated
processor. The status of the frozen processor becomes:
Freeze code = %nnnnnn
If system freeze is enabled, the status for all other freeze-enabled processors becomes:
Frozen by other processor
The Processor Halt Codes Manual documents processor halt codes.
NOTE: Do not freeze-enable a processor unless instructed to do so by your service provider.
OSM Alarms and Attribute Values
When the OSM Service Connection displays processor-related alarms or problem attributes, the
alarm and attribute details often indicate the appropriate recovery action. Recovery actions might
call for:
Trying the “Recovery Operations for Processors” (page 110).
Contacting your service provider.
Recovery Operations for Processors
The architecture of Integrity NonStop NS-series servers offers recovery options not available in
NonStop S-series. Because a logical processor has a physical processor element (PE) in each of
up to three NonStop Blade Elements, in some cases, you no longer have to choose between taking
the time to dump entire processors and skipping the dump to reload the system as quickly as
possible. In those cases, you can reload a halted processor immediately, while excluding the PE
for one Blade Element, then dump that excluded PE before reintegrating the Blade Element into
the running processor.
NOTE: The parts of this section that do not apply to Integrity NonStop NS1200 and NS1000
systems include all references to processor elements (PEs) the RELOAD command OMITBLADE
option. For more information on Integrity NonStop NS1200 and NS1000 systems, see Differences
Between Integrity NonStop NS-Series Systems in the appropriate Planning Guide or Installation
Manual.
Processor recovery operations for your NS-series system might include:
“Recovery Operations for a Processor Halt” (page 110)
“Halting One or More Processors” (page 111)
“Reloading a Single Processor on a Running Server” (page 111)
“Recovery Operations for a System Hang” (page 115)
“Enabling/Disabling Processor and System Freeze” (page 115)
“Freezing the System and Freeze-Enabled Processors” (page 115)
“Dumping a Processor to Disk” (page 116)
“Backing Up a Processor Dump to Tape” (page 118)
“Replacing Processor Memory” (page 118)
“Replacing the Processor Board and Processor Entity” (page 118)
“Submitting Information to Your Service Provider” (page 119)
Recovery Operations for a Processor Halt
HP Tandem Failure Data System (TFDS) should be used to proactively monitor processors and
manage processor halts. Configured and running before a halt occurs, TFDS can help determine
110 Processors and Components: Monitoring and Recovery