NonStop NS-Series Operations Guide (H06.12+)

3 Overview of Monitoring and Recovery
“When to Use This Section” (page 49)
“Functions of Monitoring” (page 49)
“Monitoring Tasks” (page 49)
“Working With a Daily Checklist” (page 50)
“Tools for Checking the Status of System Hardware ” (page 50)
Additional Monitoring Tasks” (page 53)
“Monitoring and Resolving Problems—An Approach” (page 53)
“Using OSM to Monitor the System” (page 53)
“Using the OSM Service Connection” (page 54)
“Recovery Operations for Problems Detected by OSM” (page 58)
“Monitoring Problem Incident Reports” (page 58)
“Using SCF to Monitor the System” (page 58)
“Determining Device States” (page 58)
Automating Routine System Monitoring” (page 61)
“Using the Status LEDs to Monitor the System ” (page 65)
“Related Reading” (page 67)
When to Use This Section
This section provides an overview of monitoring an Integrity NonStop server using various tools.
It describes some common monitoring tasks. It also refers you to other sections or manuals for
more information about monitoring specific system components, events, applications, or processes.
Functions of Monitoring
You must monitor a system to ensure that it is operating properly and to recognize when corrective
action is required. By monitoring a system, you can:
Verify whether components are currently up or down
Be quickly notified of error conditions, state changes, and threshold conditions that have
been exceeded or are reaching their limits
View a chronological list of events that can help with problem diagnosis and resolution
Determine how much of a particular resource is being used; for example, processor capacity,
disk or file space, or communications line bandwidth
Find performance problems that can affect the users of the system
Make better use of existing resources
Ensure that products such as HP NonStop SQL/MP, HP NonStop SQL/MX, HP NonStop
Transaction Management Facility (TMF), and Pathway are available
Prevent many problems and outages from occurring
Monitoring Tasks
Regardless of the shift you work, certain areas of your hardware and software environment need
to be checked on a regular basis. This subsection provides guidelines that will enable you to
determine the general areas you should monitor.
When to Use This Section 49