Introduction to NonStop Operations Management

Overview of NonStop Operations Management
Tandem provides a number of tools to manage the production environment, including
tools for:
Monitoring systems, networks, and applications online
Automating operator procedures
Managing distributed systems from a central site
Managing networks, databases, and applications
For guidelines and suggestions on managing the production environment, refer to
Section 5, “Production Management.” For guidelines on managing applications, refer to
Section 11, “Application Management.” For guidelines on automating and centralizing
operations tasks, refer to Section 12, “Automating and Centralizing Operations.”
Problem Management
Problem management includes the tasks required to manage and administer the
problem environment. For example, some of the tasks included in this discipline are:
Ensuring system fault tolerance
Predicting and preventing problems
Detecting and analyzing problems
Documenting and reporting problems
Researching, diagnosing, and isolating problems
Escalating problems
Resolving problems and analyzing the cause
Recovering from problems as quickly as possible
Establishing problem prevention techniques
Tandem provides a number of tools for managing the problem environment and
recommends a systematic method for detecting, isolating, and recovering from
problems. For guidelines and suggestions on managing the problem environment, refer
to Section 6, “Problem Management.”
For comprehensive information about managing the problem environment and for
information about problem management tools provided by Tandem, refer to the
Availability Guide for Problem Management.
For guidelines on disaster planning, refer to Section 10, “Contingency Planning.”