Introduction to NonStop Operations Management
Overview of NonStop Operations Management
Introduction to NonStop Operations Management–125507
Tandem provides a number of tools to manage the production environment, including
tools for:
• Monitoring systems, networks, and applications online
• Automating operator procedures
• Managing distributed systems from a central site
• Managing networks, databases, and applications
For guidelines and suggestions on managing the production environment, refer to
Section 5, “Production Management.” For guidelines on managing applications, refer to
Section 11, “Application Management.” For guidelines on automating and centralizing
operations tasks, refer to Section 12, “Automating and Centralizing Operations.”
Problem Management
Problem management covers the tasks required to manage and administer the
problem environment. Tasks in this discipline include:
• Ensuring system fault tolerance
• Predicting and preventing problems
• Detecting and analyzing problems
• Documenting and reporting problems
• Researching, diagnosing, and isolating problems
• Escalating problems
• Resolving problems and analyzing the cause
• Recovering from problems as quickly as possible
• Establishing problem prevention techniques
Tandem provides a number of tools for managing the problem environment and
recommends a systematic method for detecting, isolating, and recovering from
problems. For guidelines and suggestions on managing the problem environment, refer
to Section 6, “Problem Management.”
For comprehensive information about managing the problem environment and for
information about problem management tools provided by Tandem, refer to the
Availability Guide for Problem Management.
For guidelines on disaster planning, refer to Section 10, “Contingency Planning.”