Introduction to NonStop Operations Management

Problem Management
Problem Prevention Strategies
Application Design provides guidelines for designing applications for high
availability.
Ensure the availability of super-group (255, n) capabilities. While a super-group
logon is not needed under normal conditions, it may be required to solve certain
problems. Having access to a super-group password is sometimes the fastest way,
and occasionally the only way, to solve a problem. Section 9, “Security Management,”
describes a procedure to take advantage of super-group logon capabilities while
maintaining appropriate security and providing access to the system when it is
needed.
Prepare for environmental problems and disasters. Planning ahead can help you
prevent some environmental problems and disasters. Having a disaster recovery plan
can minimize the effect of those disasters you cannot prevent.
Section 3, “The Operations and Support Areas,” provides guidelines and
considerations for selecting and preparing the computer center location and
facilities. Section 10, “Contingency Planning,” describes disaster prevention and
recovery in detail. The Availability Guide for Problem Management also provides
detailed guidelines for preparing your operations environment for problems and
disasters.
Maintain accurate, up-to-date, and well-tested problem recovery procedures.
Section 5, “Production Management,” provides guidelines for establishing recovery
procedures.
Have a reserve Tandem system (or use a development system). Problem prevention
can be greatly enhanced by using a reserve system for:
Testing new Tandem software releases
Testing new application software releases
Testing operational procedures
Training new system operators
Running the application when the production system must be down to install a
new release or new configuration of the operating system
Maintain a well-trained operations and support staff. An inadequately trained
operations staff is one of the biggest vulnerabilities an operations group can face. A
well-trained staff is better able to respond to problems.