Availability Guide for Problem Management

Preventing Unplanned Outages
Availability Guide for Problem Management125509
2-9
Availability of Super-Group Capabilities
Availability of Super-Group Capabilities
Make sure that operators, system managers, and others who may need super-user or
super-group capabilities have access to them. While a super-group user ID (255,n) is not
needed under normal conditions, it may be required to solve certain problems. Having
access to a super-group password is often the fastest—and sometimes the only way—to
solve a problem. The Introduction to NonStop Operations Management describes a
procedure for taking advantage of super-group capabilities while maintaining
appropriate security.
Disaster-Recovery Planning
Disaster-recovery or contingency planning is essential especially in companies where
day-to-day business activity is tied to a computer system. Section 8, “Planning for
Disasters,describes disaster prevention and recovery planning in detail.
Automated Recovery Procedures
Automating your recovery procedures ensures that they are carried out quickly,
efficiently, and consistently whenever problems occur. Section 6, “Automating
Operations and Recovery Procedures,” provides guidelines for automating recovery
procedures.
Where to Find More Information
The following table describes where to find information on predicting, preventing, and
preparing for problems that can cause outages:
Goal Where to Find Information in This Manual…
Predicting problems Monitoring Event Messages (Section 4)
Monitoring Objects (Section 5)
Preventing problems Automating Operations and Recovery Procedures (Section 6)
Auditing Systems for Fault Tolerance (Section 7)
Preparing for problems Planning for Disasters (Section 8)