Availability Guide for Problem Management

Recovering From Unplanned Outages
Availability Guide for Problem Management125509
3-9
Gathering the Facts
Facts About the Circumstances
Who reported the problem and where can they be contacted?
What was the user doing when the problem occurred?
What events led up to the problem?
What information was displayed when the problem occurred?
What do event messages, error logs, and memory dumps reveal?
What has changed recently that might have caused the problem?
What is the current configuration of the hardware and software products affected
(including release levels and product numbers)?
What other relevant information can you provide about the problem?
Other Important Information
What is the probable cause?
Who should resolve the problem?
How was the problem resolved?
Was the problem escalated to a higher level of support?
Who resolved the problem and when?
Maintaining a Problem-Reporting Log
Use a problem-reporting log to record the facts about the problem as it is reported to
your operations support staff. A problem-reporting log can be either online or in hard-
copy format. It should allow your staff to gather the following information quickly and
efficiently.
Figure 3-2 illustrates a sample problem-reporting log and the kind of information that
should be recorded on it. The information on the problem-reporting log can be used by
the operator, operations analyst, and whoever else is involved in resolving the problem.