Introduction to NonStop Operations Management
Problem Management
Introduction to NonStop Operations Management–125507
6-3
Providing Outage Prevention and Recovery Training
Providing Outage Prevention and Recovery Training
Providing outage prevention and recovery training can help the operations staff become
more aware of the concept and cost of outages, and promotes outage prevention habits.
Establishing an outage prevention and recovery training plan involves:
•
Assisting the Tandem Education Group (TEG) in determining your specific training
needs by sharing ideas about your education needs, completing education surveys,
and providing constructive evaluation following all training activities.
•
Encouraging management and planning staff to consider the education implications
of anticipated changes to your environment. Some of the changes affecting skill
levels include hardware changes, operating system upgrades, application changes,
implementation of new tools, and industry changes.
•
Establishing a function for a training manager whose responsibilities would be to
develop education programs for new-hires and continuing education for current
staff.
•
Establishing a learning environment on site for use of Independent Study Programs
(ISPs), Audio-Digital Technology (ADT), and Computer-Based Training (CBT)
classes.
Predicting and Preventing Problems
Because solving problems can mean the loss of availability, the best kind of problem
solving is problem prevention. There are substantial delays whenever a system
experiences a problem. It takes time to:
•
Recognize the problem
•
Log the problem and get someone to work on it (administrative delays)
•
Collect the necessary tools
•
Analyze the problem
•
Verify the cause
•
Fix the problem
•
Test and evaluate the fix
•
Put the system back into operation (recover from the problem)
By implementing problem prevention strategies, you can:
•
Predict potential problems
•
Prevent potential problems from becoming unplanned outages
•
Prepare for problems that may occur