Introduction to NonStop Operations Management

Problem Management
Introduction to NonStop Operations Management125507
6-3
Providing Outage Prevention and Recovery Training
Providing Outage Prevention and Recovery Training
Providing outage prevention and recovery training can help the operations staff become
more aware of the concept and cost of outages, and promotes outage prevention habits.
Establishing an outage prevention and recovery training plan involves:
Assisting the Tandem Education Group (TEG) in determining your specific training
needs by sharing ideas about your education needs, completing education surveys,
and providing constructive evaluation following all training activities.
Encouraging management and planning staff to consider the education implications
of anticipated changes to your environment. Some of the changes affecting skill
levels include hardware changes, operating system upgrades, application changes,
implementation of new tools, and industry changes.
Establishing a function for a training manager whose responsibilities would be to
develop education programs for new-hires and continuing education for current
staff.
Establishing a learning environment on site for use of Independent Study Programs
(ISPs), Audio-Digital Technology (ADT), and Computer-Based Training (CBT)
classes.
Predicting and Preventing Problems
Because solving problems can mean the loss of availability, the best kind of problem
solving is problem prevention. There are substantial delays whenever a system
experiences a problem. It takes time to:
Recognize the problem
Log the problem and get someone to work on it (administrative delays)
Collect the necessary tools
Analyze the problem
Verify the cause
Fix the problem
Test and evaluate the fix
Put the system back into operation (recover from the problem)
By implementing problem prevention strategies, you can:
Predict potential problems
Prevent potential problems from becoming unplanned outages
Prepare for problems that may occur