Availability Guide for Problem Management

Availability Guide for Problem Management125509
Index-1
Index
A
Alarm detection, threshold 5-6
Analysis, root-cause 3-23/3-24
Application design
fault tolerant design 2-2
Application messages
See Event messages
Applications
generating EMS event
messages 4-10/4-14
testing for graceful recovery 7-9
Architecture for remote support 1-7
ASO
See Automated systems operations
Audit for fault tolerance, performing 7-4
Automated systems operations (ASO) 6-1
Automating
batch jobs, using NetBatch 6-5, 6-6
memory dumps, using Tandem Failure
Data System (TFDS) 6-6
operations tasks
managing messages 6-2
monitoring objects 6-2
problem determination steps 6-4
recovery rules 6-5
repetitive tasks 6-4
Automation
definition of 6-1
examples
using TACL macros 6-6
using Tandem Failure Data System
(TFDS) 6-8
tools, Tandem 6-5/6-6
Avoiding a system freeze 7-7
B
Backup
batteries, monitoring 7-6
paths 7-5
sites 8-9/8-11
BACKUP command 8-3
Batch jobs, automating 6-5
Batteries, backup, monitoring 7-6
C
CA-Unicenter for NonStop Servers
event management 4-18
overview 9-3
Cold sites 8-9, 8-10
Command post for disaster recovery 8-6
Configuring
hardware
for fault tolerance 7-5
for stress 7-6
software for fault tolerance 7-8
Console
creating separate environments 4-16
Continuous operations 7-1
Cost of downtime 1-4
CPUDUMP command 3-17
Critical resources 5-7
monitoring 6-3/6-4
automating with Object Monitoring
Facility (OMF) 6-4
D
Damage assessment 8-6
Data
archiving 8-3
collection, using Measure 5-12