Availability Guide for Problem Management
Automating Operations and Recovery Procedures
Availability Guide for Problem Management–125509
6-8
TFDS Automated Recovery
TFDS Automated Recovery
TFDS monitors processors and automatically initiates a processor dump if a failure
occurs. The failed processor is reloaded automatically, and the processor dumped is
analyzed with the incident database to determine whether the failure is the result of a
recurring or known defect. TFDS creates an incident database that tracks specific
problem occurrences.
If the failure is identified as a recurring or known defect, the dump file is removed from
the system, and the number of occurrences for the particular failure is tabulated. If the
failure cannot be identified as a recurring or known defect, the dump file, the TMDS and
EMS log files or both, and the CONFLIST file are automatically saved to tape, and a
new incident record is established in the database.
TFDS Configuration File Example
Following is an example of a TFDS configuration file, which is an edit file that contains
the parameters and values that specify how TFDS will perform automated recovery tasks
in your system environment. The functions of these parameters are described in Table 6-
2.
AUTOMATIC-BACKUP OFF
AUTOMATIC-PURGE ON
BACKUPTIMEOUT 60
CRUNCH-FILE $SYSTEM.SYS03.CRUNCHR
DB-LOCATION $SYSTEM.TFDS
DISABLED-CPUS 0,3
DUMP ON
DUMPVOLUME $ALPHA, 10, ALTERNATE VOLUMES
ALLOWED
EVALUATION ON
IGNORED-CPUS 1
NETFILEXFER ON \GRANDE.$DATA.DUMPRD
RELOAD ON
RETRY-DUMP 3, RELOAD-ON-FAILURE
RETRY-RELOAD 3
TAPE-UNIT $TAPE
SYSTEM-ID Bank of Timbuktu (System
#12354)