Tandem Failure Data System (TFDS) Manual

Introduction to TFDS
HP Tandem Failure Data System (TFDS) Manual540122-003
1-2
Key Features
Key Features
Responds to both hardware-based and software-based processor failures.
Maintains a database of unique failure incidents.
Captures information only for new, unique failure incidents.
Avoids responding to repeated failure incidents.
Reloads and dumps processors automatically.
Collects key system files required for failure analysis.
Forwards collected information to service providers.
Architecture
Figure 1-1 shows how TFDS components work together with other system resources
to perform failure data collection and recovery tasks.
Figure 1-1. TFDS Organization and Architecture
SLICE
Key:
Last Update: October 20th, 2003
SCF
(Kernel
Subsystem)
TFDSCONF
Incident DB
Master
EMS Logs
TFDSCOM
NSK Utilities
(RCVDUMP
etc.)
Disk Dump
Tape
Dump
Snapshot
Server
TFDS Helper
($ztfnn)
Debug
Services
OS Millicode
eGarth
CPUnn
Old Client
New Client
TFDS RTL
Old API
TFDS RTL
New API
\NODE
TFDS
Monitor
($zdmp)
GCSC
External
Entities
TFDS
Components
CPU failure
__break or TFDS_TRIGGER_
TFDS_Break_xxx
TFDS instruments
Snapshot
File
Command
Interface
Initial
configuration
Process
control
CPU Dump
CPU Failure
Analysis
Incident DB
Detail
Incident DB
Control
Rediscovery
Rediscovery
XY
Data flows from X to Y
XY
X interacts with Y
X Y
X is a Library
Dynamically
Linked to Y
HP NED Dev
Fig. 3-2 TFDS on TNS/E - Detailed Context Diagram