Tandem Failure Data System (TFDS) Manual
Introduction to TFDS
HP Tandem Failure Data System (TFDS) Manual—540122-003
1-2
Key Features
Key Features
•
Responds to both hardware-based and software-based processor failures.
•
Maintains a database of unique failure incidents.
•
Captures information only for new, unique failure incidents.
•
Avoids responding to repeated failure incidents.
•
Reloads and dumps processors automatically.
•
Collects key system files required for failure analysis.
•
Forwards collected information to service providers.
Architecture
Figure 1-1 shows how TFDS components work together with other system resources
to perform failure data collection and recovery tasks.
Figure 1-1. TFDS Organization and Architecture
SLICE
Key:
Last Update: October 20th, 2003
SCF
(Kernel
Subsystem)
TFDSCONF
Incident DB
Master
EMS Logs
TFDSCOM
NSK Utilities
(RCVDUMP
etc.)
Disk Dump
Tape
Dump
Snapshot
Server
TFDS Helper
($ztfnn)
Debug
Services
OS Millicode
eGarth
CPUnn
Old Client
New Client
TFDS RTL
Old API
TFDS RTL
New API
\NODE
TFDS
Monitor
($zdmp)
GCSC
External
Entities
TFDS
Components
CPU failure
__break or TFDS_TRIGGER_
TFDS_Break_xxx
TFDS instruments
Snapshot
File
Command
Interface
Initial
configuration
Process
control
CPU Dump
CPU Failure
Analysis
Incident DB
Detail
Incident DB
Control
Rediscovery
Rediscovery
XY
Data flows from X to Y
XY
X interacts with Y
X Y
X is a Library
Dynamically
Linked to Y
HP NED Dev
Fig. 3-2 TFDS on TNS/E - Detailed Context Diagram










