Availability Guide for Application Design

Instrumenting an Application for Availability
Availability Guide for Application Design525637-004
8-31
DSM Management Services
The counter values stored to disk can be read using Measure procedures by a
management application or by the Measure product itself. Statistics gathered by
Measure can be presented by the Surveyor or Enform products or further analyzed by
a tool such as the Guardian Performance Analyzer (GPA).
For full details on defining and using Measure counters, refer to the Measure Users
Guide.
DSM Management Services
Central to providing availability through DSM are the DSM management services. Of
specific interest for availability are the Event Management Service (EMS) and the DSM
Template Services.
Through collecting, filtering, and distributing event messages, EMS provides
management applications and operators with immediate notification of problems or
potential problems with your applications or HP subsystems.
DSM Template Services are important in providing human-readable messages. The
purpose of these services is to generate meaningful text from tokens.
Event Management Service (EMS)
The primary HP tool for event collection is the Event Management Service (EMS),
which is a set of processes that collects event messages from HP subsystems
(including operating system processes) and user-written subsystems. EMS selectively
distributes these event messages to various destinations, such as a local operator
console or a management application running on a remote system.
EMS event messages can be used to perform the following tasks:
Monitoring a running network or system
Managing operator tasks
Analyzing problems
Detecting potential problems in advance
Automating problem detection and recovery
In other words, the primary purpose of EMS is to help keep subsystems and
applications available. It does this through collecting, filtering, and distributing events
as described in the following overview. For full details of EMS, refer to the EMS
Manual.
How Does EMS Collect, Filter, and Distribute Event Messages?
There are two types of EMS processes that manage the flow of event messages from
the subsystem environment to the operations environment: event-message collectors
and event-message distributors. Collectors accept event messages from subsystems
and log them to a log file. Distributors filter event messages and return selected
messages to the operations environment.