Availability Guide for Problem Management

Availability Guide for Problem Management125509
4-1
4
Monitoring Event Messages
Overview
Subsystems and applications generate messages to report changes in their state.
Monitoring these messages is critical to getting the most out of your online environment:
event messages advise you about the health and status of your system. You need to
monitor event messages in a way that prevents important or critical messages from being
overlooked in a flow of predominantly noncritical, informational messages. Failure to
manage and monitor the flow of system and application event messages can lead to
unplanned outages that could have been avoided.
This section:
Defines system event message management and describes why it is important for
problem prediction, prevention, and detection
Lists the steps for getting control of system event messages
Defines application event message management and describes why it is important
for problem prediction, prevention, and detection
Lists the steps for getting control of application event messages
Describes the advantages of setting up separate monitoring environments for system
and application event messages
Note. This section deals with Event Management Service (EMS) event messages. A second
important category of messages that you should monitor is hardware messages. You can use
the Tandem Service Management package (TSM) to monitor Himalaya S-series hardware
messages. See the TSM online user guide and online help for more information.