Open System Services Management and Operations Guide (G06.30+, H06.08+, J06.03+)

Monitoring OSS Resources
For systems running J06.07 and later J-series RVUs and H06.18 and later H-series RVUs, Event
Management Service (EMS) events can help you monitor OSS file-system resource usage. For each
type of resource:
Limit warning events occur when resource usage reaches or exceeds 85% of the resource limit and are
repeated hourly until resource usage drops to or below 65% of the resource limit.
Error events occur when a resource limit is reached or when an allocation error occurs.
These events are sent by various OSS processes under the OSS subsystem ID. For detailed
information about each of these events, see the Operator Messages Manual.
For detailed information about the OSS environment limits, see “Environment Limits” (page 411).
NOTE: You cannot configure these items:
Resource limit
The resource limits vary by the RVU that the system is running. For detailed information about
resource limits, see “Environment Limits” (page 411).
Reset time
The resource monitoring for a resource type is reset when the resource is initialized.
Warning threshold
When resource usage for a given resource reaches or exceeds the warning threshold, which
is 85% of the resource limit, a limit warning event is sent and then repeated once every hour
until usage drops to or below the safe threshold. Therefore, you might see a limit warning
message even though the current usage is below the warning threshold.
Safe threshold
When resource usage for a given resource type has reached or exceeded the warning threshold
at some time after the reset time but has not yet dropped to or below the safe threshold, which
is 65% of the resource limit, limit warning events are sent once every hour.
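The interaction between the warning threshold and the safe threshold can be illustrated with a minimal Python sketch. This is not the OSS implementation. The function name warning_schedule and the one-sample-per-hour model are assumptions made for illustration only. The sketch shows why a warning can still be sent while usage is between 65% and 85% of the limit.

```python
def warning_schedule(hourly_usage, limit, warn_pct=85, safe_pct=65):
    """Return one boolean per hourly sample: True if a limit warning would be sent.

    Hypothetical model: usage is sampled once per hour (an assumption,
    not documented behavior). Integer arithmetic avoids rounding issues.
    """
    warning_active = False
    sent = []
    for usage in hourly_usage:
        if usage * 100 >= warn_pct * limit:
            # Usage reached or exceeded the warning threshold.
            warning_active = True
        elif usage * 100 <= safe_pct * limit:
            # Usage dropped to or below the safe threshold.
            warning_active = False
        # Between the two thresholds the previous state is kept.
        sent.append(warning_active)
    return sent


# Usage rises to 85%, falls to 75%, then falls to exactly 65%.
schedule = warning_schedule([50000, 54400, 48000, 41600], limit=64000)
print(schedule)
```

In this model, the third sample (75% of the limit) still produces a warning because usage has not yet dropped to or below the safe threshold.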
Example
For example, a program such as a test script opens files until it reaches the system limit of 64,000
opens per processor and then closes the opened files. When the resource reaches the warning
threshold, which is 85% of the resource limit, a limit warning event like this is sent:
09-03-21 20:00:31 \node1.$BGP01 TANDEM.OSS.H04 000041
Resource ALL OPENS is approaching the
resource limit.
Resource limit: 64000
Warning threshold: 85
Current percentage: 85
Current usage: 54400
Peak percentage: 85
Peak usage: 54400
Peak time: 21MAR09 20:00
Reset time: 21MAR09 17:00
Safe threshold: 65
Each hour, if the current percentage has not dropped to or below the safe threshold, the limit
warning message is sent again.
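The usage values in this example follow directly from the percentages. The short Python sketch below works through the arithmetic. The helper name threshold_usage is hypothetical, and the use of integer division is an assumption; for a limit of 64,000 both results are exact, so rounding does not matter here.

```python
def threshold_usage(limit, pct):
    """Usage count that corresponds to pct percent of limit (hypothetical helper)."""
    # Integer division is an assumption; the results below are exact.
    return limit * pct // 100


LIMIT = 64000                              # opens per processor, from the example
warning_at = threshold_usage(LIMIT, 85)    # usage at the warning threshold
safe_at = threshold_usage(LIMIT, 65)       # usage at the safe threshold
print(warning_at, safe_at)                 # prints "54400 41600"
```

The first value matches the Current usage field in the sample warning event. The second is the usage at or below which the hourly warnings stop.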
When the resource usage reaches the limit, a limit error like this is sent:
09-03-21 21:15:24 \node1.$BGP01 *TANDEM.OSS.H04 000040
Limit error for ALL OPENS resource.
Resource limit: 64000
Current percentage: 100
Current usage: 64000
OSS PID: 17039382
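If event text in the layout shown above is captured to a file or log, the labeled fields can be extracted with a few lines of Python. The following is a sketch under that assumption only. It does not use any EMS programmatic interface, and the names FIELD, parse_event_fields, and SAMPLE are hypothetical.

```python
import re

# A field line is "Label: value", where the label starts with a letter.
# The timestamp header starts with a digit, so it is not matched.
FIELD = re.compile(r"^([A-Za-z][A-Za-z ]*):\s*(.+)$")


def parse_event_fields(text):
    """Return a dict of the 'Label: value' fields found in event text."""
    fields = {}
    for line in text.splitlines():
        match = FIELD.match(line.strip())
        if match:
            key, value = match.groups()
            # Store purely numeric values as integers, others as strings.
            fields[key] = int(value) if value.isdigit() else value
    return fields


SAMPLE = r"""09-03-21 21:15:24 \node1.$BGP01 *TANDEM.OSS.H04 000040
Limit error for ALL OPENS resource.
Resource limit: 64000
Current percentage: 100
Current usage: 64000
OSS PID: 17039382"""

fields = parse_event_fields(SAMPLE)
print(fields["Current usage"], fields["OSS PID"])
```

The same function also handles the limit warning layout, where values such as the peak time are kept as strings.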