HP Caliper 5.3 User Guide (5900-1558, February 2011)

Metrics for Integrity Servers Dual-Core Itanium 2 and Itanium 9300 Quad-Core Processor Systems
The following CPU events are directly measured:
BACK_END_BUBBLE.ALL — The number of cycles when the back end of the pipeline was
stalled. This is the number of cycles lost (stall cycles) due to any of five possible events
(FPU/L1D, RSE, EXE, branch/exception, or the front end).
BE_EXE_BUBBLE.GRALL — The number of Full Pipe Bubbles in Main Pipe due to GR/GR or
GR/load dependency stalls. This is the number of cycles lost (stall cycles) due to GR/GR or
GR/load dependency.
BE_FLUSH_BUBBLE.ALL — The number of Full Pipe Bubbles in Main Pipe due to pipeline
flushes. This is the number of cycles lost (stall cycles) due to branch misprediction or
exception/interruption flush.
BE_L1D_FPU_BUBBLE.L1D — The number of Full Pipe Bubbles in Main Pipe due to L1D cache.
This is the number of cycles lost (stall cycles) due to L1D cache and L1/L2 DTLB.
CPU_OP_CYCLES.ALL — The number of elapsed CPU operating cycles.
When HyperThreading is on, this is the number of elapsed CPU operating cycles used by only
this process's hyperthread.
CPU_OP_CYCLES.ALL:all_threads=true
The number of elapsed CPU operating cycles used by both hyperthreads. Available only when
HyperThreading is on.
IA64_INST_RETIRED The number of retired IA-64 instructions. The count includes predicated
on and predicated off instructions and nops, but excludes hardware-inserted RSE operations.
NOPS_RETIRED — The number of retired nop.i, nop.m, or nop.b instructions. The count
excludes predicated off nop instructions.
PREDICATE_SQUASHED_RETIRED — The number of predicated off instructions retired.
THREAD_SWITCH_EVENTS.ALL — The number of hardware thread switches.
THREAD_SWITCH_EVENTS.L3MISS — The number of hardware thread switches due to L3
cache miss.
The following CPU metrics are derived (calculated) from the above CPU events:
% of Cycles lost due to stalls (lower is better) — The percentage of cycles lost due to all stalls.
% of Cycles lost due to stalls caused by L1D (L1D and L1/L2 DTLB) The percentage of cycles
lost due to L1D cache and L1/L2 DTLD.
% of Cycles lost due to GR/GR or GR/load dependency stalls — The percentage of cycles
lost due to GR/load or GR/GR dependency stalls.
% of Cycles lost due to branch misprediction or interruption flush stalls — The percentage of
cycles lost due to branch misprediction or interruption flush.
Effective instructions retired — The number of effective instructions retired excluding nop and
predicated off instructions.
Raw CPI (lower is better) — The cycles per instruction, including nop and predicated off
instructions.
Effective CPI (lower is better) — The cycles per effective instruction, excluding nop and
predicated off instructions.
Effective CPI during unstalled execution (lower is better) — The cycles per effective instruction,
excluding stall cycles, nop, and predicated off instructions.
198 Descriptions of Measurement Reports