HP Caliper 5.3 User Guide (5900-1558, February 2011)

ManualsBrandsHP ManualsSoftwareHP-UX Caliper Software

191

192

193

194

195

196

197

198

199

200

Metrics for Integrity Servers Dual-Core Itanium 2 and Itanium 9300 Quad-Core Processor Systems

The following CPU events are directly measured:

• BACK_END_BUBBLE.ALL — The number of cycles when the back end of the pipeline was

stalled. This is the number of cycles lost (stall cycles) due to any of five possible events

(FPU/L1D, RSE, EXE, branch/exception, or the front end).

• BE_EXE_BUBBLE.GRALL — The number of Full Pipe Bubbles in Main Pipe due to GR/GR or

GR/load dependency stalls. This is the number of cycles lost (stall cycles) due to GR/GR or

GR/load dependency.

• BE_FLUSH_BUBBLE.ALL — The number of Full Pipe Bubbles in Main Pipe due to pipeline

flushes. This is the number of cycles lost (stall cycles) due to branch misprediction or

exception/interruption flush.

• BE_L1D_FPU_BUBBLE.L1D — The number of Full Pipe Bubbles in Main Pipe due to L1D cache.

This is the number of cycles lost (stall cycles) due to L1D cache and L1/L2 DTLB.

• CPU_OP_CYCLES.ALL — The number of elapsed CPU operating cycles.

When HyperThreading is on, this is the number of elapsed CPU operating cycles used by only

this process's hyperthread.

• CPU_OP_CYCLES.ALL:all_threads=true

The number of elapsed CPU operating cycles used by both hyperthreads. Available only when

HyperThreading is on.

• IA64_INST_RETIRED — The number of retired IA-64 instructions. The count includes predicated

on and predicated off instructions and nops, but excludes hardware-inserted RSE operations.

• NOPS_RETIRED — The number of retired nop.i, nop.m, or nop.b instructions. The count

excludes predicated off nop instructions.

• PREDICATE_SQUASHED_RETIRED — The number of predicated off instructions retired.

• THREAD_SWITCH_EVENTS.ALL — The number of hardware thread switches.

• THREAD_SWITCH_EVENTS.L3MISS — The number of hardware thread switches due to L3

cache miss.

The following CPU metrics are derived (calculated) from the above CPU events:

• % of Cycles lost due to stalls (lower is better) — The percentage of cycles lost due to all stalls.

• % of Cycles lost due to stalls caused by L1D (L1D and L1/L2 DTLB) — The percentage of cycles

lost due to L1D cache and L1/L2 DTLD.

• % of Cycles lost due to GR/GR or GR/load dependency stalls — The percentage of cycles

lost due to GR/load or GR/GR dependency stalls.

• % of Cycles lost due to branch misprediction or interruption flush stalls — The percentage of

cycles lost due to branch misprediction or interruption flush.

• Effective instructions retired — The number of effective instructions retired excluding nop and

predicated off instructions.

• Raw CPI (lower is better) — The cycles per instruction, including nop and predicated off

instructions.

• Effective CPI (lower is better) — The cycles per effective instruction, excluding nop and

predicated off instructions.

• Effective CPI during unstalled execution (lower is better) — The cycles per effective instruction,

excluding stall cycles, nop, and predicated off instructions.

198 Descriptions of Measurement Reports