HP Caliper User Guide Release 5.5 (5900-2351, August 2012)

CYC_BE_WB2_FLUSH.ANY The number of CPU cycles spent in WB2 (Write back)
flushing of instructions.
CYC_BE_IBD_STALL.ANY The number of CPU cycles spent in the IBD(instruction buffer
and dispersal) without issuing instructions.
CYC_BE_IBD_STALL.GR_LOAD This is the number of cycles lost (stall cycles) due to GR load
RAW or WAW dependency condition of the instruction.
CYC_BE_EXE_REPLAY.GR_LOAD_RAW This is the number of cycles lost (stall cycles) in replay due
to RAW hazard in an instruction's GR load.
CYC_BE_EXE_REPLAY.GR_LOAD_WAW This is the number of cycles lost (stall cycles) in replay due
to WAW hazard in an instruction's GR load.
CYC_BE_DET_REPLAY.GR_LOAD This is the number of cycles lost (stall cycles) in replay due
to memory loads of single cycle GR load instructions. The
loads do not hit the FLD (first level data cache) and must be
obtained from lower level caches or memory, leading to
extra cycles.
CYC_BE_DET_REPLAY.ANY This is the number of cycles lost in replay in the DET stage.
CYC_BE_EXE_REPLAY.ANY This is the number of cycles lost in replay in the EXE stage.
CYC_BE_WB2_REPLAY.ANY This is the number of cycles lost in replay in the WB2 stage.
% Unstalled execution (higher is
better)
Percentage of unstalled cycles with respect to total number
of elapsed CPU operating cycles.
% Cycles lost due to branch
misprediction or interruption flush
stalls (lower is better)
The percentage of cycles lost due to flushes in the WB2
stage with respect to the the total number of elapsed CPU
operating cycles.
% Cycles lost due to issue bubbles
(lower is better)
The percentage of cycles lost due to stalls in IBD stage with
respect to total number of elapsed CPU operating cycles.
% Cycles lost due to DET replay
(lower is better)
The percentage of cycles lost due to replays in DET stage
with respect to total number of elapsed CPU operating
cycles.
% Cycles lost due to EXE replay
(lower is better)
The percentage of cycles lost due to replays in EXE stage
with respect to total number of elapsed CPU operating
cycles.
% Cycles lost due to WB2 replay
(lower is better)
The percentage of cycles lost due to replays in WB2 stage
with respect to total number of elapsed CPU operating
cycles.
% of Cycles lost due to GR load
penalties
Percentage of cycles lost due to GR load dependency stalls
(stalls in IBD, RAW and WAW hazards in EXE and DET
replays) with respect to total number of elapsed CPU
operating cycles.
% Effective instructions retired The total number of instructions retired without considering
the nop's and instructions with predicate disabled.
% Raw CPI (lower is better) The cycles per instruction, including nop and predicated off
instructions.
% Raw IPC (higher is better) The instructions per cycle, including nop and predicated off
instructions.
% Effective CPI (lower is better) The cycles per effective instruction, excluding nop and
predicated off instructions.
% Effective IPC (higher is better) The effective instructions per cycle, excluding nop and
predicated off instructions.
% Core cycles due to this thread The percentage of available processor cycles that the
measured process consumed. The other processor cycles
were consumed by other processes running in the core's
other hyperthread or were lost to HyperThreading overhead.
cycles Measurement Report Description 189