Intel 64 and IA-32 Architectures Software Developers Manual Volume 3B, System Programming Guide Part 2

Table Of Contents
A-8 Vol. 3
PERFORMANCE-MONITORING EVENTS
14H 02H ARITH.MUL Counts the number of multiply
operations executed. This includes
integer as well as floating point
multiply operations but excludes
DPPS mul and MPSAD.
17H 01H INST_QUEUE_WRITE
S
Counts the number of instructions
written into the instruction queue
every cycle.
18H 01H INST_DECODED.DEC0 Counts number of instructions that
require decoder 0 to be decoded.
Usually, this means that the
instruction maps to more than 1
uop
19H 01H TWO_UOP_INSTS_D
ECODED
An instruction that generates two
uops was decoded
1DH 01H HW_INT.RCV Number of interrupt received
1DH 02H HW_INT.CYCLES_MAS
KED
Number of cycles interrupt are
masked
1DH 04H HW_INT.CYCLES_PEN
DING_AND_MASKED
Number of cycles interrupts are
pending and masked
1EH 01H INST_QUEUE_WRITE
_CYCLES
This event counts the number of
cycles during which instructions are
written to the instruction queue.
Dividing this counter by the number
of instructions written to the
instruction queue
(INST_QUEUE_WRITES) yields the
average number of instructions
decoded each cycle. If this number
is less than four and the pipe stalls,
this indicates that the decoder is
failing to decode enough
instructions per cycle to sustain the
4-wide pipeline.
If SSE*
instructions that
are 6 bytes or
longer arrive one
after another,
then front end
throughput may
limit execution
speed. In such
case,
Table A-2. Non-Architectural Performance Events In the Processor Core for Intel Core
i7 Processors
Event
Num.
Umask
Value
Event Mask
Mnemonic Description Comment