Intel 64 and IA-32 Architectures Software Developers Manual Volume 3B, System Programming Guide Part 2

Table Of Contents
Vol. 3 A-7
PERFORMANCE-MONITORING EVENTS
12H 04H SIMD_INT_128.PACK Counts number of 128 bit SIMD
integer pack operations.
12H 08H SIMD_INT_128.UNPA
CK
Counts number of 128 bit SIMD
integer unpack operations.
12H 10H SIMD_INT_128.PACK
ED_LOGICAL
Counts number of 128 bit SIMD
integer logical operations.
12H 20H SIMD_INT_128.PACK
ED_ARITH
Counts number of 128 bit SIMD
integer arithmetic operations.
12H 40H SIMD_INT_128.SHUF
FLE_MOVE
Counts number of 128 bit SIMD
integer shuffle and move
operations.
13H 01H LOAD_DISPATCH.RS Counts number of loads dispatched
from the Reservation Station that
bypass the Memory Order Buffer.
13H 02H LOAD_DISPATCH.RS_
DELAYED
Counts the number of delayed RS
dispatches at the stage latch. If an
RS dispatch can not bypass to LB, it
has another chance to dispatch
from the one-cycle delayed staging
latch before it is written into the
LB.
13H 04H LOAD_DISPATCH.MO
B
Counts the number of loads
dispatched from the Reservation
Station to the Memory Order
Buffer.
13H 07H LOAD_DISPATCH.ANY Counts all loads dispatched from
the Reservation Station.
14H 01H ARITH.CYCLES_DIV_
BUSY
Counts the number of cycles the
divider is busy executing divide or
square root operations. The divide
can be integer, X87 or Streaming
SIMD Extensions (SSE). The square
root operation can be either X87 or
SSE.
Set 'edge =1,
invert=1,
cmask=1' to
count the
number of
divides.
Table A-2. Non-Architectural Performance Events In the Processor Core for Intel Core
i7 Processors
Event
Num.
Umask
Value
Event Mask
Mnemonic Description Comment