Intel 64 and IA-32 Architectures Software Developers Manual Volume 3B, System Programming Guide Part 2

Table Of Contents
A-88 Vol. 3
PERFORMANCE-MONITORING EVENTS
This event does not count:
floating-point computational operations
that cause traps or assists.
floating-point loads and stores.
When this event is captured with the
precise event mechanism, the collected
samples contain the address of the
instruction that was executed immediately
after the instruction that caused the
event.
C2H 01H UOPS_RETIRED.
LD_IND_BR
Fused load+op
or load+indirect
branch retired
This event counts the number of retired
micro-ops that fused a load with another
operation. This includes:
Fusion of a load and an arithmetic
operation, such as with the following
instruction: ADD EAX, [EBX] where the
content of the memory location
specified by EBX register is loaded,
added to EXA register, and the result is
stored in EAX.
Fusion of a load and a branch in an
indirect branch operation, such as with
the following instructions:
JMP [RDI+200]
•RET
Fusion decreases the number of micro-
ops in the processor pipeline. A high
value for this event count indicates that
the code is using the processor
resources effectively.
C2H 02H UOPS_RETIRED.
STD_STA
Fused store
address + data
retired
This event counts the number of store
address calculations that are fused with
store data emission into one micro-op.
Traditionally, each store operation
required two micro-ops.
This event counts fusion of retired micro-
ops only. Fusion decreases the number of
micro-ops in the processor pipeline. A high
value for this event count indicates that
the code is using the processor resources
effectively.
Table A-6. Non-Architectural Performance Events
in Processors Based on Intel Core Microarchitecture (Contd.)
Event
Num
Umask
Value Event Name Definition
Description and
Comment