Intel 64 and IA-32 Architectures Software Developers Manual Volume 3B, System Programming Guide Part 2

Table Of Contents
A-60 Vol. 3
PERFORMANCE-MONITORING EVENTS
The load’s data size is one or two bytes
and it is not aligned to the store.
The load’s data size is of four or eight
bytes and the load is misaligned.
The load is from bytes written by the
preceding store, the store is misaligned
and the load is not aligned on the
beginning of the store.
The load is split over an eight byte
boundary (excluding 16-byte loads).
The load and store have the same
offset relative to the beginning of
different 4-KByte pages. This case is
also called 4-KByte aliasing.
In all these cases the load is blocked
until after the blocking store retires and
the stored data is committed to the
cache hierarchy.
03H 10H LOAD_BLOCK.
UNTIL_RETIRE
Loads blocked
until retirement
This event indicates that load operations
were blocked until retirement. The number
of events is greater or equal to the
number of load operations that were
blocked.
This includes mainly uncacheable loads
and split loads (loads that cross the cache
line boundary) but may include other cases
where loads are blocked until retirement.
Table A-6. Non-Architectural Performance Events
in Processors Based on Intel Core Microarchitecture (Contd.)
Event
Num
Umask
Value Event Name Definition
Description and
Comment