Intel 64 and IA-32 Architectures Software Developers Manual Volume 3A, System Programming Guide, Part 1

ManualsBrandsIntel ManualsOtherIntel Pentium 4 Processor 2.80 GHz, 512K Cache, 533 MHz FSB

461

462

463

464

465

466

467

468

469

470

10-24 Vol. 3A

MEMORY CACHE CONTROL

10.5.6.1 Adaptive Mode

Adaptive mode facilitates L1 data cache sharing between logical processors. When

running in adaptive mode, the L1 data cache is shared across logical processors in

the same core if:

• CR3 control registers for logical processors sharing the cache are identical.

• The same paging mode is used by logical processors sharing the cache.

In this situation, the entire L1 data cache is available to each logical processor

(instead of being competitively shared).

If CR3 values are different for the logical processors sharing an L1 data cache or the

logical processors use different paging modes, processors compete for cache

resources. This reduces the effective size of the cache for each logical processor.

Aliasing of the cache is not allowed (which prevents data thrashing).

10.5.6.2 Shared Mode

In shared mode, the L1 data cache is competitively shared between logical proces-

sors. This is true even if the logical processors use identical CR3 registers and paging

modes.

In shared mode, linear addresses in the L1 data cache can be aliased, meaning that

one linear address in the cache can point to different physical locations. The mecha-

nism for resolving aliasing can lead to thrashing. For this reason,

IA32_MISC_ENABLE[bit 24] = 0 is the preferred configuration for Intel 64 and IA-32

processors that support Hyper-Threading Technology.

10.6 SELF-MODIFYING CODE

A write to a memory location in a code segment that is currently cached in the

processor causes the associated cache line (or lines) to be invalidated. This check is

based on the physical address of the instruction. In addition, the P6 family and

Pentium processors check whether a write to a code segment may modify an instruc-

tion that has been prefetched for execution. If the write affects a prefetched instruc-

tion, the prefetch queue is invalidated. This latter check is based on the linear

address of the instruction. For the Pentium 4 and Intel Xeon processors, a write or a

snoop of an instruction in a code segment, where the target instruction is already

decoded and resident in the trace cache, invalidates the entire trace cache. The latter

behavior means that programs that self-modify code can cause severe degradation

of performance when run on the Pentium 4 and Intel Xeon processors.

In practice, the check on linear addresses should not create compatibility problems

among IA-32 processors. Applications that include self-modifying code use the same

linear address for modifying and fetching the instruction. Systems software, such as

a debugger, that might possibly modify an instruction using a different linear address

than that used to fetch the instruction, will execute a serializing operation, such as a