HP Caliper User Guide Release 5.5 (5900-2351, August 2012)

Example 6 Example of a dcache Report for a Superdome Integrity Server
Function Details
---------------------------------------------------------------------------------------------------
% Total Avg. ---Latency buckets as % Misses---
Dcache Sampled Dcache Dcache L2 --L3-- loc loc 1 2 1&2 Line|
Latency Dcache Latency Laten. c2c mem hop hop c2c Slot| >Statement|
Cycles Misses Cycles Cycles Col,Offset Instruction
7 14 64 150 350 650 800 >800
---------------------------------------------------------------------------------------------------
56.19 [caliper::ipack_encode_chain, 0x4000000000335900, ipack.c]
321018 9111771 28.3 23 12 11 8 12 14 10 10 ~4518 Function Totals
-------------------------------------------------------------
[/CLO/Components/Caliper/Src/encode/ipack.c]
(262028) (8464760) (32.3) 20 22 10 8 9 14 6 11 278 > instr = instr->next;
0 0 5,0x00e0:0 M_ adds
r8=8,r35 ;;
262028 8464760 32.3 20 22 10 8 9 14 6 11 :1 M ld8
r35=[r8]
0 0 :2 I nop.i 0
5,0x00f0:0 M nop.m 0
:1 M nop.m 0
:2 B_ br.dptk
{self}+0x920 ;;
Using the --dcache-data-profile Option to Produce a Data Summary
To get a Data Summary output, you need to use the --dcache-data-profile option. The
dcache measurement does not produce the Data Summary by default.
The Data Summary shows a histogram of data accessed by location. Each sample records the
data address that is accessed. If a data address belongs to a global variable, the sample is
attributed to the global variable. Otherwise, the data address is mapped to a process region such
as text, data, heap, stack, and so forth.
If a data address cannot be determined to belong to a global variable or a process region, it is
shown as an unknown sample. The data address is appended to the symbol name. An example
entry might be *unknown_0x9fffffffbf789378*, where the data address
0x9fffffffbf789378 was a data cache miss, but it was not assigned to any global variable
or process region.
An example of Data Summary output is shown below:
Data Summary
---------------------------------------------------------------------------------------------------------------
% Total Avg. ---Latency buckets as % Misses--
Dcache Cumulat Sampled Dcache Dcache L2 --L3-- ------Memory-------
Latency % of Dcache Latency Laten.
Cycles Total Misses Cycles Cycles 7 14 64 150 250 350 450 > Data Entry
---------------------------------------------------------------------------------------------------------------
66.82 66.82 42 580 13.8 62 29 7 0 0 2 0 0 Heap
7.72 74.54 10 67 6.7 80 20 0 0 0 0 0 0 Memory mapped shared library
5.65 80.18 5 49 9.8 40 60 0 0 0 0 0 0 Process Text Region
4.84 85.02 4 42 10.5 25 50 25 0 0 0 0 0 libc.so.1::_arena_rmutex
4.72 89.75 5 41 8.2 40 60 0 0 0 0 0 0 Process Data Region
The Data Entry column shows the global variable name, process region name, or unknown data
address.
The process regions are:
Process Text Region - the address space occupied by the process text/instructions
Process Data Region - the address space occupied by initialized data and uninitialized data
(.bss)
Heap - the address space where dynamically allocated memory resides
Data and Heap combined - when HP Caliper cannot discover the data and heap regions
separately
Process Stack Region - the user stack area
Shared mem - all the shared memory areas mapped to the process
dcache Measurement Report Description 197