HP Caliper User's Guide
Example B-2 Example of a dcache Report for a Superdome Integrity Server
Function Details
---------------------------------------------------------------------------------------------------
% Total Avg. ---Latency buckets as % Misses---
Dcache Sampled Dcache Dcache L2 --L3-- loc loc 1 2 1&2 Line|
Latency Dcache Latency Laten. c2c mem hop hop c2c Slot| >Statement|
Cycles Misses Cycles Cycles Col,Offset Instruction
7 14 64 150 350 650 800 >800
---------------------------------------------------------------------------------------------------
56.19 [caliper::ipack_encode_chain, 0x4000000000335900, ipack.c]
321018 9111771 28.3 23 12 11 8 12 14 10 10 ~4518 Function
Totals
-------------------------------------------------------------
[/CLO/Components/Caliper/Src/encode/ipack.c]
(262028) (8464760) (32.3) 20 22 10 8 9 14 6 11 278 > instr
= instr->next;
0 0 5,0x00e0:0 M_
adds r8=8,r35 ;;
262028 8464760 32.3 20 22 10 8 9 14 6 11 :1 M
ld8 r35=[r8]
0 0 :2 I
nop.i 0
5,0x00f0:0 M
nop.m 0
:1 M
nop.m 0
:2 B_
br.dptk {self}+0x920 ;;
Using the --dcache-data-profile Option to Produce a Data Summary
To get a Data Summary output, you need to use the --dcache-data-profile option.
The dcache measurement does not produce the Data Summary by default.
The Data Summary shows a histogram of data accessed by location. Each sample records
the data address that is accessed. If a data address belongs to a global variable, the
sample is attributed to the global variable. Otherwise, the data address is mapped to
a process region such as text, data, heap, stack, and so forth.
If a data address cannot be determined to belong to a global variable or a process region,
it is shown as an unknown sample. The data address is appended to the symbol name.
An example entry might be *unknown_0x9fffffffbf789378*, where the data
address 0x9fffffffbf789378 was a data cache miss, but it was not assigned to any
global variable or process region.
An example of Data Summary output is shown below:
Data Summary
---------------------------------------------------------------------------------------------------------------
% Total Avg. ---Latency buckets as % Misses--
Dcache Cumulat Sampled Dcache Dcache L2 --L3-- ------Memory-------
Latency % of Dcache Latency Laten.
Cycles Total Misses Cycles Cycles 7 14 64 150 250 350 450 > Data Entry
---------------------------------------------------------------------------------------------------------------
66.82 66.82 42 580 13.8 62 29 7 0 0 2 0 0 Heap
7.72 74.54 10 67 6.7 80 20 0 0 0 0 0 0 Memory mapped
shared library
5.65 80.18 5 49 9.8 40 60 0 0 0 0 0 0 Process Text Region
256 Descriptions of Measurement Reports