On Mon, Sep 25, 2017 at 21:03:39 +0300, Lluís Vilanova wrote:
> I know it's not exactly the same we're discussing, but the plot in [1]
> compares a few different ways to trace memory accesses on SPEC benchmarks:
>
> * First bar is using an Intel tool called PIN [2].
> * Second is calling into an instrumentation function on every executed memory
>   access in QEMU.
> * Third is embedding the hot path of writing the memory access info to an
>   array into the TCG opcode stream (more or less equivalent to supporting
>   filtering; when the array is full, a user's callback is called, which is
>   the cold path).
> * Fourth bar can be ignored.
>
> This was working on a much older version of instrumentation for QEMU, but I
> can implement something that does the first use-case point above and some
> filtering example (second use-case point) to see what the performance
> difference is.
>
> [1] https://filetea.me/n3wy9WwyCCZR72E9OWXHArHDw
Interesting! Unfortunately, this URL gives me a 404.
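
For anyone following along without the plot, the third approach quoted above (append to an array on the hot path, call the user's callback only when the array fills up) can be sketched in plain C. This is only an illustration of the buffering idea, not QEMU or TCG code: TraceBuf, trace_record, trace_flush and count_accesses are made-up names, and the capacity is kept tiny on purpose.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define TRACE_BUF_CAP 4 /* deliberately tiny; a real buffer would be much larger */

/* Hypothetical trace buffer: not part of any QEMU API. */
typedef struct {
    uint64_t addr[TRACE_BUF_CAP];  /* recorded guest addresses */
    size_t len;                    /* number of valid entries */
    void (*on_full)(const uint64_t *addrs, size_t n, void *opaque);
    void *opaque;                  /* passed through to the callback */
    size_t flushes;                /* how many times the cold path ran */
} TraceBuf;

/* Cold path: hand the buffered entries to the user's callback and reset. */
static void trace_flush(TraceBuf *tb)
{
    if (tb->len == 0) {
        return;
    }
    if (tb->on_full) {
        tb->on_full(tb->addr, tb->len, tb->opaque);
    }
    tb->flushes++;
    tb->len = 0;
}

/* Hot path: one store and one compare per access; no call unless full. */
static void trace_record(TraceBuf *tb, uint64_t addr)
{
    assert(tb->len < TRACE_BUF_CAP);
    tb->addr[tb->len++] = addr;
    if (tb->len == TRACE_BUF_CAP) {
        trace_flush(tb);
    }
}

/* Example user callback: just counts how many accesses it was given. */
static void count_accesses(const uint64_t *addrs, size_t n, void *opaque)
{
    (void)addrs;
    *(size_t *)opaque += n;
}
```

The second approach in the list corresponds to invoking the callback from trace_record on every single access; the sketch amortizes that call over TRACE_BUF_CAP accesses. A final trace_flush is needed at exit to deliver a partially filled buffer.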
E.