# Enhancement: Associating allocator call sites with type information ## Background
If it was possible to query the DWARF type of a heap-allocated object given its address, or list the addresses of all heap objects with a given DWARF type, that would be useful for various use cases: - Most obviously for heap profiling ("how many objects of type X are there, and what do their contents look like"). - For associating Intel PEBS / AMD IBS hardware performance events related to data caches with type information (answering questions like "for this struct type, how often is each member accessed, and which members are often cache-cold on access"). - For associating memory access traces with type information. - Maybe for adding type information to ASAN use-after-free reports. Some memory allocators already have support for recording from where the allocator was called for each object; either by saving the instruction address from which the allocator was called (with __builtin_return_address(0), for example in the Linux SLUB allocator with SLAB_STORE_USER), or by saving an entire stack trace (for example in ASAN). The missing link for associating a heap object with a DWARF type is a link between the callsite address and the DWARF type. LLVM IR and Microsoft CodeView debuginfo already have this feature; LLVM IR calls it "heapallocsite", CodeView calls it "S_HEAPALLOCSITE". I think it makes sense to choose a similar name in DWARF ("DW_AT_heapallocsite"). An alternative that more closely aligns with DWARF conventions might be something like "DW_AT_call_allocated_type". A compiler might sometimes only have unreliable indicators of the type of an allocation (like when malloc() is called with a size argument derived from sizeof() in a non-trivial way), so I think the compiler should be allowed to make a reasonable guess when no reliable type information is available. The attribute would primarily be used for calls to subroutines that allocate individual objects (like C++ `new` or C `malloc()`); but it might also be used, for example, when calling a subroutine that sets up a type-specialized allocator instance (like `__kmem_cache_create_args()` in Linux). I expect the attribute to mostly be attached to DW_TAG_call_site in practice, so that recording the return address in the allocator is sufficient. However, it could work similarly with DW_TAG_inlined_subroutine if the allocator records the current instruction pointer when it is inlined. If an allocator subroutine is expected to reliably record type information with this mechanism, the compiler will probably have to disable tail calls and, depending on implementation, also inlining for calls to the allocator subroutine. ## Overview I am proposing to add a new DWARF attribute DW_AT_heapallocsite that can be attached to a DW_TAG_call_site (or DW_TAG_inlined_subroutine) for calls to allocator subroutines (or subroutines with similar semantics). The value of the attribute is essentially the type allocated by the callee when called from the callsite. ## Proposed Changes In section "2.2 Attribute Types", Table 2.2 "Attribute names", add the following entry: Attribute: DW_AT_heapallocsite Identifies or Specifies: Type allocated at call site In section "7.5.4 Attribute Encodings", Table 7.5 "Attribute encodings", add the following entry: Attribute name: DW_AT_heapallocsite Value: <next available ID> Classes: reference At the end of section "3.4.1 Call Site Entries", append these paragraphs: "The call site may have a DW_AT_heapallocsite attribute referencing a debugging information entry for the type that the type-agnostic callee operates on when called from this call site. In particular, if the callee's primary purpose is to allocate memory, it refers to the type of the allocated object. The referenced type may be a reasonable guess if no reliable type information is available. <italic>This attribute should only be used when either the callee is a memory allocation subroutine or the programmer has requested its use at a specific call site or for calls to a specific subroutine. The meaning of this attribute, when attached to a subroutine whose primary purpose is not to allocate memory, is defined by the programmer.</italic>" In section "3.3.8.2 Concrete Instances", before the paragraph starting with "An inlined subroutine entry may have a DW_AT_const_expr attribute", insert this paragraph: "An inlined subroutine entry may also have a DW_AT_heapallocsite attribute referencing a debugging information entry for a type, which is interpreted in the same way as for a DW_TAG_call_site." In Appendix A ("Attributes by Tag Value (Informative)"), Table A.1 ("Attributes by tag value"), add applicable attribute DW_AT_heapallocsite to entries: - DW_TAG_call_site - DW_TAG_inlined_subroutine ## References clang C/C++ attribute documentation for the CodeView version of this: https://clang.llvm.org/docs/AttributeReference.html#allocator my proposed LLVM implementation of DWARF heapallocsite information: https://github.com/llvm/llvm-project/pull/132073 Microsoft documentation for their memory profiler built on S_HEAPALLOCSITE debuginfo: https://learn.microsoft.com/en-us/visualstudio/profiling/memory-usage?view=vs-2022 -- Dwarf-discuss mailing list Dwarf-discuss@lists.dwarfstd.org https://lists.dwarfstd.org/mailman/listinfo/dwarf-discuss