Re: [Patch] [GCN] Handle generic ISA names in libgomp's plugin-gcn.c

Andrew Stubbs Wed, 05 Feb 2025 04:18:35 -0800

On 05/02/2025 11:14, Tobias Burnus wrote:

The number of AMD GPUs is huge - and, unfortunately, every GPU device
is potentially slightly different, requiring different code generation
either in some dusty corner case or for standard code.
As for several GPUs identical code can run (either all or when disabling
some features), AMD introduced with LLVM 19 some gfx*-generic targets.
GCC added support for gfx10-3-generic and gfx11-generic with commit
r15-4550-g1bdeebe69b71bf in October 2024 (undocumented). GCC itselfalways supports all -march= targets, but a assembler supporting the archis required such that at user runtime and when building a multilib, aassembler (and linker) supporting the new features is required. (GCCuses LLVM' assembler (llvm-mc) and linker (lld), i.e. LLVM 19+ isrequired for gfx*-generic.] However, the required runtime code landed inROCm much later; namely, commit 0c18ff22 rocr: Generic ISA targetssupport (Oct 28, 2024) in https://github.com/ROCm/ROCR-Runtime It isbelieved that the next ROCm release contained this feature, which isROCm 6.3, released on Dec 3, 2024. The latest ROCm is 6.3.2 of Jan 28,2025. Still, adding gfx*generic increases the number of requiredmultilibs as it does not seem to be possible to link mixed code ofgeneric and specific GPU code. See https://llvm.org/docs/AMDGPUUsage.html#amdgpu-generic-processor-table for a list ofgfx*generic and supported gfx* devices and some generic restrictions dueto using multilib. While gfx11-generic and gfx10-3-generic include allGPUs of that generation, with no or few restrictions, GFX9 devices arerather different and, hence, gfx9-generic only covers a subset of thedevices. * * * This patch now enables support for gfx10-3-generic,gfx11-generic and (new!) gfx9-generic in libgomp, making it actuallyusable. In libgomp, GCC prints its own diagnostic if there is an ISAmismatch between the actual GPU and the compiled-for GPU. Hence, notonly ROCm but also GCC needs to know which GPUs are compatible - inorder to propose the -foffload-options=-march=gfx... to compile for.That diagnostic now also proposes to try compile for the specificgfx*generic besides compiling for the specific GPU. Reasoning: As thenumber of multilibs is limited, having only a gfx11-generic multilib, itmakes sense to propose -march=gfx11-generic besides, e.g., -march=gfx1103 especially when the gfx1103 multilib is unavailable - andvice versa. In case GCC thinks that the ISA is supported but (a too old)ROCm does not recognize it, the error is now inferior; however, somewording has been added to the generic error message, which might stillhelp. As there are a couple of GPUs, previously unsupported, that aresupported by ROCm with the same gfx*-generic as GPUs we support, itmakes sense to add those GPUs as well - both to handle them in libgomp'sgeneric diagnostic and to support them in general. Therefore, thefollowing GPUs are now supported in addition: gfx902, gfx904, gfx909,gfx1031, gfx1032, gfx1033, gfx1034, gfx1035, gfx1101, gfx1102, gfx1150,gfx1151, gfx1152, and gfx1153. However, the multilib config has not beentouched, hence, those 14 device types and gfx{9,10-3,11}-generic are notsupported by default. Currently, the following 9 GPUs are enabled bydefault:gfx900, gfx906, gfx908, gfx90a, gfx90c, gfx1030, gfx1036,gfx1100, andgfx1103.

I'm not too happy about adding a whole list of specific devices that wehave not tested. So far, whenever I have added a new device there havebeen meta-data oddities and such-like that needed to be tweaked.Admittedly, adding a new device to an existing generation has beeneasier, but still there have been unexpected issues.

Adding the generic architectures does make sense, assuming we can testthem, and seems like a much better way to support these devices, untilsomebody can add properly tested and tuned support for an individual device.

I also don't like adding knowledge of unsupported devices purely forimproving diagnostics. It's fine for the known-unsupported devices, butwait a month or so and there will be new unknown-unsupported devices,and the message degrades again. Worse, the new diagnostic can recommendtrying -march=<name> for devices which the compiler will recognize buthave never been tested, and probably don't have multilibs configured.

A better approach might be to pattern-match "gfx{9,10,11}" in the nameHSA gives you for the physical device and recommend generic-march=gfx{9,10,11}-generic in those cases?

* * *
For distros building with LLVM 19, I could imagine that adding thegfx10-3-generic and gfx11-generic (and possibly gfx9-generic) multilibscould make sense; whether gfx1030, gfx1036, gfx1100, andgfx1103 couldalready bedropped - or only later (once ROCm 3.6 is more widely deployed) is the agoodquestion. [My gut feeling is that a distro should wait until next year,given
that December 2024 is still very recent.]

* * *

Thus back to the attached patch, which does:

* Add gfx9-generic - and enable libgomp support for gfx10-3-generic
* Addgfx902, gfx904, gfx909, gfx1031, gfx1032, gfx1033, gfx1034,gfx1035, gfx1101, gfx1102, gfx1150, gfx1151, gfx1152, and gfx1153. *Update the install + invoke (-march=) documentation for it
The patch has loosely be tested - but I currently do not have a ROCm 6.3
available with a gfx*-generic supported device; hence, I don't know whether
it really works.


Thus, I would be happy if someone with a supported gfx{9,10-3,11}-generic
device - or a newly added non-generic gfx* could test whether it actually
works!


[I am about to get a ROCm 6.3.2 with a gfx906 device, possibly later also
for gfx900 and even later for gfx1100.]

Any comment, remark, suggestion?
OK for mainline, once someone has shown that any gfx*-generic actuallyworks?

I'm happy to add the new gfx9-generic, and improving the diagnostics isalways good, but I'm not convinced about making it look like we supportdevices we've never tested.


(Of course, if someone is able to test them, then that's different.)

Andrew

Re: [Patch] [GCN] Handle generic ISA names in libgomp's plugin-gcn.c

Reply via email to