Clement LONGEAC <[email protected]> writes:

> Source: clblast
> Version: 1.6.3-1
> Severity: normal
> X-Debbugs-Cc: [email protected]
>
> Dear Maintainer,
>
> *** Reporter, please consider answering these questions, where appropriate ***
>
> I am in an internship at Synchrotron-Soleil , my directors are Frederic-
> Emmanuel PICCA and Emmanuel FARHI. I implemented Rocm and Pocl autopkgtest for
> architecture amd64 and arm64. I implemented autopktests for rocm and pocl 
> using
> Opencl on the package clblaston local. The aim is to have an overview of code
> compatibility with various AMD graphics cards, the codes on all the AMD boards
> available for CI rocm for GPU and Pocl for CPU.
>
> I implemented the POCL and ROCm tests in the control file. After running
> sbuild, it turns out that the tests involving the CPU, i.e. POCL, pass.
> However, the tests for the GPU part had accuracy problems, mainly due to the
> nature of the tests. The reason why these tests don't pass is that the card
> doesn't support, or supports poorly, FP16 calculations on ROCm. These 
> libraries
> support these calculations, but this may not be enough to make them pass in
> terms of precision. We therefore need to increase the tolerance to see if this
> would be sufficient (rocBLAS, MIOpen and ROCm-Lib). This may be because the
> card has a kernel limitation. Apparently these are common problems with FP16 
> on
> ROCm, and intrinsic to the card you have. It's symptomatic if you don't have
> professional cards.
>
> Common problems with FP16 on ROCm, due to accuracy issues. CLBlast tests
> sometimes fail because tolerances are too tight for FP16. ROCm may use 
> implicit
> FP16 to FP32 conversions (for compatibility), which may explain the numerical
> differences. Incomplete hardware support may be the cause. Some cards (such as
> the RX 6400) support FP16, but with limited extensions (e.g. no FP16_MMA on
> RDNA 2 consumer). The card we have would have partial FP16 support. This would
> explain our problem and any other problems we might have in the future.

Dear Clement,

Thank you for your work and this bug report! I would highly recommend
you share this information with the CLBlast upstream [1]. They often
take a while to respond, but usually attend to important matters like
this in due time.

Also, I'd be very happy to integrate your ROCm-based tests into the
CLBlast package – could you provide them, for example as a git merge
request on Salsa [2]?

(I currently don't have access to any ROCm hardware, so it may be a
while until I get around to testing things myself.)


[1] https://github.com/CNugteren/CLBlast

[2] https://salsa.debian.org/gspr/clblast


 Best,
 Gard

Attachment: signature.asc
Description: PGP signature

Reply via email to