On Wed, Aug 5, 2020 at 3:30 PM Andrew Stubbs wrote:
>
> This patch improves SLP performance in combination with some patches I
> have in development to add multiple vector sizes to amdgcn.
>
> The problem is that amdgcn's preferred vector size has 64 lanes, and SLP
> does not support lane masking.
This patch improves SLP performance in combination with some patches I
have in development to add multiple vector sizes to amdgcn.
The problem is that amdgcn's preferred vector size has 64 lanes, and SLP
does not support lane masking. My patches will add smaller vector sizes
(32, 16, 8, 4, 2)