Re: [PATCH v3] aarch64: Improve popcount for bytes [PR113042]

2024-08-15 Thread Richard Sandiford
Andrew Pinski writes: > For popcount for bytes, we don't need the reduction addition > after the vector cnt instruction as we are only counting one > byte's popcount. > This changes the popcount extend to cover all ALLI rather than GPI. > > Changes since v1: > * v2 - Use ALLI iterator and combine

[PATCH v3] aarch64: Improve popcount for bytes [PR113042]

2024-08-14 Thread Andrew Pinski
For popcount for bytes, we don't need the reduction addition after the vector cnt instruction as we are only counting one byte's popcount. This changes the popcount extend to cover all ALLI rather than GPI. Changes since v1: * v2 - Use ALLI iterator and combine all into one pattern. Add new