RE: [PATCH v3] aarch64: Recognize vector permute patterns suitable for FMOV [PR100165]

2025-04-25 Thread quic_pzheng
> Richard Sandiford writes: > > I think this would also simplify the evpc detection, since the > > requirement for using AND is the same for big-endian and > > little-endian, namely that index I of the result must either come from > > index I of the nonzero vector or from any element of the zero v

Re: [PATCH v3] aarch64: Recognize vector permute patterns suitable for FMOV [PR100165]

2025-02-21 Thread Richard Sandiford
Richard Sandiford writes: > I think this would also simplify the evpc detection, since the requirement > for using AND is the same for big-endian and little-endian, namely that > index I of the result must either come from index I of the nonzero > vector or from any element of the zero vector. (W

Re: [PATCH v3] aarch64: Recognize vector permute patterns suitable for FMOV [PR100165]

2025-02-21 Thread Richard Sandiford
Pengxuan Zheng writes: > This patch optimizes certain vector permute expansion with the FMOV > instruction > when one of the input vectors is a vector of all zeros and the result of the > vector permute is as if the upper lane of the non-zero input vector is set to > zero and the lower lane remai

[PATCH v3] aarch64: Recognize vector permute patterns suitable for FMOV [PR100165]

2025-02-18 Thread Pengxuan Zheng
This patch optimizes certain vector permute expansion with the FMOV instruction when one of the input vectors is a vector of all zeros and the result of the vector permute is as if the upper lane of the non-zero input vector is set to zero and the lower lane remains unchanged. Note that the patch