Richard Biener <rguent...@suse.de> 于2024年6月6日周四 14:20写道: > > On Thu, 6 Jun 2024, YunQiang Su wrote: > > > Richard Biener <rguent...@suse.de> 于2024年5月28日周二 17:47写道: > > > > > > The following avoids accounting single-lane SLP to the discovery > > > limit. As the two testcases show this makes discovery fail, > > > unfortunately even not the same across targets. The following > > > should fix two FAILs for GCN as a side-effect. > > > > > > Bootstrapped and tested on x86_64-unknown-linux-gnu, pushed. > > > > > > PR tree-optimization/115254 > > > * tree-vect-slp.cc (vect_build_slp_tree): Only account > > > multi-lane SLP to limit. > > > > > > * gcc.dg/vect/slp-cond-2-big-array.c: Expect 4 times SLP. > > > * gcc.dg/vect/slp-cond-2.c: Likewise. > > > > With this patch, MIPS/MSA still has only 3 times SLP. > > I am digging the problem > > I bet it's an issue with missed permutes. f3() requires interleaving > of two VnQImode vectors. >
Thanks. This problem disappears when I try to implement vcond_mask.