Re: [PATCH][ARM] Enable auto-vectorization for copysignf

2014-09-11 Thread Jiong Wang
On 11/09/14 14:55, Jiong Wang wrote:
On 11/09/14 14:43, Christophe Lyon wrote:
Hi Jiong,
On 9 September 2014 12:59, Ramana Radhakrishnan wrote:
On Mon, Aug 18, 2014 at 11:31 AM, Jiong Wang wrote:
this patch enable auto-vectorization for copysignf by using vector bit selection instruction o

Re: [PATCH][ARM] Enable auto-vectorization for copysignf

2014-09-11 Thread Jiong Wang
On 11/09/14 14:43, Christophe Lyon wrote:
Hi Jiong,
On 9 September 2014 12:59, Ramana Radhakrishnan wrote:
On Mon, Aug 18, 2014 at 11:31 AM, Jiong Wang wrote:
this patch enable auto-vectorization for copysignf by using vector bit selection instruction on arm32 when neon available.
I've no

Re: [PATCH][ARM] Enable auto-vectorization for copysignf

2014-09-11 Thread Christophe Lyon
Hi Jiong,

On 9 September 2014 12:59, Ramana Radhakrishnan wrote:
> On Mon, Aug 18, 2014 at 11:31 AM, Jiong Wang wrote:
>> this patch enable auto-vectorization for copysignf by using vector
>> bit selection instruction on arm32 when neon available.
>>

I've noticed that your new testcase fails (t

Re: [PATCH][ARM] Enable auto-vectorization for copysignf

2014-09-09 Thread Ramana Radhakrishnan
On Mon, Aug 18, 2014 at 11:31 AM, Jiong Wang wrote:
> this patch enable auto-vectorization for copysignf by using vector
> bit selection instruction on arm32 when neon available.
>
> for a simple testcase:
>
> for (i = 0; i < N; i++)
>   r[i] = __builtin_copysignf (a[i], b[i]);
>
>
> assuming

[PATCH][ARM] Enable auto-vectorization for copysignf

2014-08-18 Thread Jiong Wang
This patch enables auto-vectorization for copysignf by using the vector bit selection instruction on arm32 when NEON is available.

For a simple testcase:

  for (i = 0; i < N; i++)
    r[i] = __builtin_copysignf (a[i], b[i]);

Assuming a vectorization factor of 4, the generated instruction sequence is:
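
(The instruction sequence itself is cut off in this archive snippet and is not reproduced here.)

Separately, to illustrate why a bit selection instruction is enough to implement copysignf, here is a minimal scalar sketch in portable C. It is not the code from the patch and not the compiler's output. The names bit_select, copysignf_lanes and LANES are hypothetical, and the loop over four lanes only emulates what a single NEON bit-select (such as VBSL) would do on a four-element vector. It assumes IEEE 754 single-precision floats, where bit 31 is the sign bit.

```c
#include <stdint.h>
#include <string.h>

/* Assumed vectorization factor, matching the "4" mentioned in the post. */
#define LANES 4

/* Hypothetical helper: per-lane bit select.
   Takes bits from x where mask is 1 and from y where mask is 0. */
static uint32_t bit_select(uint32_t mask, uint32_t x, uint32_t y)
{
    return (mask & x) | (~mask & y);
}

/* Hypothetical helper: r[i] = copysignf(a[i], b[i]) for LANES elements.
   The sign bit comes from b[i]; exponent and mantissa come from a[i]. */
static void copysignf_lanes(float *r, const float *a, const float *b)
{
    const uint32_t sign_mask = UINT32_C(0x80000000);

    for (int i = 0; i < LANES; i++) {
        uint32_t ua, ub, ur;

        /* memcpy is the portable way to reinterpret float bits. */
        memcpy(&ua, &a[i], sizeof ua);
        memcpy(&ub, &b[i], sizeof ub);

        ur = bit_select(sign_mask, ub, ua);

        memcpy(&r[i], &ur, sizeof ur);
    }
}
```

With a = {1.0f, -2.0f, 3.5f, 0.0f} and b = {-1.0f, 4.0f, -0.0f, -7.0f}, calling copysignf_lanes(r, a, b) keeps each magnitude from a and takes each sign from b. On real hardware the mask would be a constant vector and the select a single instruction, which is the property the patch relies on.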