On 16/08/11 10:28, Ira Rosen wrote:
> Hi,
> 
> This patch changes the default vector size for auto-vectorization on
> ARM NEON to 128 bits. This new version is a result of a discussion
> with Richard and Ramana.
> 
> wwwdocs changes will follow shortly.
> 
> Bootstrapped and tested on arm-linux-gnueabi. The testsuite changes
> were also checked on powerpc64-suse-linux and x86_64-suse-linux.
> 
> There is one new failure:
> gcc.c-torture/execute/mode-dependent-address.c fails with -O3
> -funroll-loops with this patch or with -mvectorize-with-neon-quad.
> Ramana has a patch to fix this:
> http://gcc.gnu.org/ml/gcc/2011-08/msg00284.html. I will hold off
> committing my patch until this issue is resolved.
> 
> OK for mainline?
> 
> Thanks,
> Ira
> 
> ChangeLog:
> 
>    * config/arm/arm.c (arm_preferred_simd_mode): Check
>    TARGET_NEON_VECTORIZE_DOUBLE instead of
>    TARGET_NEON_VECTORIZE_QUAD.
>    (arm_expand_sync): Likewise.
>    * config/arm/arm.opt (mvectorize-with-neon-quad): Make inverse
>    mask of mvectorize-with-neon-double.  Add RejectNegative.
>    (mvectorize-with-neon-double): New.
> 
> testsuite/ChangeLog:
> 
>    * lib/target-supports.exp (check_effective_target_vect_multiple_sizes):
>    New procedure.
>    (add_options_for_quad_vectors): Replace with ...
>    (add_options_for_double_vectors): ... this.
>    * gfortran.dg/vect/pr19049.f90: Expect more printings on targets that
>     support multiple vector sizes since the vectorizer attempts to
>     vectorize with both vector sizes.
>    * gcc.dg/vect/no-vfa-vect-79.c,
>     gcc.dg/vect/no-vfa-vect-102a.c, gcc.dg/vect/vect-outer-1a.c,
>     gcc.dg/vect/vect-outer-1b.c, gcc.dg/vect/vect-outer-2b.c,
>     gcc.dg/vect/vect-outer-3a.c, gcc.dg/vect/no-vfa-vect-37.c,
>     gcc.dg/vect/vect-outer-3b.c, gcc.dg/vect/no-vfa-vect-101.c,
>     gcc.dg/vect/no-vfa-vect-102.c, gcc.dg/vect/vect-reduc-dot-s8b.c,
>     gcc.dg/vect/vect-outer-1.c, gcc.dg/vect/vect-104.c: Likewise.
>    * gcc.dg/vect/vect-42.c: Run with 64 bit vectors if applicable.
>    * gcc.dg/vect/vect-multitypes-6.c, gcc.dg/vect/vect-52.c,
>    gcc.dg/vect/vect-54.c, gcc.dg/vect/vect-46.c, gcc.dg/vect/vect-48.c,
>    gcc.dg/vect/vect-96.c, gcc.dg/vect/vect-multitypes-3.c,
>    gcc.dg/vect/vect-40.c: Likewise.
>    * gcc.dg/vect/vect-outer-5.c: Remove quad-vectors option as
>    redundant.
>    * gcc.dg/vect/vect-109.c, gcc.dg/vect/vect-peel-1.c,
>    gcc.dg/vect/vect-peel-2.c, gcc.dg/vect/slp-25.c,
>    gcc.dg/vect/vect-multitypes-1.c, gcc.dg/vect/slp-3.c,
>    gcc.dg/vect/no-vfa-pr29145.c, gcc.dg/vect/vect-multitypes-4.c:
>    Likewise.
>    * gcc.dg/vect/vect-peel-4.c: Make ia global.
> 

Ok with the following change:

>  static unsigned int
>  arm_autovectorize_vector_sizes (void)
>  {
> -  return TARGET_NEON_VECTORIZE_QUAD ? 16 | 8 : 0;
> +  return TARGET_NEON_VECTORIZE_DOUBLE ? 0 : 16 | 8;
>  }


Please put parentheses round the expression to make the precedence explicit.
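
For illustration only, here is a minimal standalone sketch of the parenthesized form. In C the bitwise OR binds tighter than the conditional operator, so the parentheses do not change the result; they only make the grouping obvious to the reader. The function names and the vectorize_double parameter are hypothetical stand-ins for the hook and for TARGET_NEON_VECTORIZE_DOUBLE, not the real arm.c code.

```c
#include <assert.h>

/* Hypothetical stand-in for arm_autovectorize_vector_sizes.  The
   parameter replaces TARGET_NEON_VECTORIZE_DOUBLE so the sketch
   compiles outside GCC.  The result is a bitmask of vector sizes
   in bytes, or 0 when only the preferred size is to be used.  */
unsigned int
vector_sizes_parenthesized (int vectorize_double)
{
  return vectorize_double ? 0 : (16 | 8);
}

/* The unparenthesized form from the patch, kept for comparison.
   '|' has higher precedence than '?:', so this parses as
   vectorize_double ? 0 : (16 | 8).  */
unsigned int
vector_sizes_unparenthesized (int vectorize_double)
{
  return vectorize_double ? 0 : 16 | 8;
}

/* Check that both spellings agree for either setting.  */
void
check_vector_sizes (void)
{
  assert (vector_sizes_parenthesized (0) == vector_sizes_unparenthesized (0));
  assert (vector_sizes_parenthesized (1) == vector_sizes_unparenthesized (1));
}
```

Both forms yield the same value, so the requested change is purely about readability.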

R.
