date:20150211

Re: [PATCH PR64705]Don't aggressively expand induction variable's base

2015-02-11 Thread Bin.Cheng

On Tue, Feb 10, 2015 at 12:24 AM, Richard Biener
 wrote:
> On February 9, 2015 11:09:49 AM CET, Bin Cheng  wrote:
>
> Did you extract a testcase for it?  Note that the IV step itself may be 
> expanded
> Too much.
>
>   I
>>looked into the regression and thought it was because of passes after
>>IVOPT.
>>The IVOPT dump is at least not worse than the original version.
>
> But different I presume.  So how did you figure it regressed?
The tree level dump is like,

---Without patch
+++ With patch
   :
   _618 = MAX_EXPR ;
   _619 = (integer(kind=8)) _618;
   _629 = (unsigned long) _618;

   ..

   :
   _257 = _619 + 1;
   _255 = (sizetype) _257;
   _261 = _255 * 8;
   _256 = (sizetype) _140;
-  _1174 = (sizetype) _618;
+ _1174 = (sizetype) _619;
   _1175 = _256 + _1174;
   _1176 = _1175 * 8;
   _1177 = _148 + _1176;
   ivtmp.3849_258 = (unsigned long) _1177;
   _1179 = (unsigned int) _104;

Previously, the computation of _1174 can be replaced by _629 in bb8 in
DOM2 pass, while it can't after patching.  This is the only possible
regression that I can see on TREE level.  There is another difference
but not regression on TREE.  Seems real change happens on RTL pre with
different register uses in the input.  I failed to go further or
extract a test case, it's just very volatile.

With all of this, I guess this patch shouldn't be blamed for this.  I
also wonder if the PR should be fixed in this way since the patch
definitely is a corner case.

Thanks,
bin
>
> Thanks,
> Richard.
>
>>Bootstrap and test on x86_64 and AArch64, so is it OK?
>>
>>2015-02-09  Bin Cheng  
>>
>>   PR tree-optimization/64705
>>   * tree-ssa-loop-niter.h (expand_simple_operations): New parameter.
>>   * tree-ssa-loop-niter.c (expand_simple_operations): New parameter.
>>   (tree_simplify_using_condition_1, refine_bounds_using_guard)
>>   (number_of_iterations_exit): Pass new argument to
>>   expand_simple_operations.
>>   * tree-ssa-loop-ivopts.c (extract_single_var_from_expr): New.
>>   (find_bivs, find_givs_in_stmt_scev): Pass new argument to
>>   expand_simple_operations.  Call extract_single_var_from_expr.
>>   (difference_cannot_overflow_p): Pass new argument to
>>   expand_simple_operations.
>>
>>gcc/testsuite/ChangeLog
>>2015-02-09  Bin Cheng  
>>
>>   PR tree-optimization/64705
>>   * gcc.dg/tree-ssa/pr64705.c: New test.
>
>

Re: [Haifa Scheduler] Fix latent bug in macro-fusion/instruction grouping

2015-02-11 Thread James Greenhalgh

On Mon, Feb 09, 2015 at 11:16:56PM +, Jeff Law wrote:
> On 02/06/15 05:24, James Greenhalgh wrote:
> >
> > ---
> > 2015-02-06  James Greenhalgh  
> >
> > * haifa-sched.c (recompute_todo_spec): After applying a
> > replacement and cancelling a dependency, also clear the
> > SCHED_GROUP_P flag.
> My worry here would be that we might be clearing a SCHED_GROUP_P that
> had been set for some reason other than macro-fusion.

Yeah, I also had this worry. This patch tackles the problem from the
other direction. If we see a SCHED_GROUP_P on an insn, treat it as a
hard dependency, and don't try to rewrite it. I think this will always
be "safe" but it might pessimize if the dependency breaker would have
resulted in better code generation.

I don't think this gives you anything towards fixing your bug, but
it clears mine.

I've bootstrapped and tested on x86_64-unknown-linux-gnu with no
issues and given it a quick check on the problem code from before,
where it has the desired impact.

Thanks,
James

---
2015-02-10  James Greenhalgh  

* haifa-sched.c (recompute_todo_spec): Treat SCHED_GROUP_P
as forcing a HARD_DEP between instructions, thereby
disallowing rewriting to break dependencies.
diff --git a/gcc/haifa-sched.c b/gcc/haifa-sched.c
index 75d2421..06444de 100644
--- a/gcc/haifa-sched.c
+++ b/gcc/haifa-sched.c
@@ -1233,6 +1233,11 @@ recompute_todo_spec (rtx_insn *next, bool for_backtrack)
   if (!sd_lists_empty_p (next, SD_LIST_HARD_BACK))
 return HARD_DEP;

+  /* If NEXT is intended to sit adjacent to this instruction, we don't
+ want to try to break any dependencies.  Treat it as a HARD_DEP.  */
+  if (SCHED_GROUP_P (next))
+return HARD_DEP;
+
   /* Now we've got NEXT with speculative deps only.
  1. Look at the deps to see what we have to do.
  2. Check if we can do 'todo'.  */

Re: [PATCH] Fix PR64764 && Fix scan-tree-dump for scop-19.c

2015-02-11 Thread Dominique d'Humières


> Le 10 févr. 2015 à 12:42, Tom de Vries  a écrit :
> 
> I think we need to understand first what's going on.

Sure, my patch was mainly to silence the failures on my working tree.

> In both test-cases, on Linux with -fpic the inlining of one function into the 
> other is not done, because we cannot be sure the function call in one 
> function binds to the other function at runtime.

I don’t understand why -fpic should change the inlining decision (except 
register pressure and so on).

> So, is the inlining happening with -fpic for Darwin? If so, is that correct?

The inlining is happening for darwin. I don’t know if that is correct or not. 
The test has been introduced at r143947 (Thu Feb 5 00:29:48 2009) and the only 
failures I have seen are due to r 220532 (checked back to r154691).

Dominique

> Thanks,
> - Tom

Re: [PATCH] Fix x86 #pragma GCC target and target attribute handling (PR target/61925)

2015-02-11 Thread Uros Bizjak

On Wed, Jan 28, 2015 at 9:05 PM, Jakub Jelinek  wrote:
> Hi!
>
> This patch rewrites the target pragma and target attribute handling in the
> i386 backend, so that outside of functions global_options and target globals
> reflect the currently active #pragma GCC target (if none active, obviously
> the default options) and inside of functions (in between set_cfun to that
> function and set_cfun to another function or NULL) the target state of the
> function.
>
> Without this patch, that state often was wherever last set_cfun left it
> (consider e.g. function with target attribute inside of #pragma GCC target
> region), and furthermore #pragma GCC target without any function definition
> before #pragma GCC pop_options confused it altogether, plus there has been
> confusion between when should be the default options active and when should
> be the currently active #pragma GCC target activated.
>
> The last hunk in i386.c also attempts to save some compile time cycles, I
> think it is pointless to add currently active #pragma GCC target as
> attributes to all the target builtins that are activated in the call,
> we certainly don't do that e.g. for target attribute.
>
> Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
>
> 2015-01-28  Jakub Jelinek  
>
> PR target/61925
> * config/i386/i386.c (ix86_reset_to_default_globals): Removed.
> (ix86_reset_previous_fndecl): Restore it here, unconditionally.
> (ix86_set_current_function): Rewritten.
> (ix86_add_new_builtins): Temporarily clear current_target_pragma
> when creating builtin fndecls.
>
> * gcc.target/i386/pr61925-1.c: New test.
> * gcc.target/i386/pr61925-2.c: New test.
> * gcc.target/i386/pr61925-3.c: New test.

LGTM as far as I can review this, so if there are no objections from
Honza, the patch is OK.

Thanks,
Uros.

Re: [PATCH/AARCH64] Fix 64893: ICE with vget_lane_u32 with C++ front-end at -O0

2015-02-11 Thread Marcus Shawcroft

On 8 February 2015 at 02:24, Andrew Pinski  wrote:

> Here is the updated patch with Jakub's comments included and added a
> testcase for the 0, 0 case.
>
> Thanks,
> Andrew Pinski
> ChangeLog:
>
> PR target/64893
> * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
> Change the first argument type to size_type_node and add another
> size_type_node.
> (aarch64_simd_expand_builtin): Handle the new argument to
> AARCH64_SIMD_BUILTIN_LANE_CHECK and don't ICE but rather
> print an out when the first two arguments are not
> nonzero integer constants.
> * config/aarch64/arm_neon.h (__AARCH64_LANE_CHECK):
> Pass the sizeof directly to __builtin_aarch64_im_lane_boundsi.
>
> testsuite/ChangeLog:
>
> * c-c++-common/torture/aarch64-vect-lane-1.c: New testcase.
> * c-c++-common/torture/aarch64-vect-lane-2.c: New testcase.

OK, thanks /Marcus

Re: [Patch Testsuite] XFAIL gfortran.dg/pr45636.f90 on AArch64

2015-02-11 Thread Marcus Shawcroft

On 10 February 2015 at 16:06, James Greenhalgh  wrote:
>
> Hi,
>
> As is already done for mips and hppa, we should XFAIL this test on
> AArch64 as we don't currently use the store_by_pieces infrastructure.
>
> We may in future want to tweak this, but for GCC 5.0 the safe thing
> to do is just to XFAIL the test.
>
> Tested with aarch64-none-elf.
>
> OK?

Ok, with Mike's comment addressed...  /Marcus

> Thanks,
> James
>
> ---
> 2015-02-10  James Greenhalgh  
>
> * gfortran.dg/pr45636.f90: XFAIL for aarch64* targets.

Re: [RFA][PATCH][PR rtl-optimization/47477] Type narrowing in match.pd

2015-02-11 Thread Richard Biener

On Tue, Feb 10, 2015 at 9:55 PM, Jeff Law  wrote:
>
> This PR was originally minor issue where we regressed on this kind of
> sequence:
>
> typedef struct toto_s *toto_t;
> toto_t add (toto_t a, toto_t b) {
>   int64_t tmp = (int64_t)(intptr_t)a + ((int64_t)(intptr_t)b&~1L);
>   return (toto_t)(intptr_t) tmp;
> }
>
>
> There was talk of trying to peephole this in the x86 backend.  But later
> Jakub speculated that if we had good type narrowing this could be done in
> the tree optimizers...
>
> S, here we go.  I didn't do anything with logicals are those are already
> handled elsewhere in match.pd.  I didn't try to handle MULT as in the early
> experiments I did, it was a lose because of the existing mechanisms for
> widening multiplications.
>
> Interestingly enough, this patch seems to help out libjava more than
> anything else in a GCC build and it really only helps a few routines. There
> weren't any routines I could see where the code regressed after this patch.
> This is probably an indicator that these things aren't *that* common, or the
> existing shortening code better than we thought, or some important
> shortening case is missing.
>
>
> I think we should pull the other tests from 47477 which are not regressions
> out into their own bug for future work.  Or alternately, when this fix is
> checked in remove the regression marker in 47477.
>
>
> Bootstrapped and regression tested on x86_64-unknown-linux-gnu.  OK for the
> trunk?
>
>
>
>
>
>
>
>
> diff --git a/gcc/ChangeLog b/gcc/ChangeLog
> index 7f3816c..7a95029 100644
> --- a/gcc/ChangeLog
> +++ b/gcc/ChangeLog
> @@ -1,3 +1,8 @@
> +2015-02-10  Jeff Law  
> +
> +   * match.pd (convert (plus/minus (convert @0) (convert @1): New
> +   simplifier to narrow arithmetic.
> +

Please reference PR47477 from here

>  2015-02-10  Richard Biener  
>
> PR tree-optimization/64909
> diff --git a/gcc/match.pd b/gcc/match.pd
> index 81c4ee6..abc703e 100644
> --- a/gcc/match.pd
> +++ b/gcc/match.pd
> @@ -1018,3 +1018,21 @@ along with GCC; see the file COPYING3.  If not see
> (logs (pows @0 @1))
> (mult @1 (logs @0)
>
> +/* If we have a narrowing conversion of an arithmetic operation where
> +   both operands are widening conversions from the same type as the outer
> +   narrowing conversion.  Then convert the innermost operands to a suitable
> +   unsigned type (to avoid introducing undefined behaviour), perform the
> +   operation and convert the result to the desired type.
> +
> +   This narrows the arithmetic operation.  */

This is also a part of what shorten_binary_op does, so please refer to
that in the comment so we can eventually complete this from there
and remove shorten_binary_op.

> +(for op (plus minus)
> +  (simplify
> +(convert (op (convert@2 @0) (convert @1)))
> +(if (TREE_TYPE (@0) == TREE_TYPE (@1)
> + && TREE_TYPE (@0) == type

Otherwhere in match.pd we use

(GIMPLE && types_compatible_p (TREE_TYPE (@0), TREE_TYPE (@1)))
 || (GENERIC && TREE_TYPE (@0) == TREE_TYPE (@1))

for type equality checks.  And I think even for GENERIC you want
to ignore qualifiers and thus use TYPE_MAIN_VARIANT (TREE_TYPE (@0))
== TYPE_MAIN_VARAINT (TREE_TYPE (@1)).

> + && INTEGRAL_TYPE_P (type)

So instead of testing TREE_TYPE (@0) == type we probably want
to just assert that TYPE_PRECISON (TREE_TYPE (@0)) == TYPE_PRECISION
(type) to allow sign-changes.  Say for

short i, j;
(unsigned short) (i + j)

we still want to narrow the widened i and j.

> + && TYPE_PRECISION (TREE_TYPE (@2)) > TYPE_PRECISION (TREE_TYPE
> (@0))
> +/* This prevents infinite recursion.  */
> +&& unsigned_type_for (TREE_TYPE (@0)) != TREE_TYPE (@2))

How can it recurse with the type precision constraint right above?
If you think it's necessary then nesting that condition below as

> +  (with { tree utype = unsigned_type_for (TREE_TYPE (@0)); }
 (if (utype != TREE_TYPE (@2))

avoids evaluating unsigned_type_for twice.

Note that technically we don't need to perform the operation in an unsigned
type iff TYPE_OVERFLOW_WRAPS (TREE_TYPE (@0)).  Thus

   (if (TYPE_OVERFLOW_WRAPS (TREE_TYPE (@0))
 (convert (op @0 @1)))
   (with { tree utype = unsigned_type_for (TREE_TYPE (@0)); }
 (convert (op (convert:utype @0) (convert:utype @1))

You can see that GCC exploits that with -fwrapv.  Maybe this
simplification should be split out into a separate pattern though,
tacking sign-changes for binary ops only.

Thanks,
Richard.

> +(convert (op (convert:utype @0) (convert:utype @1)))
> diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
> index 15d5e2d..76e5254 100644
> --- a/gcc/testsuite/ChangeLog
> +++ b/gcc/testsuite/ChangeLog
> @@ -1,3 +1,8 @@
> +2015-02-10  Jeff Law  
> +
> +   PR rtl-optimization/47477
> +   * gcc.dg/tree-ssa/narrow-arith-1.c: New test.
> +
>  2015-02-10  Richard Biener  
>
> PR tree-optimization/64909
> diff --git a

Re: [PATCH][AArch64] Fix illegal assembly 'eon v1, v2, v3'

2015-02-11 Thread Alan Lawrence

Here is a revised patch using '#'. Bootstrap + check-gcc natively on 
aarch64-none-linux-gnu.


My feeling is still against including the testcase because even when it passes 
it doesn't increase my confidence that the compiler is right very much (i.e. the 
insn could still be reading which_alternative at split time, but 
which_alternative just happened to have a useful value in it - as was happening 
in existing test gcc.target/aarch64/eon_1.c). I've recorded the testcase in 
PR/64997, however.


If the maintainers feel the testcase should be included, then I could prepare 
that as a separate patch?


gcc/ChangeLog:

PR target/64997
* config/aarch64/aarch64.md (*xor_one_cmpl3): Use FP_REGNUM_P
as split condition; force split via '#' in output pattern.

--Alan

James Greenhalgh wrote:

On Wed, Jan 28, 2015 at 02:04:04PM +, James Greenhalgh wrote:

On Wed, Jan 28, 2015 at 12:32:45PM +, Alan Lawrence wrote:

Ok for stage 4?

This is a regression from 4.9, so once we iron out some nits, it should
be.


gcc/ChangeLog:

* config/aarch64/aarch64.md (*xor_one_cmpl3): Use FP_REGNUM_P
as split condition.

And a testcase, please!


diff --git a/gcc/config/aarch64/aarch64.md b/gcc/config/aarch64/aarch64.md
index 
bc49fbe68a978b3ca069c6d084f542773df84bcb..d4b3f7b03ba0ab570cec5ce862e8c5f38f417ed1
 100644
--- a/gcc/config/aarch64/aarch64.md
+++ b/gcc/config/aarch64/aarch64.md
@@ -3054,7 +3054,7 @@
   (match_operand:GPI 2 "register_operand" "r,w"]
   ""
   "eon\\t%0, %1, %2" ;; For GPR registers (only).

This should be:
"@
 eon\\t%0, %1, %2
 #"

which would have forced a split.

Your patch is useful regardless, as I guess we could have ended up
needlessly splitting if we got unlucky with whatever had been left
in which_alternative.


Hi Alan,

Do you have any plans to respin this patch? I'd like to see it fixed
for GCC 5.0 if possible.

Thanks,
James

diff --git a/gcc/config/aarch64/aarch64.md b/gcc/config/aarch64/aarch64.md
index bc49fbe..9f08046 100644
--- a/gcc/config/aarch64/aarch64.md
+++ b/gcc/config/aarch64/aarch64.md
@@ -3053,8 +3053,10 @@
 (not:GPI (xor:GPI (match_operand:GPI 1 "register_operand" "r,?w")
   (match_operand:GPI 2 "register_operand" "r,w"]
   ""
-  "eon\\t%0, %1, %2" ;; For GPR registers (only).
-  "reload_completed && (which_alternative == 1)" ;; For SIMD registers.
+  "@
+  eon\\t%0, %1, %2
+  #"
+  "reload_completed && FP_REGNUM_P (REGNO (operands[0]))" ;; For SIMD registers.
   [(set (match_operand:GPI 0 "register_operand" "=w")
 (xor:GPI (match_operand:GPI 1 "register_operand" "w")
  (match_operand:GPI 2 "register_operand" "w")))

Re: [Patch, fortran] PR64932 [4.9/5 Regression] ICE in gfc_conv_descriptor_data_get for generated finalizer

2015-02-11 Thread Tobias Burnus

Dear Paul, dear all,

first, sorry for the belate review.

Paul Thomas wrote:
> This is a slight development of the patch posted on the PR itself.
>
> class.c(finalize_component) is not able to deal correctly with
> non-allocatable, derived type array components that have allocatable
> components. Rather than generating loops in finalize_component,  the
> condition is detected in trans-stmt.c(gfc_trans_deallocate) and
> gfc_deallocate_alloc_comp is called after obtaining the derived type
> for the array and checking that it is not finalizable.
> 
> Happily, this fix does not generate the error:
> Error: Two or more part references with nonzero rank must not be
> specified at (1)
> which occurs if the code is written explicitly.
> 
> Bootstraps and regtests on FC21/x86_64
> 
> OK for trunk and 4.9?

OK. I think the patch should be okay - even if the code is not really
beautiful. (Fault of our data representation not of the patch itself.)

Cheers,

Tobias

Re: [wwwdocs] Some stuff for porting to

2015-02-11 Thread Marek Polacek

On Tue, Feb 10, 2015 at 09:56:48PM +, Joseph Myers wrote:
> __FUNCTION__ and __func__ aren't macros (essentially they're built-in 
> variables).

Oops, I must have been thinking of __STDC_VERSION__ when writing that.
Fixed, thanks.

Index: porting_to.html
===
RCS file: /cvs/gcc/wwwdocs/htdocs/gcc-5/porting_to.html,v
retrieving revision 1.3
diff -u -r1.3 porting_to.html
--- porting_to.html 10 Feb 2015 11:12:20 -  1.3
+++ porting_to.html 11 Feb 2015 11:03:54 -
@@ -251,6 +251,99 @@
^
 
 
+__STDC_VERSION__ macro
+
+In the C11 mode, the __STDC_VERSION__ standard macro,
+introduced in C95, is now defined to 201112L.  Typically,
+this macro is used as in the following:
+
+
+  #if !defined __STDC_VERSION__ || __STDC_VERSION__ < 199901L
+/* ... */
+  #else
+  # include 
+  #endif
+
+
+You can check the macro using gcc -dM -E -std=gnu11 - < /dev/null 
| grep STDC_VER.
+
+Different meaning of the %a *scanf modifier
+
+The GNU C library supports dynamic allocation via the %a
+modifier.  But in C99, the %a modifier is a synonym for
+%f (float), so the compiler expects an argument of type
+float *.  This in combination with the -Wformat
+warning option may result in additional warnings:
+
+
+  #include 
+
+  int
+  main (void)
+  {
+char *s;
+scanf ("%as", &s);
+  }
+
+
+
+q.c:7:10: warning: format '%a' 
expects argument of type 'float *', but argument 2 has type 'char 
**' [-Wformat=]
+  scanf ("%as", &s);
+ ^
+
+
+The fix is to use the %m modifier instead, specified by
+POSIX.1-2008.
+
+New warnings
+
+Several new warnings have been added to the C front end.  One of the new
+warnings is that GCC now warns about non-standard predefined identifiers with
+the -Wpedantic option.  For instance:
+
+
+  void
+  foo (void)
+  {
+const char *s = __FUNCTION__;
+  }
+
+
+
+q.c:4:19: warning: ISO C does not support 
'__FUNCTION__' predefined identifier [-Wpedantic]
+  const char *s = __FUNCTION__;
+  ^
+
+
+The fix is either to use the standard predefined identifier 
__func__
+(since C99), or to use the __extension__ keyword:
+
+
+  const char *s = __extension__ __FUNCTION__;
+
+
+C++ language issues
+
+Converting std::nullptr_t to bool
+
+Converting std::nullptr_t to bool in the C++11
+mode now requires direct-initialization.  This has been changed in
+http://www.open-std.org/jtc1/sc22/wg21/docs/cwg_defects.html#1423";>DR 
1423.
+As a consequence, the following is invalid:
+
+
+  bool b = nullptr;
+
+
+but the following is valid:
+
+
+  bool b(nullptr);
+
+
+It is recommended to use true, resp. false keywords
+in such cases.
+
 Links
 
 

Marek

Re: [PATCH][libstdc++][Testsuite] isctype test fails for newlib.

2015-02-11 Thread Matthew Wahab


On 10/02/15 08:51, Matthew Wahab wrote:

On 09/02/15 23:17, Hans-Peter Nilsson wrote:

On Mon, 9 Feb 2015, Matthew Wahab wrote:

Attached a patch to replace the test for macro __NEWLIB__ with a test for
macro NEWLINE_IN_CLASS_BLANK, defined by
   { dg-additional-options "-DNEWLINE_IN_CLASS_BLANK" { target newlib } }

Tested with check-target-libstdc++-v3, with the modified tests, for
arm-none-eabi and aarch64-none-linux-gnu.

Does this look ok to commit?


Not valid for approval, but that's what I meant!
Modulo the typo forgetting to
s/__NEWLIB__/NEWLINE_IN_CLASS_BLANK/ in the ifdef in
wchar_t/isctype.cc of course.


Attached the fixed patch.
Tested by running check-target-libstdc++-v3, with the modified tests, 
for arm-none-eabi and aarch64-none-linux-gnu.


Ok to commit?
Matthew

libstdc++-v3/
2015-02-11  Matthew Wahab  

* testsuite/28_regex/traits/char/isctype.cc (test01): Replace test
for __NEWLIB__ macro with a dejagnu set macro.
* testsuite/28_regex/traits/wchar_t/isctype.cc (test01): Likewise.
diff --git a/libstdc++-v3/testsuite/28_regex/traits/char/isctype.cc b/libstdc++-v3/testsuite/28_regex/traits/char/isctype.cc
index 7c47045..b6fd6ff 100644
--- a/libstdc++-v3/testsuite/28_regex/traits/char/isctype.cc
+++ b/libstdc++-v3/testsuite/28_regex/traits/char/isctype.cc
@@ -1,5 +1,6 @@
 // { dg-do run }
 // { dg-options "-std=gnu++11" }
+// { dg-additional-options "-DNEWLINE_IN_CLASS_BLANK" { target newlib } }
 
 //
 // 2010-06-23  Stephen M. Webb 
@@ -53,8 +54,8 @@ test01()
   VERIFY(!t.isctype('_', t.lookup_classname(range(digit;
   VERIFY( t.isctype(' ', t.lookup_classname(range(blank;
   VERIFY( t.isctype('\t', t.lookup_classname(range(blank;
-#if defined (__NEWLIB__)
-  /* newlib includes '\n' in class 'blank'.
+#if defined (NEWLINE_IN_CLASS_BLANK)
+  /* On some targets, '\n' is in class 'blank'.
  See https://gcc.gnu.org/ml/gcc-patches/2015-02/msg00059.html.  */
   VERIFY( t.isctype('\n', t.lookup_classname(range(blank;
 #else
diff --git a/libstdc++-v3/testsuite/28_regex/traits/wchar_t/isctype.cc b/libstdc++-v3/testsuite/28_regex/traits/wchar_t/isctype.cc
index 1b3d69a..144d679 100644
--- a/libstdc++-v3/testsuite/28_regex/traits/wchar_t/isctype.cc
+++ b/libstdc++-v3/testsuite/28_regex/traits/wchar_t/isctype.cc
@@ -1,5 +1,6 @@
 // { dg-do run }
 // { dg-options "-std=gnu++11" }
+// { dg-additional-options "-DNEWLINE_IN_CLASS_BLANK" { target newlib } }
 
 // Copyright (C) 2010-2015 Free Software Foundation, Inc.
 //
@@ -50,8 +51,8 @@ test01()
   VERIFY(!t.isctype(L'_', t.lookup_classname(range(digit;
   VERIFY( t.isctype(L' ', t.lookup_classname(range(blank;
   VERIFY( t.isctype(L'\t', t.lookup_classname(range(blank;
-#if defined (__NEWLIB__)
-  /* newlib includes '\n' in class 'blank'.
+#if defined (NEWLINE_IN_CLASS_BLANK)
+  /* On some targets, '\n' is in class 'blank'.
  See https://gcc.gnu.org/ml/gcc-patches/2015-02/msg00059.html.  */
   VERIFY( t.isctype(L'\n', t.lookup_classname(range(blank;
 #else

Re: [RFC][PR target/39726 P4 regression] match.pd pattern to do type narrowing

2015-02-11 Thread Richard Biener

On Wed, Feb 11, 2015 at 7:43 AM, Jeff Law  wrote:
> On 02/03/15 05:23, Joseph Myers wrote:
>>
>>
>>> +&& TYPE_PRECISION (TREE_TYPE (@0)) == TYPE_PRECISION (TREE_TYPE
>>> (@1))
>>> +&& TYPE_UNSIGNED (TREE_TYPE (@0)) == TYPE_UNSIGNED (TREE_TYPE
>>> (@1))
>>> +&& TYPE_PRECISION (type) > TYPE_PRECISION (TREE_TYPE (@0)))
>>> +  (convert (bit_and (inner_op @0 @1) (convert @3))
>>
>>
>> I still don't think this is safe.  Suppose @0 and @1 are -128 in type
>> int8_t and @3 is 128 in a wider type and the operation is AND.  Then the
>> original expression has result 128.  But if you convert @3 to int8_t you
>> get -128 and this would result in -128 from the simplified expression.
>
> Looking at this again, @3 is being converted to the wrong type, plain and
> simple.  I should never write code while heavily medicated.  I suspect that
> combined with trying to work around the genmatch bug Richi just fixed sent
> me down a totally wrong path.
>
> I think the way to go is to always convert the inner operands to an unsigned
> type.  In fact, everything except the outer convert should be using an
> unsigned type of the same size/precision as @0's type.  The outer convert
> should, of course, be the final type.
>
> That avoids all the concerns about sign bit propagation from the narrow type
> into the wider final type.
>
> Application of this pattern (and the one I posted for 47477) is a concern
> for targets that don't do sub-word arithmetic/logicals.  But I just did a
> sniff test of one such target (v850-elf because it was handy) and I couldn't
> spot a change in the end code using both the 47477 patch and my WIP patch
> for this BZ.

The c-family frontends perform this kind of narrowing already anyway
(via the shorten_* stuff which is misplaced there and should be done
elsewhere for all frontends - thus in match.pd, thanks for starting that).

Richard.

> jeff
>

Re: [PATCH] PR 62173, re-shuffle insns for RTL loop invariant hoisting

2015-02-11 Thread Jiong Wang

2014-12-19 15:21 GMT+00:00 Kenneth Zadeck :
>
> however, since i am a nice person 
>
> loop-invariant solves the DF_UD_CHAIN which builds a data structure that
> connects each use with all of the defs that reach it.   I believe that this
> is the opposite of what you want here.
>
> if you really need this, you need to also turn on the DF_DU_CHAIN which
> builds the opposite structure.Both structures can be space hogs, but
> they are both turned on in other places in the compiler so it is unlikely to
> be an issue.

Exactly, Thanks, Kenneth.

This approach works from my experiment and look much better than
previous REG_NOTE approach.
while it do have one problem. We need the UD/DU chain built before we
do insn re-shuffling.
While after re-shuffling, UD chain needs update, otherwise, the later
"check_dependecies" in find_invariant_insn may fail.

although we have re-shuffle instruction 1 into 2, the later check
still using old UD info, thus
decide instruction 2 is not iv.

1: regA <- vfp + regB
2: regA <- vfp + const

my current fix is to insert those re-shuffled insn into a table named
"vfp_const_iv", then skip those
dependencies check  for them as they don't have any dependencies.

>
>
>
>>
> LOG_LINKs have nothing to do with single use; they point from the
> _first_
> use to its corresponding def.
>
> You might want to look at what fwprop does instead.

 Pass rtl fwprop uses df information in single-definition way, it
 doesn't really take into consideration if register is a single use.
 This often corrupts other optimizations like post-increment and
 load/store pair.  For example:

add r2, r1, r0
ldr rx, [r2]
add r2, r2, #4
 is transformed into below form:
add r2, r1, r0
ldr rx, [r1, r0]
add r2, r2, #4

 As a result, post-increment opportunity is corrupted, also definition
 of r2 can't be deleted because it's not single use.

 Thanks,
 bin
>>>
>>> thanks for all these suggestion.
>>>
>>> Have look at the LOG_LINK build function, a simple reverse scan, while
>>> needs to allocate big map array for all pseudo regs. Haven't looked at
>>> similar code in fwprop,
>>>
>>> actually, when found the first matched insn pattern, I just want to
>>> scan several insns next, then abort quickly if nothing meet
>>> requirement. there is no need to build full single-use information.
>>>
>>> still can anyone confirm that it is safe to re-use REG_DEAD info there
>>> without calling df_note_add_problem and df_analysis first? or I am
>>> using those info passed down from the previous pass which calculated
>>> these info and maybe broken?
>>
>> It's not safe to use REG_DEAD info without re-computing it.
>
> not sure that reg_dead is the right answer even if you did re-compute it.
> I believe you can have two parallel uses (on both sides of an if-then-else)
> for a single def (above the if then else) and have two REG_DEAD notes.
>
>> Richard.
>
>

Re: [PATCH PR64705]Don't aggressively expand induction variable's base

2015-02-11 Thread Richard Biener

On Wed, Feb 11, 2015 at 9:23 AM, Bin.Cheng  wrote:
> On Tue, Feb 10, 2015 at 12:24 AM, Richard Biener
>  wrote:
>> On February 9, 2015 11:09:49 AM CET, Bin Cheng  wrote:
>>
>> Did you extract a testcase for it?  Note that the IV step itself may be 
>> expanded
>> Too much.
>>
>>   I
>>>looked into the regression and thought it was because of passes after
>>>IVOPT.
>>>The IVOPT dump is at least not worse than the original version.
>>
>> But different I presume.  So how did you figure it regressed?
> The tree level dump is like,
>
> ---Without patch
> +++ With patch
>:
>_618 = MAX_EXPR ;
>_619 = (integer(kind=8)) _618;
>_629 = (unsigned long) _618;
>
>..
>
>:
>_257 = _619 + 1;
>_255 = (sizetype) _257;
>_261 = _255 * 8;
>_256 = (sizetype) _140;
> -  _1174 = (sizetype) _618;
> + _1174 = (sizetype) _619;
>_1175 = _256 + _1174;
>_1176 = _1175 * 8;
>_1177 = _148 + _1176;
>ivtmp.3849_258 = (unsigned long) _1177;
>_1179 = (unsigned int) _104;
>
> Previously, the computation of _1174 can be replaced by _629 in bb8 in
> DOM2 pass, while it can't after patching.  This is the only possible
> regression that I can see on TREE level.  There is another difference
> but not regression on TREE.  Seems real change happens on RTL pre with
> different register uses in the input.  I failed to go further or
> extract a test case, it's just very volatile.

Well, I can see what is happening and indeed we shouldn't blame the
patch for this.

> With all of this, I guess this patch shouldn't be blamed for this.  I
> also wonder if the PR should be fixed in this way since the patch
> definitely is a corner case.

It might not fix the real issue (whatever that is), but not making
IVOPTs (or tree-affines) life harder is very good (I believe I have
seen this kind of issue as well).

So I do think that the patch is fine.  Just seen the known-to-work GCC 3.4
version so it's even a regression 

Thanks,
Richard.

> Thanks,
> bin
>>
>> Thanks,
>> Richard.
>>
>>>Bootstrap and test on x86_64 and AArch64, so is it OK?
>>>
>>>2015-02-09  Bin Cheng  
>>>
>>>   PR tree-optimization/64705
>>>   * tree-ssa-loop-niter.h (expand_simple_operations): New parameter.
>>>   * tree-ssa-loop-niter.c (expand_simple_operations): New parameter.
>>>   (tree_simplify_using_condition_1, refine_bounds_using_guard)
>>>   (number_of_iterations_exit): Pass new argument to
>>>   expand_simple_operations.
>>>   * tree-ssa-loop-ivopts.c (extract_single_var_from_expr): New.
>>>   (find_bivs, find_givs_in_stmt_scev): Pass new argument to
>>>   expand_simple_operations.  Call extract_single_var_from_expr.
>>>   (difference_cannot_overflow_p): Pass new argument to
>>>   expand_simple_operations.
>>>
>>>gcc/testsuite/ChangeLog
>>>2015-02-09  Bin Cheng  
>>>
>>>   PR tree-optimization/64705
>>>   * gcc.dg/tree-ssa/pr64705.c: New test.
>>
>>

[PATCH][1/n] Fix PR65015

2015-02-11 Thread Richard Biener


The following drops tempfile names from the DW_AT_producer string
resulting from -fltrans-output-list and -fresolution.

Applied as obvious.

Richard.

2015-02-11  Richard Biener  

PR lto/65015
* dwarf2out.c (gen_producer_string): Drop -fltrans-output-list
and -fresolution.

Index: gcc/dwarf2out.c
===
*** gcc/dwarf2out.c (revision 220578)
--- gcc/dwarf2out.c (working copy)
*** gen_producer_string (void)
*** 19651,19656 
--- 19651,19658 
case OPT_nostdinc:
case OPT_nostdinc__:
case OPT_fpreprocessed:
+   case OPT_fltrans_output_list_:
+   case OPT_fresolution_:
/* Ignore these.  */
continue;
default:

[PATCH][2/n] Drop DW_AT_name for LTO produced unit DIEs

2015-02-11 Thread Richard Biener


Testing in progress.

Ok?

Thanks,
Richard.

2015-02-11  Richard Biener  

PR lto/65015
* dwarf2out.c (dwarf2out_finish): Do not emit DW_AT_name
on the comp unit DIE for LTO produced units.

Index: gcc/dwarf2out.c
===
--- gcc/dwarf2out.c (revision 220613)
+++ gcc/dwarf2out.c (working copy)
@@ -24521,8 +24521,11 @@ dwarf2out_finish (const char *filename)
   gen_remaining_tmpl_value_param_die_attribute ();
 
   /* Add the name for the main input file now.  We delayed this from
- dwarf2out_init to avoid complications with PCH.  */
-  add_name_attribute (comp_unit_die (), remap_debug_filename (filename));
+ dwarf2out_init to avoid complications with PCH.
+ Avoid doing this for LTO produced units as it adds random
+ tempfile names.  */
+  if (!in_lto_p)
+add_name_attribute (comp_unit_die (), remap_debug_filename (filename));
   if (!IS_ABSOLUTE_PATH (filename) || targetm.force_at_comp_dir)
 add_comp_dir_attribute (comp_unit_die ());
   else if (get_AT (comp_unit_die (), DW_AT_comp_dir) == NULL)

Re: [PATCH] Fix ubsan and C++14 constexpr ICEs (PR sanitizer/63956)

2015-02-11 Thread Marek Polacek

On Sun, Jan 25, 2015 at 12:07:46PM -0800, Mike Stump wrote:
> On Dec 1, 2014, at 2:52 AM, Marek Polacek  wrote:
> > On Sun, Nov 30, 2014 at 11:00:12PM -0500, Jason Merrill wrote:
> >> On 11/27/2014 08:57 AM, Marek Polacek wrote:
> >>> -/* { dg-error "is not a constant expression" "" { target c++ } 12 } */
> >>> +/* { dg-error "" "" { xfail { *-*-* } } 11 } */
> >> 
> >> Please keep the expected message.
> > 
> > Done in the below.
> > 
> > 2014-12-01  Marek Polacek  
> > 
> > PR sanitizer/63956
> > * ubsan.c (is_ubsan_builtin_p): Check also built-in class.
> > cp/
> > * constexpr.c: Include ubsan.h.
> > (cxx_eval_call_expression): Bail out for IFN_UBSAN_{NULL,BOUNDS}
> > internal functions and for ubsan builtins.
> > * error.c: Include internal-fn.h.
> > (dump_expr): Add printing of internal functions.
> > testsuite/
> > * c-c++-common/ubsan/shift-5.c: Add fails.
> 
> Do you see:
> 
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 11)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 11)
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 14)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 14)
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 17)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 17)
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 20)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 20)
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 34)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 34)
> PASS: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 37)
> XFAIL: c-c++-common/ubsan/shift-5.c   -O0   (test for errors, line 37)
> 
> on x86_64 on linux?
> 
> If so, this really isn’t cool.  You cannot have the same name as both pass 
> and fail.  At the heart of regression analysis is the notion that no test 
> that passed before now fails.  One can run contrib/compare_tests to see if a 
> patch one is working on has any regressions in it, the beauty of the script 
> is it will tell you in plain language if there are any regressions or not.  
> The standard for gcc is, no regressions.
> 
> Could you find a way to fix this?  Splitting into C and C++ test cases might 
> be one way.  Fixing any expected failures might be another.

Sorry for not replying sooner.

The following patch splits the test into C and C++ test cases, so
hopefully fixing the issue.  Ok for trunk?

2015-02-11  Marek Polacek  

* g++.dg/ubsan/shift-1.C: New test.
* gcc.dg/ubsan/c-shift-2.c: New test.
* c-c++-common/ubsan/shift-5.c: Remove file.

diff --git gcc/testsuite/g++.dg/ubsan/shift-1.C 
gcc/testsuite/g++.dg/ubsan/shift-1.C
index e69de29..8a71279 100644
--- gcc/testsuite/g++.dg/ubsan/shift-1.C
+++ gcc/testsuite/g++.dg/ubsan/shift-1.C
@@ -0,0 +1,37 @@
+/* { dg-do compile } */
+/* { dg-options "-fsanitize=shift -w" } */
+/* { dg-shouldfail "ubsan" } */
+
+int
+foo (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case 1 >> -1:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 11 } */
+case -1 >> -1:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 13 } */
+case 1 << -1:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 15 } */
+case -1 << -1:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 17 } */
+  return 1;
+}
+  return 0;
+}
+
+int
+bar (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case -1 >> 200:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 30 } */
+case 1 << 200:
+/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 32 } */
+  return 1;
+}
+  return 0;
+}
diff --git gcc/testsuite/gcc.dg/ubsan/c-shift-2.c 
gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
index e69de29..beb0dbe 100644
--- gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
+++ gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
@@ -0,0 +1,31 @@
+/* { dg-do compile } */
+/* { dg-options "-fsanitize=shift -w" } */
+/* { dg-shouldfail "ubsan" } */
+
+int
+foo (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case 1 >> -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case -1 >> -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case 1 << -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case -1 << -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+  return 1;
+}
+  return 0;
+}
+
+int
+bar (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case -1 >> 200: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case 1 << 200: /* { dg-error "case label does not reduce to an integer 
constant" } */
+  return 1;
+}
+

Re: [PATCH] Fix PR ipa/64813

2015-02-11 Thread Martin Liška


On 02/10/2015 05:00 PM, Jakub Jelinek wrote:

On Tue, Feb 10, 2015 at 04:56:40PM +0100, Martin Liška wrote:

On 02/10/2015 01:50 PM, Richard Biener wrote:

On Tue, Feb 10, 2015 at 12:07 PM, Martin Liška  wrote:

Hello.

Following patch is fix for PR ipa/64813. The patch was tested on a darwin
target
by Dominique.

Ready for trunk?


-  if (!(gimple_call_flags (call) & ECF_NORETURN))
+  if (!alias_is_noreturn)

that was technically unnecessary, right?  The call flags properly
return ECF_NORETURN already?

Ok if that is the case.


Hi.

You are right, !(gimple_call_flags (call) & ECF_NORETURN) returns a correct 
value.
Motivation for replacement is not to repeat the same condition, I hope the value
of alias_is_noreturn is correct in all uses.


And gimple_call_flags (call) & ECF_NORETURN doesn't?  I mean, if you could
initialize alias_is_noreturn to that, it would be nicer.

Jakub



Hello.

There are actually 3 places I need to guard if a function non-return.
First place is responsible for result type creation:

  /* Build call to the function being thunked.  */
  if (!VOID_TYPE_P (restype) && !alias_is_noreturn)

Where I can create guard just based on TREE_THIS_VOLATILE of the called 
function.
Once the new gimple call is created I can use:

(gimple_call_flags (call) & ECF_NORETURN

So that's the reason I create one bool value (alias_is_noreturn) and the very
beginning of expand_thunk method. Dost it make sense?

Thanks,
Martin

Re: [patch, libgfortran] Bug 57822 - I/O: "(g0)" wrongly prints "E+0000"

2015-02-11 Thread Jakub Jelinek

On Sun, Feb 08, 2015 at 05:10:44PM -0800, Jerry DeLisle wrote:
> The attached patch fixes this by checking for the case when we are doing g0
> editing and the exponent is 0.
> 
> Regression tested on X86-64.  For the larger kinds, we are on a different
> code path out of necessity, so we need to address this corner case.
> 
> I will commit in a day or two as simple/obvious, with a Changelog for the
> testsuite as well.

The test fails on most of the architectures.
Not all targets support _10 or _16.

Jakub

Re: Re: [PATCH 7/8] Model cache auto-prefetcher in scheduler

2015-02-11 Thread Jiong Wang



On 20/01/15 13:26, Maxim Kuvyrkov wrote:

On Jan 20, 2015, at 1:24 PM, Richard Earnshaw  wrote:
...

In general, how should someone tuning the compiler for this parameter
select a value that isn't one of (-1, m_i_q_d+1)?

 From my experiments it seems there are 4 reasonable values for the parameter: 
(-1) autopref turned off, (0) turned on in rank_for_schedule, (m_i_q_d+1) 
turned on everywhere.  If there is a static constructor generated for tune 
tables and it is a problem to have it -- I can shrink acceptable values to 
these 3 and call it a day.


You only mention 3 values: what was the fourth?

Typo.  No fourth.


It might be better then
to define a set of values that represent each of these cases and only
allow the tuning parameters to select one of those.  The init code then
uses that set to select how to set up the various parameters to meet
those goals.

So something like

ARM_SCHED_AUTOPREF_OFF
ARM_SCHED_AUTOPREF_RANK
ARM_SCHED_AUTOPREF_FULL

A patch is attached.  I bootstrapped it on arm-linux-gnueabihf.  OK to apply?

bootstrap failure on chromebook, reproduced on two chromebook.

see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65020 please.




--
Maxim Kuvyrkov
www.linaro.org

Re: [patch, libgfortran] Bug 57822 - I/O: "(g0)" wrongly prints "E+0000"

2015-02-11 Thread Jakub Jelinek

On Wed, Feb 11, 2015 at 02:04:09PM +0100, Jakub Jelinek wrote:
> On Sun, Feb 08, 2015 at 05:10:44PM -0800, Jerry DeLisle wrote:
> > The attached patch fixes this by checking for the case when we are doing g0
> > editing and the exponent is 0.
> > 
> > Regression tested on X86-64.  For the larger kinds, we are on a different
> > code path out of necessity, so we need to address this corner case.
> > 
> > I will commit in a day or two as simple/obvious, with a Changelog for the
> > testsuite as well.
> 
> The test fails on most of the architectures.
> Not all targets support _10 or _16.

For _16 you'd need to guard the test with fortran_real_16 (perhaps split
that part into a separate test), for _10 dunno.

Jakub

Re: [PATCH][2/n] Drop DW_AT_name for LTO produced unit DIEs

2015-02-11 Thread Richard Biener

On Wed, 11 Feb 2015, Richard Biener wrote:

> 
> Testing in progress.
> 
> Ok?

So it was said that retaining something like  is better.
Thus the following.

LTO bootstrapped & tested on x86_64-unknown-linux-gnu - ok?

Thanks,
Richard.

2015-02-11  Richard Biener  

PR lto/65015
* dwarf2out.c (dwarf2out_finish): Use  as DW_AT_name
for LTO produced CUs.

Index: gcc/dwarf2out.c
===
--- gcc/dwarf2out.c (revision 220613)
+++ gcc/dwarf2out.c (working copy)
@@ -24521,8 +24521,13 @@ dwarf2out_finish (const char *filename)
   gen_remaining_tmpl_value_param_die_attribute ();
 
   /* Add the name for the main input file now.  We delayed this from
- dwarf2out_init to avoid complications with PCH.  */
-  add_name_attribute (comp_unit_die (), remap_debug_filename (filename));
+ dwarf2out_init to avoid complications with PCH.
+ For LTO produced units use a fixed artificial name to avoid
+ leaking tempfile names into the dwarf.  */
+  if (!in_lto_p)
+add_name_attribute (comp_unit_die (), remap_debug_filename (filename));
+  else
+add_name_attribute (comp_unit_die (), "");
   if (!IS_ABSOLUTE_PATH (filename) || targetm.force_at_comp_dir)
 add_comp_dir_attribute (comp_unit_die ());
   else if (get_AT (comp_unit_die (), DW_AT_comp_dir) == NULL)

Re: [PATCH] Fix PR ipa/64813

2015-02-11 Thread Richard Biener

On Wed, Feb 11, 2015 at 1:46 PM, Martin Liška  wrote:
> On 02/10/2015 05:00 PM, Jakub Jelinek wrote:
>>
>> On Tue, Feb 10, 2015 at 04:56:40PM +0100, Martin Liška wrote:
>>>
>>> On 02/10/2015 01:50 PM, Richard Biener wrote:

 On Tue, Feb 10, 2015 at 12:07 PM, Martin Liška  wrote:
>
> Hello.
>
> Following patch is fix for PR ipa/64813. The patch was tested on a
> darwin
> target
> by Dominique.
>
> Ready for trunk?


 -  if (!(gimple_call_flags (call) & ECF_NORETURN))
 +  if (!alias_is_noreturn)

 that was technically unnecessary, right?  The call flags properly
 return ECF_NORETURN already?

 Ok if that is the case.
>>>
>>>
>>> Hi.
>>>
>>> You are right, !(gimple_call_flags (call) & ECF_NORETURN) returns a
>>> correct value.
>>> Motivation for replacement is not to repeat the same condition, I hope
>>> the value
>>> of alias_is_noreturn is correct in all uses.
>>
>>
>> And gimple_call_flags (call) & ECF_NORETURN doesn't?  I mean, if you could
>> initialize alias_is_noreturn to that, it would be nicer.
>>
>> Jakub
>>
>
> Hello.
>
> There are actually 3 places I need to guard if a function non-return.
> First place is responsible for result type creation:
>
>   /* Build call to the function being thunked.  */
>   if (!VOID_TYPE_P (restype) && !alias_is_noreturn)
>
> Where I can create guard just based on TREE_THIS_VOLATILE of the called
> function.
> Once the new gimple call is created I can use:
>
> (gimple_call_flags (call) & ECF_NORETURN
>
> So that's the reason I create one bool value (alias_is_noreturn) and the
> very
> beginning of expand_thunk method. Dost it make sense?

Yes, and I think your patch is ok.

Thanks,
Richard.

> Thanks,
> Martin

[committed] Fix C #pragma omp atomic parsing (PR c/64824)

2015-02-11 Thread Jakub Jelinek

Hi!

This patch fixes a thinko in POP macro in c_parser_binary_expression that
resulted in not triggering the build2 case if omp_atomic_lhs is non-NULL.

Fixed thusly, bootstrapped/regtested on x86_64-linux and i686-linux,
committed to trunk so far, 4.9 testing in progress, 4.8 will follow at
some point.

2015-02-11  Jakub Jelinek  

PR c/64824
* c-parser.c (c_parser_binary_expression): Fix OpenMP stack[sp].prec
check in the POP macro.

* testsuite/libgomp.c/atomic-18.c: New test.
* testsuite/libgomp.c++/atomic-16.C: New test.

--- gcc/c/c-parser.c.jj 2015-02-04 23:30:34.0 +0100
+++ gcc/c/c-parser.c2015-02-11 12:16:31.274379159 +0100
@@ -6233,8 +6233,8 @@ c_parser_binary_expression (c_parser *pa
 if (__builtin_expect (omp_atomic_lhs != NULL_TREE, 0) && sp == 1 \
&& c_parser_peek_token (parser)->type == CPP_SEMICOLON\
&& ((1 << stack[sp].prec) \
-   & (1 << (PREC_BITOR | PREC_BITXOR | PREC_BITAND | PREC_SHIFT  \
-| PREC_ADD | PREC_MULT)))\
+   & ((1 << PREC_BITOR) | (1 << PREC_BITXOR) | (1 << PREC_BITAND)\
+  | (1 << PREC_SHIFT) | (1 << PREC_ADD) | (1 << PREC_MULT))) \
&& stack[sp].op != TRUNC_MOD_EXPR \
&& stack[0].expr.value != error_mark_node \
&& stack[1].expr.value != error_mark_node \
--- libgomp/testsuite/libgomp.c/atomic-18.c.jj  2015-02-11 12:23:12.704648053 
+0100
+++ libgomp/testsuite/libgomp.c/atomic-18.c 2015-02-11 12:24:11.749658560 
+0100
@@ -0,0 +1,61 @@
+/* PR c/64824 */
+/* { dg-do run } */
+/* { dg-options "-O2 -fopenmp" } */
+
+void
+f1 (void)
+{
+  short a;
+  short b = 1;
+  int c = 3;
+#pragma omp atomic capture
+  a = b = c << b;
+  if (b != 6 || a != 6)
+__builtin_abort ();
+}
+
+void
+f2 (void)
+{
+  short a;
+  short b = 1;
+  int c = 3;
+#pragma omp atomic capture
+  a = b = c + b;
+  if (b != 4 || a != 4)
+__builtin_abort ();
+}
+
+void
+f3 (void)
+{
+  short a;
+  short b = 1;
+  long long int c = 3;
+#pragma omp atomic capture
+  a = b = c + b;
+  if (b != 4 || a != 4)
+__builtin_abort ();
+}
+
+void
+f4 (void)
+{
+  char a;
+  char b = 1;
+  long long int c = 3LL;
+#pragma omp atomic capture
+  a = b = c << b;
+  if (b != 6 || a != 6)
+__builtin_abort ();
+}
+
+int
+main ()
+{
+  f1 ();
+  f2 ();
+  f3 ();
+  f4 ();
+  return 0;
+}
--- libgomp/testsuite/libgomp.c++/atomic-16.C.jj2015-02-11 
12:24:03.337799529 +0100
+++ libgomp/testsuite/libgomp.c++/atomic-16.C   2015-02-11 12:23:58.019888648 
+0100
@@ -0,0 +1,5 @@
+// PR c/64824
+// { dg-do run }
+// { dg-options "-O2 -fopenmp" }
+
+#include "../libgomp.c/atomic-18.c"

Jakub

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread H.J. Lu

On Tue, Feb 10, 2015 at 3:11 PM, Rainer Orth
 wrote:
> Jeff Law  writes:
>
>>> Subject: [PATCH 1/5] Add check_effective_target_pie
>>>
>>> Hi,
>>>
>>> This patch adds check_effective_target_pie to check if the current
>>> multilib generates PIE by default.
>>>
>>> Thanks.
>>>
>>> H.J.
>>> ---
>>> 2015-01-11  H.J. Lu  
>>>
>>>  * gcc.target/i386/pie.c: New test.
>>>
>>>  * lib/target-supports.exp (check_effective_target_pie): New.
>> OK.
>
> The new proc is bogus, unfortunately: there's already an existing
> check_effective_target_pie that checks if a target can support PIE.  The
> new one just overrides the previous one.  On targets supporting PIE
> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
> UNSUPPORTED.
>
> You should rename the new one to
> e.g. check_effective_target_pie_default, update the single user, and
> document it in sourcebuild.texi.

I checked in this as an obvious fix.

Thanks.


-- 
H.J.
---
Index: ChangeLog
===
--- ChangeLog (revision 220617)
+++ ChangeLog (working copy)
@@ -1,3 +1,7 @@
+2015-02-11  H.J. Lu  
+
+ * doc/sourcebuild.texi (pie_enabled): Document.
+
 2015-02-11  Martin Liska  

  PR ipa/64813
Index: testsuite/lib/target-supports.exp
===
--- testsuite/lib/target-supports.exp (revision 220617)
+++ testsuite/lib/target-supports.exp (working copy)
@@ -1100,8 +1100,8 @@

 # Return 1 if the current multilib generates PIE by default.

-proc check_effective_target_pie { } {
-return [check_no_compiler_messages pie assembly {
+proc check_effective_target_pie_enabled { } {
+return [check_no_compiler_messages pie_enabled assembly {
  #ifndef __PIE__
  #error unsupported
  #endif
Index: testsuite/gcc.dg/tree-ssa/ssa-store-ccp-3.c
===
--- testsuite/gcc.dg/tree-ssa/ssa-store-ccp-3.c (revision 220617)
+++ testsuite/gcc.dg/tree-ssa/ssa-store-ccp-3.c (working copy)
@@ -13,6 +13,6 @@

 /* There should be no reference for nonpic targets to
conststaticvariable as we should have inlined the 0. */
-/* { dg-final { scan-tree-dump-times "conststaticvariable" 0
"optimized" { target { pie || nonpic } } } } */
-/* { dg-final { scan-tree-dump-times "conststaticvariable" 1
"optimized" { target { { ! pie } && { ! nonpic } } } } } */
+/* { dg-final { scan-tree-dump-times "conststaticvariable" 0
"optimized" { target { pie_enabled || nonpic } } } } */
+/* { dg-final { scan-tree-dump-times "conststaticvariable" 1
"optimized" { target { { ! pie_enabled } && { ! nonpic } } } } } */
 /* { dg-final { cleanup-tree-dump "optimized" } } */
Index: testsuite/ChangeLog
===
--- testsuite/ChangeLog (revision 220617)
+++ testsuite/ChangeLog (working copy)
@@ -1,3 +1,11 @@
+2015-02-11  H.J. Lu  
+
+ * lib/target-supports.exp (check_effective_target_pie): Renamed
+ to ...
+ (check_effective_target_pie_enabled): This.
+ * gcc.dg/tree-ssa/ssa-store-ccp-3.c: Replace pie with pie_enabled.
+ * gcc.target/i386/pie.c: Likewise.
+
 2015-02-11  Andrew Pinski  

  PR target/64893
Index: testsuite/gcc.target/i386/pie.c
===
--- testsuite/gcc.target/i386/pie.c (revision 220617)
+++ testsuite/gcc.target/i386/pie.c (working copy)
@@ -1,4 +1,4 @@
-/* { dg-do compile { target pie } } */
+/* { dg-do compile { target pie_enabled } } */
 /* { dg-options "-O2" } */

 int foo (void);
Index: doc/sourcebuild.texi
===
--- doc/sourcebuild.texi (revision 220617)
+++ doc/sourcebuild.texi (working copy)
@@ -1884,6 +1884,9 @@
 @item nonpic
 Target does not generate PIC by default.

+@item pie_enabled
+Target generates PIE by default.
+
 @item pcc_bitfield_type_matters
 Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread Rainer Orth

"H.J. Lu"  writes:

>> The new proc is bogus, unfortunately: there's already an existing
>> check_effective_target_pie that checks if a target can support PIE.  The
>> new one just overrides the previous one.  On targets supporting PIE
>> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
>> UNSUPPORTED.
>>
>> You should rename the new one to
>> e.g. check_effective_target_pie_default, update the single user, and
>> document it in sourcebuild.texi.
>
> I checked in this as an obvious fix.

I think pie_enabled is not a very descriptive name:

Index: doc/sourcebuild.texi
===
--- doc/sourcebuild.texi (revision 220617)
+++ doc/sourcebuild.texi (working copy)
@@ -1884,6 +1884,9 @@
 @item nonpic
 Target does not generate PIC by default.

+@item pie_enabled
+Target generates PIE by default.
+
 @item pcc_bitfield_type_matters
 Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.

With -fpie, PIE is also enabled, just not the default without any
options.  Please either go with the pie_default I sugested or wait for
others to weigh in before rushing in another `obvious' fix.

Thanks.
Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University

Re: nvptx offloading patches [4/n]

2015-02-11 Thread Thomas Schwinge

Hi!

On Wed, 28 Jan 2015 18:05:25 +0100, I wrote:
> On Sat, 1 Nov 2014 13:11:29 +0100, Bernd Schmidt  
> wrote:
> > I'm sending this for reference more than anything else - this is the 
> > patch that adds the target support for offloading to the nvptx port. It 
> > depends on the other offloading patches Ilya is currently submitting.
> 
> Committed to trunk in r220209:
> 
> commit 9c08fbb35aa95420268c2762151f22a6a9b90e85
> Author: tschwinge 
> Date:   Wed Jan 28 17:03:44 2015 +
> 
> nvptx mkoffload.

Committed to trunk in r220620:

commit c84c7f1923ab042478d73029475cf7f1aab6a61a
Author: tschwinge 
Date:   Wed Feb 11 14:15:38 2015 +

nvptx mkoffload: __OPENMP_TARGET__ -> __OFFLOAD_TABLE__.

gcc/
* config/nvptx/mkoffload.c (process): Refer to __OFFLOAD_TABLE__
instead of __OPENMP_TARGET__.

git-svn-id: svn+ssh://gcc.gnu.org/svn/gcc/trunk@220620 
138bc75d-0d04-0410-961f-82ee72b054a4
---
 gcc/ChangeLog| 3 +++
 gcc/config/nvptx/mkoffload.c | 4 ++--
 2 files changed, 5 insertions(+), 2 deletions(-)

diff --git gcc/ChangeLog gcc/ChangeLog
index 1479dcb..bc4a050 100644
--- gcc/ChangeLog
+++ gcc/ChangeLog
@@ -1,5 +1,8 @@
 2015-02-11  Thomas Schwinge  
 
+   * config/nvptx/mkoffload.c (process): Refer to __OFFLOAD_TABLE__
+   instead of __OPENMP_TARGET__.
+
* config/nvptx/mkoffload.c: Include "gomp-constants.h".
(process): Use its GOMP_DEVICE_NVIDIA_PTX instead of (wrongly)
hard-coding PTX_ID.
diff --git gcc/config/nvptx/mkoffload.c gcc/config/nvptx/mkoffload.c
index 8f359cf..2287316 100644
--- gcc/config/nvptx/mkoffload.c
+++ gcc/config/nvptx/mkoffload.c
@@ -791,9 +791,9 @@ process (FILE *in, FILE *out)
 
   fprintf (out, "extern void GOMP_offload_register (const void *, int, void 
*);\n");
 
-  fprintf (out, "extern void *__OPENMP_TARGET__[];\n\n");
+  fprintf (out, "extern void *__OFFLOAD_TABLE__[];\n\n");
   fprintf (out, "static __attribute__((constructor)) void init (void)\n{\n");
-  fprintf (out, "  GOMP_offload_register (__OPENMP_TARGET__, %d,\n",
+  fprintf (out, "  GOMP_offload_register (__OFFLOAD_TABLE__, %d,\n",
   GOMP_DEVICE_NVIDIA_PTX);
   fprintf (out, " &target_data);\n");
   fprintf (out, "};\n");


Grüße,
 Thomas


pgp1FDb66hFdS.pgp
Description: PGP signature

Re: nvptx offloading patches [4/n]

2015-02-11 Thread Thomas Schwinge

Hi!

On Wed, 28 Jan 2015 20:59:45 +0300, Ilya Verbin  wrote:
> On 28 Jan 18:05, Thomas Schwinge wrote:
> > +  fprintf (out, "#define PTX_ID 1\n");
> > +  fprintf (out, "static __attribute__((constructor)) void init 
> > (void)\n{\n");
> > +  fprintf (out, "  GOMP_offload_register (__OPENMP_TARGET__, PTX_ID,\n");
> 
> The file include/gomp-constants.h already contains:
> #define GOMP_DEVICE_NVIDIA_PTX  5
> 
> I guess it would be better to include gomp-constants.h into mkoffload and to 
> use
> this define.

Thanks!  You're right indeed -- I mistakenly committed an older version
of this file.  Committed to trunk in r220619:

commit ab0e6fbc36cacaa619dfacec41e17a681a28562a
Author: tschwinge 
Date:   Wed Feb 11 14:15:28 2015 +

nvptx mkoffload: Don't hard-code GOMP_DEVICE_NVIDIA_PTX.

gcc/
* config/nvptx/mkoffload.c: Include "gomp-constants.h".
(process): Use its GOMP_DEVICE_NVIDIA_PTX instead of (wrongly)
hard-coding PTX_ID.

git-svn-id: svn+ssh://gcc.gnu.org/svn/gcc/trunk@220619 
138bc75d-0d04-0410-961f-82ee72b054a4
---
 gcc/ChangeLog| 6 ++
 gcc/config/nvptx/mkoffload.c | 5 +++--
 2 files changed, 9 insertions(+), 2 deletions(-)

diff --git gcc/ChangeLog gcc/ChangeLog
index 2fa7ff2..1479dcb 100644
--- gcc/ChangeLog
+++ gcc/ChangeLog
@@ -1,3 +1,9 @@
+2015-02-11  Thomas Schwinge  
+
+   * config/nvptx/mkoffload.c: Include "gomp-constants.h".
+   (process): Use its GOMP_DEVICE_NVIDIA_PTX instead of (wrongly)
+   hard-coding PTX_ID.
+
 2015-02-11  H.J. Lu  
 
* doc/sourcebuild.texi (pie_enabled): Document.
diff --git gcc/config/nvptx/mkoffload.c gcc/config/nvptx/mkoffload.c
index 38ccdba..8f359cf 100644
--- gcc/config/nvptx/mkoffload.c
+++ gcc/config/nvptx/mkoffload.c
@@ -35,6 +35,7 @@
 #include "obstack.h"
 #include "diagnostic-core.h"
 #include "collect-utils.h"
+#include "gomp-constants.h"
 
 const char tool_name[] = "nvptx mkoffload";
 
@@ -791,9 +792,9 @@ process (FILE *in, FILE *out)
   fprintf (out, "extern void GOMP_offload_register (const void *, int, void 
*);\n");
 
   fprintf (out, "extern void *__OPENMP_TARGET__[];\n\n");
-  fprintf (out, "#define PTX_ID 1\n");
   fprintf (out, "static __attribute__((constructor)) void init (void)\n{\n");
-  fprintf (out, "  GOMP_offload_register (__OPENMP_TARGET__, PTX_ID,\n");
+  fprintf (out, "  GOMP_offload_register (__OPENMP_TARGET__, %d,\n",
+  GOMP_DEVICE_NVIDIA_PTX);
   fprintf (out, " &target_data);\n");
   fprintf (out, "};\n");
 }


Grüße,
 Thomas


pgpaqE_j2EL5W.pgp
Description: PGP signature

Re: nvptx offloading patches [4/n]

2015-02-11 Thread Thomas Schwinge

Hi!

On Wed, 28 Jan 2015 18:05:25 +0100, I wrote:
> On Sat, 1 Nov 2014 13:11:29 +0100, Bernd Schmidt  
> wrote:
> > I'm sending this for reference more than anything else - this is the 
> > patch that adds the target support for offloading to the nvptx port. It 
> > depends on the other offloading patches Ilya is currently submitting.
> 
> Committed to trunk in r220209:
> 
> commit 9c08fbb35aa95420268c2762151f22a6a9b90e85
> Author: tschwinge 
> Date:   Wed Jan 28 17:03:44 2015 +
> 
> nvptx mkoffload.

Committed to trunk in r220621:

commit bfeb7c1c69130bb13265c49126e6159dbd9b8a5d
Author: tschwinge 
Date:   Wed Feb 11 14:15:47 2015 +

nvptx mkoffload: Initialize GCC diagnostic machinery before using it.

gcc/
* config/nvptx/mkoffload.c: Include "diagnostic.h" instead of
"diagnostic-core.h".
(main): Initialize progname, and call diagnostic_initialize.

git-svn-id: svn+ssh://gcc.gnu.org/svn/gcc/trunk@220621 
138bc75d-0d04-0410-961f-82ee72b054a4
---
 gcc/ChangeLog| 4 
 gcc/config/nvptx/mkoffload.c | 5 -
 2 files changed, 8 insertions(+), 1 deletion(-)

diff --git gcc/ChangeLog gcc/ChangeLog
index bc4a050..482b128 100644
--- gcc/ChangeLog
+++ gcc/ChangeLog
@@ -1,5 +1,9 @@
 2015-02-11  Thomas Schwinge  
 
+   * config/nvptx/mkoffload.c: Include "diagnostic.h" instead of
+   "diagnostic-core.h".
+   (main): Initialize progname, and call diagnostic_initialize.
+
* config/nvptx/mkoffload.c (process): Refer to __OFFLOAD_TABLE__
instead of __OPENMP_TARGET__.
 
diff --git gcc/config/nvptx/mkoffload.c gcc/config/nvptx/mkoffload.c
index 2287316..739aee8 100644
--- gcc/config/nvptx/mkoffload.c
+++ gcc/config/nvptx/mkoffload.c
@@ -33,7 +33,7 @@
 #include "intl.h"
 #include 
 #include "obstack.h"
-#include "diagnostic-core.h"
+#include "diagnostic.h"
 #include "collect-utils.h"
 #include "gomp-constants.h"
 
@@ -828,6 +828,9 @@ main (int argc, char **argv)
   FILE *out = stdout;
   const char *outname = 0;
 
+  progname = "mkoffload";
+  diagnostic_initialize (global_dc, 0);
+
   char *collect_gcc = getenv ("COLLECT_GCC");
   if (collect_gcc == NULL)
 fatal_error (input_location, "COLLECT_GCC must be set.");


Grüße,
 Thomas


pgp87KXUxlCcH.pgp
Description: PGP signature

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread H.J. Lu

On Wed, Feb 11, 2015 at 6:10 AM, Rainer Orth
 wrote:
> "H.J. Lu"  writes:
>
>>> The new proc is bogus, unfortunately: there's already an existing
>>> check_effective_target_pie that checks if a target can support PIE.  The
>>> new one just overrides the previous one.  On targets supporting PIE
>>> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
>>> UNSUPPORTED.
>>>
>>> You should rename the new one to
>>> e.g. check_effective_target_pie_default, update the single user, and
>>> document it in sourcebuild.texi.
>>
>> I checked in this as an obvious fix.
>
> I think pie_enabled is not a very descriptive name:
>
> Index: doc/sourcebuild.texi
> ===
> --- doc/sourcebuild.texi (revision 220617)
> +++ doc/sourcebuild.texi (working copy)
> @@ -1884,6 +1884,9 @@
>  @item nonpic
>  Target does not generate PIC by default.
>
> +@item pie_enabled
> +Target generates PIE by default.
> +
>  @item pcc_bitfield_type_matters
>  Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.
>
> With -fpie, PIE is also enabled, just not the default without any

I was testing

# make RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'

I don't consider PIE is default.  It is just enabled.

> options.  Please either go with the pie_default I sugested or wait for
> others to weigh in before rushing in another `obvious' fix.
>



-- 
H.J.

Re: [PATCH] PR 62173, re-shuffle insns for RTL loop invariant hoisting

2015-02-11 Thread Kenneth Zadeck


On 02/11/2015 06:20 AM, Jiong Wang wrote:

2014-12-19 15:21 GMT+00:00 Kenneth Zadeck :

however, since i am a nice person 

loop-invariant solves the DF_UD_CHAIN which builds a data structure that
connects each use with all of the defs that reach it.   I believe that this
is the opposite of what you want here.

if you really need this, you need to also turn on the DF_DU_CHAIN which
builds the opposite structure.Both structures can be space hogs, but
they are both turned on in other places in the compiler so it is unlikely to
be an issue.

Exactly, Thanks, Kenneth.

This approach works from my experiment and look much better than
previous REG_NOTE approach.
while it do have one problem. We need the UD/DU chain built before we
do insn re-shuffling.
While after re-shuffling, UD chain needs update, otherwise, the later
"check_dependecies" in find_invariant_insn may fail.

although we have re-shuffle instruction 1 into 2, the later check
still using old UD info, thus
decide instruction 2 is not iv.

1: regA <- vfp + regB
2: regA <- vfp + const

my current fix is to insert those re-shuffled insn into a table named
"vfp_const_iv", then skip those
dependencies check  for them as they don't have any dependencies.
You now are beginning to understand why Mark Wegman and I invented SSA 
form almost 30 years ago  There is no good way to keep the 
information up to data as you are changing the program.To a large 
extent you are on your own.   If what you are suggesting works, then 
this is likely much faster than rebuilding the chains.






LOG_LINKs have nothing to do with single use; they point from the
_first_
use to its corresponding def.

You might want to look at what fwprop does instead.

Pass rtl fwprop uses df information in single-definition way, it
doesn't really take into consideration if register is a single use.
This often corrupts other optimizations like post-increment and
load/store pair.  For example:

add r2, r1, r0
ldr rx, [r2]
add r2, r2, #4
is transformed into below form:
add r2, r1, r0
ldr rx, [r1, r0]
add r2, r2, #4

As a result, post-increment opportunity is corrupted, also definition
of r2 can't be deleted because it's not single use.

Thanks,
bin

thanks for all these suggestion.

Have look at the LOG_LINK build function, a simple reverse scan, while
needs to allocate big map array for all pseudo regs. Haven't looked at
similar code in fwprop,

actually, when found the first matched insn pattern, I just want to
scan several insns next, then abort quickly if nothing meet
requirement. there is no need to build full single-use information.

still can anyone confirm that it is safe to re-use REG_DEAD info there
without calling df_note_add_problem and df_analysis first? or I am
using those info passed down from the previous pass which calculated
these info and maybe broken?

It's not safe to use REG_DEAD info without re-computing it.

not sure that reg_dead is the right answer even if you did re-compute it.
I believe you can have two parallel uses (on both sides of an if-then-else)
for a single def (above the if then else) and have two REG_DEAD notes.


Richard.

Re: [patch, build] Restore bootstrap in building libcc1 on darwin

2015-02-11 Thread Iain Sandoe

Hi Jeff,

On 9 Feb 2015, at 14:47, Jeff Law  wrote:

> On 02/01/15 09:42, Iain Sandoe wrote:
>> 
>> This is a GCC5 bootstrap regression on 32bit Darwin hosts ( I can raise a PR 
>> if that is considered necessary).
> Has this been addressed or is it still pending?

I believe it's still awaiting review - I'm travelling this week, so limited 
access to resources - but will be able to tackle any comments at the weekend.
thanks
Iain

> 
> 
>> 
>> In fact it is not libcc1, but libiberty that is the cause -
>> 
>> On 26 Jan 2015, at 14:13, Rainer Orth wrote:
>> 
>>> FX  writes:
>>> 
> The default BOOT_CFLAGS are: -O2 -g -mdynamic-no-pic
> the libiberty pic build appends: -fno-common (and not even -fPIC) [NB
> -fPIC _won't_ override -mdynamic-no-pic, so that's not a simple way out]
> This means that the PIC library is being built with non-pic relocs.
 
 config/mh-darwin says that -mdynamic-no-pic is there because it “speeds
 compiles by 3-5%”. I don’t think we care about speed when the bootstrap
 fails, so can we remove it altogether?
>>> 
>>> Darwin/i686 still doesn't bootstrap without this patch, I believe.
>>> Shouldn't it be applied to trunk before GCC 5 ships, rather than leaving
>>> that target broken?
>> 
>> The PIC variant of libiberty, has never (since it was added, AFAICT) catered 
>> for the possibility that non-PIC and PIC options might conflict (and/or that 
>> it might not be possible to override non-PIC options simply by appending PIC 
>> ones).
>> 
>> This has gone un-noticed until the libcc1 plugin started linking with the 
>> pic version of libiberty.
>> 
>> Darwin uses -mdynamic-no-pic as a default flag during bootstrap for 32bit 
>> targets, since that gives a stated ~ 3-5% improvement in performance.
>> 
>> It is not possible to override this non-pic option by appending -fPIC (that 
>> simply results in a warning that mdynamic-no-pic takes precedence).
>> 
>> I'd rather not pretend that there's no problem and simply penalise 
>> performance on m32 darwin by removing the option from the bootflags.
>> 
>> So here's a patch that allows a target to declare incompatible non-pic 
>> options (in the same way that we have PIC options already declared as 
>> distinct).
>> 
>> The patch is largely mechanical (each of the targets adjusted to use the 
>> NONPIC flag for the relevant case).
>> 
>> As an aside, is there a portability reason that we don't make this repeated 
>> operation into a make function?
>> 
>> bootstapped on x86_64-linux, x86_64-darwin12, i686-darwin10, powerpc-darwin9,
>> cross-compiled x86-64-darwin12 X i686-darwin10 , native X i686-darwin10 
>> (x86_64-darwin12 build).
>> 
>> all languages including Ada (note that there's a local patch needed to 
>> work-around an Ada bootstrap issue - pr64349)
>> 
>> OK for trunk?
>> Iain
>> 
>> libiberty:
>>  * Makefile.in (NONPICFLAG, NEWCFLAGS, NEWCPPFLAGS): New.
>>  (COMPILE.c): Adjust to use new flags. (all non-pic targets):
>>  Adjust to use NONPIC flag.
>>  * configure.ac (NOPICFLAG): New.
>>  * configure: Regenerate.
>> 
>

Re: [PATCH] PR rtl-optimization/32219: optimizer causes wrong code in pic/hidden/weak symbol checking

2015-02-11 Thread Jack Howarth

Richard,
  Your proposed patch fails to bootstrap here on
x86_64-apple-darwin14 against the system clang compilers...

g++ -c   -g  -DIN_GCC-fno-exceptions -fno-rtti
-fasynchronous-unwind-tables -W -Wall -Wno-narrowing -Wwrite-strings
-Wcast-qual -Wno-format -Wmissing-format-attribute
-Woverloaded-virtual -pedantic -Wno-long-long -Wno-variadic-macros
-Wno-overlength-strings -fno-common  -DHAVE_CONFIG_H -I. -I.
-I../../gcc-5-20150211/gcc -I../../gcc-5-20150211/gcc/.
-I../../gcc-5-20150211/gcc/../include
-I../../gcc-5-20150211/gcc/../libcpp/include -I/sw/include
-I/sw/include  -I../../gcc-5-20150211/gcc/../libdecnumber
-I../../gcc-5-20150211/gcc/../libdecnumber/dpd -I../libdecnumber
-I../../gcc-5-20150211/gcc/../libbacktrace -I/sw/include -I/sw/include
-o varasm.o -MT varasm.o -MMD -MP -MF ./.deps/varasm.TPo
../../gcc-5-20150211/gcc/varasm.c
...
../../gcc-5-20150211/gcc/varasm.c:6899:9: error: use of undeclared
identifier 'resolution_to_local_definition_p'
return resolution_to_local_definition_p (vnode->resolution);
   ^
../../gcc-5-20150211/gcc/varasm.c:6906:9: error: use of undeclared
identifier 'resolution_to_local_definition_p'
return resolution_to_local_definition_p (node->resolution);
   ^
Jack


On Tue, Feb 10, 2015 at 4:19 PM, Richard Henderson  wrote:
> On 02/07/2015 08:45 AM, H.J. Lu wrote:
>> Here is an updated patch.
>
> This is an annoying comment to differentiate 3 different versions, FYI.
>
>> @@ -6826,11 +6817,18 @@ default_binds_local_p_1 (const_tree exp, int shlib)
>>&& (TREE_STATIC (exp) || DECL_EXTERNAL (exp)))
>>  {
>>varpool_node *vnode = varpool_node::get (exp);
>> -  if (vnode && (resolution_local_p (vnode->resolution) || 
>> vnode->in_other_partition))
>> - resolved_locally = true;
>> -  if (vnode
>> -   && resolution_to_local_definition_p (vnode->resolution))
>> - resolved_to_local_def = true;
>> +  /* If not building shared library, common or initialized symbols
>> +  are also resolved locally, regardless they are weak or not if
>> +  weak_dominate is true.  */
>> +  if (vnode)
>> + {
>> +   if ((weak_dominate && !shlib && vnode->definition)
>> +   || vnode->in_other_partition
>> +   || resolution_local_p (vnode->resolution))
>> + resolved_locally = true;
>> +   if (resolution_to_local_definition_p (vnode->resolution))
>> + resolved_to_local_def = true;
>> + }
>>  }
>>else if (TREE_CODE (exp) == FUNCTION_DECL && TREE_PUBLIC (exp))
>>  {
>
> Not the same test for a function_decl?  That's certainly a mistake.
>
> Indeed, I believe we can now unify these two cases by using the base
> symtab_node instead of varpool_node and cgraph_node.
>
> I'm a bit confused about why we have both resolved_to_local_def vs
> resolved_locally.  At least within this function, which only cares about 
> module
> locality rather than object file locality.  Honza?
>
>> @@ -6859,9 +6857,15 @@ default_binds_local_p_1 (const_tree exp, int shlib)
>>else if (! TREE_PUBLIC (exp))
>>  local_p = true;
>>/* A variable is local if the user has said explicitly that it will
>> - be.  */
>> + be and it is resolved or defined locally, not compiling for PIC,
>> + not weak or weak_dominate is false.  */
>>else if ((DECL_VISIBILITY_SPECIFIED (exp)
>>   || resolved_to_local_def)
>> +&& (!weak_dominate
>> +|| resolved_locally
>> +|| !flag_pic
>> +|| !DECL_EXTERNAL (exp)
>> +|| !DECL_WEAK (exp))
>>  && DECL_VISIBILITY (exp) != VISIBILITY_DEFAULT)
>>  local_p = true;
>
> I think this test is conflating several issues, and as such it's very 
> confusing.
>
> As an existing issue, I'm not sure why "specified" visibility is any different
> from unspecified visibility.  As far as I'm aware, the "specified" bit simply
> means that the decl doesn't inherit inherit visibility from the class, or from
> the command-line.  But once we're this far, the visibility actually applied to
> the symbol should be all that matters.
>
> Unfortunately, this test is 11 years old, from r85167.  It came in as part of
> the implementation of the visibility attribute for the class.
>
> I believe the next test should be
>
>   else if (DECL_WEAK (exp) && !resolved_locally)
> local_p = false;
>
> Given the change above, resolved_locally will now be true for (weak

Re: nvptx offloading patches [4/n]

2015-02-11 Thread Thomas Schwinge

Hi Bernd!

On Sat, 1 Nov 2014 13:11:29 +0100, Bernd Schmidt  
wrote:
> I'm sending this for reference more than anything else - this is the 
> patch that adds the target support for offloading to the nvptx port.

(I committed this in r220209.)

> I actually expect this to change a little in the near future; the nvptx 
> mkoffload duplicates some of the logic that we have in nvptx-as and I'm 
> thinking of making some improvements. But I figure it would be good to 
> show the entire picture, as it is as of now.

I'm aware this is in progress, and will replace the code I'm commenting
on below.  Just to make sure that similar issues don't exist in nvptx-as,
too.

> --- /dev/null
> +++ git/gcc/config/nvptx/mkoffload.c

> +static Token *
> +parse_file (Token *tok)
> +{
> +  [...]
> +  else
> +{
> +  /* Something strange.  Ignore it.  */
> +  if (comment)
> + append_stmt (&fns, comment);
> +
> +  while (tok->kind && !tok->end)
> + tok++;
> +}
> +  return tok;
> +}

I'm not sure if silently ignoring "strange" tokens is a good strategy?

If -freorder-blocks-and-partition is active, this results in PTX code
such as:

// BEGIN PREAMBLE
.version3.1
.target sm_30
.address_size 64
// END PREAMBLE

$LCOLDB0:
$LHOTB0:
// BEGIN FUNCTION DECL: vec_mult$_omp_fn$1
.entry vec_mult$_omp_fn$1(.param.u64 %in_ar1);
// BEGIN FUNCTION DEF: vec_mult$_omp_fn$1
.entry vec_mult$_omp_fn$1(.param.u64 %in_ar1)
{
.reg.u64 %ar1;
[...]

Note the global cold/hot labels.  This confuses mkoffload, and it runs
into a busy loop due to what I understand to be a bug in skipping of
"strange" tokens, cited above, which such global labels would fall under.
Here is what might be a fix for this (but I didn't analyze the parsing
code in detail); OK for trunk?

--- gcc/config/nvptx/mkoffload.c
+++ gcc/config/nvptx/mkoffload.c
@@ -755,8 +755,9 @@ parse_file (Token *tok)
   if (comment)
append_stmt (&fns, comment);

-  while (tok->kind && !tok->end)
+  do
tok++;
+  while (tok->kind && !tok->end);
 }
   return tok;
 }

Grüße,
 Thomas

signature.asc
Description: PGP signature

[nvptx] -freorder-blocks-and-partition, -freorder-functions (was: nvptx offloading patches [4/n])

2015-02-11 Thread Thomas Schwinge

Hi!

On Wed, 11 Feb 2015 15:44:26 +0100, I wrote:
> If -freorder-blocks-and-partition is active, this results in PTX code
> such as:
> 
> // BEGIN PREAMBLE
> .version3.1
> .target sm_30
> .address_size 64
> // END PREAMBLE
> 
> $LCOLDB0:
> $LHOTB0:
> // BEGIN FUNCTION DECL: vec_mult$_omp_fn$1
> .entry vec_mult$_omp_fn$1(.param.u64 %in_ar1);
> // BEGIN FUNCTION DEF: vec_mult$_omp_fn$1
> .entry vec_mult$_omp_fn$1(.param.u64 %in_ar1)
> {
> .reg.u64 %ar1;
> [...]
> 
> Note the global cold/hot labels.

Such partitioning might not make a lot of sense for the virtual ISA that
PTX is, but disabling it in nvptx.c:nvptx_option_override does not work.
(Because that is not invoked in the offloading code path?)  I see x86 has
a ix86_option_override_internal (but I don't know how that options
processing works) -- is something like that needed for nvptx, too, and
how to interconnect that with the offloading code path?  Sounds a bit
like what Jakub suggests in ?

Maybe -freorder-functions (of no use?) should also be disabled?

Here is a WIP patch for -freorder-blocks-and-partition (missing handling
of the offloading code path) -- does something like that make sense?

--- gcc/config/nvptx/nvptx.c
+++ gcc/config/nvptx/nvptx.c
@@ -93,6 +93,18 @@ nvptx_option_override (void)
   init_machine_status = nvptx_init_machine_status;
   /* Gives us a predictable order, which we need especially for variables.  */
   flag_toplevel_reorder = 1;
+  /* If enabled, global cold/hot labels will be emitted, which our mkoffload
+ currently doesn't cope with.  Also, it's not clear whether such
+ partitioning actually has any positive effect on the virtual ISA that PTX
+ is.  */
+  if (flag_reorder_blocks_and_partition)
+{
+  inform (input_location,
+ "-freorder-blocks-and-partition not supported on this "
+ "architecture");
+  flag_reorder_blocks_and_partition = 0;
+  flag_reorder_blocks = 1;
+}
   /* Assumes that it will see only hard registers.  */
   flag_var_tracking = 0;
   write_symbols = NO_DEBUG;

Grüße,
 Thomas

signature.asc
Description: PGP signature

[4.9] 3 backports to 4.9 branch from the trunk

2015-02-11 Thread Jakub Jelinek

Hi!

I've bootstrapped/regtested these 3 backports from trunk to 4.9 branch
on x86_64-linux and i686-linux and installed.

Jakub
2015-02-11  Jakub Jelinek  

Backported from mainline
2015-02-04  Jakub Jelinek  

PR c/64824
PR c/64868
* c-omp.c (c_finish_omp_atomic): Use TRUNC_DIV_EXPR
instead of RDIV_EXPR.  Use build_binary_op instead of
build2_loc.

* c-parser.c (c_parser_omp_atomic): Handle RDIV_EXPR.

* parser.c (cp_parser_omp_atomic): Handle RDIV_EXPR.

* testsuite/libgomp.c/pr64824.c: New test.
* testsuite/libgomp.c/pr64868.c: New test.
* testsuite/libgomp.c++/pr64824.C: New test.
* testsuite/libgomp.c++/pr64868.C: New test.

--- gcc/c-family/c-omp.c(revision 220419)
+++ gcc/c-family/c-omp.c(revision 220420)
@@ -206,6 +206,9 @@ c_finish_omp_atomic (location_t loc, enu
   return error_mark_node;
 }
 
+  if (opcode == RDIV_EXPR)
+opcode = TRUNC_DIV_EXPR;
+
   /* ??? Validate that rhs does not overlap lhs.  */
 
   /* Take and save the address of the lhs.  From then on we'll reference it
@@ -240,7 +243,7 @@ c_finish_omp_atomic (location_t loc, enu
  to do this, and then take it apart again.  */
   if (swapped)
 {
-  rhs = build2_loc (loc, opcode, TREE_TYPE (lhs), rhs, lhs);
+  rhs = build_binary_op (loc, opcode, rhs, lhs, 1);
   opcode = NOP_EXPR;
 }
   bool save = in_late_binary_op;
--- gcc/c/c-parser.c(revision 220419)
+++ gcc/c/c-parser.c(revision 220420)
@@ -12611,6 +12611,7 @@ restart:
{
case MULT_EXPR:
case TRUNC_DIV_EXPR:
+   case RDIV_EXPR:
case PLUS_EXPR:
case MINUS_EXPR:
case LSHIFT_EXPR:
--- gcc/cp/parser.c (revision 220419)
+++ gcc/cp/parser.c (revision 220420)
@@ -29835,6 +29835,7 @@ restart:
{
case MULT_EXPR:
case TRUNC_DIV_EXPR:
+   case RDIV_EXPR:
case PLUS_EXPR:
case MINUS_EXPR:
case LSHIFT_EXPR:
--- libgomp/testsuite/libgomp.c++/pr64868.C (revision 0)
+++ libgomp/testsuite/libgomp.c++/pr64868.C (revision 220420)
@@ -0,0 +1,5 @@
+// PR c/64868
+// { dg-do run }
+// { dg-options "-O2 -fopenmp" }
+
+#include "../libgomp.c/pr64868.c"
--- libgomp/testsuite/libgomp.c++/pr64824.C (revision 0)
+++ libgomp/testsuite/libgomp.c++/pr64824.C (revision 220420)
@@ -0,0 +1,5 @@
+// PR c/64824
+// { dg-do run }
+// { dg-options "-O2 -fopenmp" }
+
+#include "../libgomp.c/pr64824.c"
--- libgomp/testsuite/libgomp.c/pr64868.c   (revision 0)
+++ libgomp/testsuite/libgomp.c/pr64868.c   (revision 220420)
@@ -0,0 +1,87 @@
+/* PR c/64868 */
+/* { dg-do run } */
+/* { dg-options "-O2 -fopenmp" } */
+
+float f = 2.0f;
+double d = 4.0;
+long double ld = 8.0L;
+
+void
+foo ()
+{
+#pragma omp atomic
+  f = 1.0f / f;
+#pragma omp atomic
+  f = 1 / f;
+#pragma omp atomic
+  f = f / 2.0f;
+#pragma omp atomic
+  f = f / 2;
+#pragma omp atomic
+  f /= 2.0f;
+#pragma omp atomic
+  f /= 2;
+#pragma omp atomic
+  d = 1.0 / d;
+#pragma omp atomic
+  d = 1 / d;
+#pragma omp atomic
+  d = d / 2.0;
+#pragma omp atomic
+  d = d / 2;
+#pragma omp atomic
+  d /= 2.0;
+#pragma omp atomic
+  d /= 2;
+#pragma omp atomic
+  ld = 1.0L / ld;
+#pragma omp atomic
+  ld = 1 / ld;
+#pragma omp atomic
+  ld = ld / 2.0L;
+#pragma omp atomic
+  ld = ld / 2;
+#pragma omp atomic
+  ld /= 2.0L;
+#pragma omp atomic
+  ld /= 2;
+  if (f != 0.125f || d != 0.25 || ld != 0.5L)
+__builtin_abort ();
+}
+
+#ifdef __cplusplus
+template 
+void
+bar ()
+{
+  T v = ::d;
+#pragma omp atomic
+  v *= 16;
+#pragma omp atomic
+  v = 1.0 / v;
+#pragma omp atomic
+  v = N1 / v;
+#pragma omp atomic
+  v = v / 2.0;
+#pragma omp atomic
+  v = v / N2;
+#pragma omp atomic
+  v /= 2.0;
+#pragma omp atomic
+  v /= N2;
+  if (v != 0.25)
+__builtin_abort ();
+}
+#endif
+
+int
+main ()
+{
+  foo ();
+#ifdef __cplusplus
+  bar ();
+  bar ();
+  bar ();
+#endif
+  return 0;
+}
--- libgomp/testsuite/libgomp.c/pr64824.c   (revision 0)
+++ libgomp/testsuite/libgomp.c/pr64824.c   (revision 220420)
@@ -0,0 +1,16 @@
+/* PR c/64824 */
+/* { dg-do run } */
+/* { dg-options "-O2 -fopenmp" } */
+
+int
+main ()
+{
+  long long a;
+  long long b = 1LL;
+  int c = 3;
+#pragma omp atomic capture
+  a = b = c << b;
+  if (b != 6LL || a != 6LL)
+__builtin_abort ();
+  return 0;
+}
2015-02-11  Jakub Jelinek  

Backported from mainline
2015-02-09  Jakub Jelinek  

PR target/64979
* tree-stdarg.c (pass_stdarg::execute): Scan phi node args for
va_list escapes.

* gcc.dg/tree-ssa/stdarg-7.c: New test.
* gcc.c-torture/execute/pr64979.c: New test.

--- gcc/tree-stdarg.c   (revision 220542)
+++ gcc/tree-stdarg.c   (revision 220543)
@@ -823,21 +823,22 @@ execute_optimize_stdarg (void)
   /* For va_list_simple_ptr, we have to check PHI node

[PATCH] Fix -fsection-anchors alias handling (PR middle-end/65003)

2015-02-11 Thread Jakub Jelinek

Hi!

This patch fixes a thinko in -fsection-anchors alias handling code
Honza added in r211357.  Usually for aliases we don't create
SYMBOL_REF_HAS_BLOCK_INFO_P, but in the following testcase we do,
because the rtl is created through notice_global_symbol already
when the symbol wasn't an alias, only later optimizations made it into
an alias.  The thinko is that DECL_RTL of the alias target would be
SYMBOL_REF, I think that can't ever happen, usually the DECL_RTL would
be a MEM with SYMBOL_REF operand.

I've bootstrapped/regtested this on
{x86_64,i686,ppc64,ppc64le,aarch64,armv7hl,s390,s390x}-linux without
regressions, so I hope the assertion this way is safe, if not, we'd
need to avoid turning a symbol into an alias somewhere much earlier.

Ok for trunk?

2015-02-11  Jakub Jelinek  

PR middle-end/65003
* varasm.c (place_block_symbol): Assert that DECL_RTL of the
ultimate alias is MEM with SYMBOL_REF satisfying
SYMBOL_REF_HAS_BLOCK_INFO_P as its operand.  Don't pass the MEM
to place_block_symbol, but instead pass the SYMBOL_REF operand of it.

* g++.dg/opt/pr65003.C: New test.

--- gcc/varasm.c.jj 2015-02-04 15:24:20.0 +0100
+++ gcc/varasm.c2015-02-10 15:37:55.872609142 +0100
@@ -7180,6 +7180,10 @@ place_block_symbol (rtx symbol)
{
  rtx target = DECL_RTL (snode->ultimate_alias_target ()->decl);
 
+ gcc_assert (MEM_P (target)
+ && GET_CODE (XEXP (target, 0)) == SYMBOL_REF
+ && SYMBOL_REF_HAS_BLOCK_INFO_P (XEXP (target, 0)));
+ target = XEXP (target, 0);
  place_block_symbol (target);
  SYMBOL_REF_BLOCK_OFFSET (symbol) = SYMBOL_REF_BLOCK_OFFSET (target);
  return;
--- gcc/testsuite/g++.dg/opt/pr65003.C.jj   2015-02-10 15:50:44.240734029 
+0100
+++ gcc/testsuite/g++.dg/opt/pr65003.C  2015-02-10 15:50:39.468814055 +0100
@@ -0,0 +1,31 @@
+// PR middle-end/65003
+// { dg-do compile }
+// { dg-options "-O2" }
+// { dg-additional-options "-fpic" { target fpic } }
+
+struct A
+{
+  void operator= (A &);
+  A ();
+};
+struct B
+{
+  A b;
+};
+struct C
+{
+  virtual bool foo (int &, bool) const;
+};
+struct D : virtual C
+{
+  bool foo (int &, bool) const;
+  B e;
+};
+struct F : D
+{
+  F (int &, const int &, const A &);
+  bool foo (int &, bool) const;
+};
+bool D::foo (int &, bool) const {}
+F::F (int &, const int &, const A &) {}
+bool F::foo (int &, bool) const {}

Jakub

Re: [PATCH] Fix -fsection-anchors alias handling (PR middle-end/65003)

2015-02-11 Thread Richard Biener

On Wed, 11 Feb 2015, Jakub Jelinek wrote:

> Hi!
> 
> This patch fixes a thinko in -fsection-anchors alias handling code
> Honza added in r211357.  Usually for aliases we don't create
> SYMBOL_REF_HAS_BLOCK_INFO_P, but in the following testcase we do,
> because the rtl is created through notice_global_symbol already
> when the symbol wasn't an alias, only later optimizations made it into
> an alias.  The thinko is that DECL_RTL of the alias target would be
> SYMBOL_REF, I think that can't ever happen, usually the DECL_RTL would
> be a MEM with SYMBOL_REF operand.

Yeah, indeed.

> I've bootstrapped/regtested this on
> {x86_64,i686,ppc64,ppc64le,aarch64,armv7hl,s390,s390x}-linux without
> regressions, so I hope the assertion this way is safe, if not, we'd
> need to avoid turning a symbol into an alias somewhere much earlier.
> 
> Ok for trunk?

Ok.

Thanks,
Richard.

> 2015-02-11  Jakub Jelinek  
> 
>   PR middle-end/65003
>   * varasm.c (place_block_symbol): Assert that DECL_RTL of the
>   ultimate alias is MEM with SYMBOL_REF satisfying
>   SYMBOL_REF_HAS_BLOCK_INFO_P as its operand.  Don't pass the MEM
>   to place_block_symbol, but instead pass the SYMBOL_REF operand of it.
> 
>   * g++.dg/opt/pr65003.C: New test.
> 
> --- gcc/varasm.c.jj   2015-02-04 15:24:20.0 +0100
> +++ gcc/varasm.c  2015-02-10 15:37:55.872609142 +0100
> @@ -7180,6 +7180,10 @@ place_block_symbol (rtx symbol)
>   {
> rtx target = DECL_RTL (snode->ultimate_alias_target ()->decl);
>  
> +   gcc_assert (MEM_P (target)
> +   && GET_CODE (XEXP (target, 0)) == SYMBOL_REF
> +   && SYMBOL_REF_HAS_BLOCK_INFO_P (XEXP (target, 0)));
> +   target = XEXP (target, 0);
> place_block_symbol (target);
> SYMBOL_REF_BLOCK_OFFSET (symbol) = SYMBOL_REF_BLOCK_OFFSET (target);
> return;
> --- gcc/testsuite/g++.dg/opt/pr65003.C.jj 2015-02-10 15:50:44.240734029 
> +0100
> +++ gcc/testsuite/g++.dg/opt/pr65003.C2015-02-10 15:50:39.468814055 
> +0100
> @@ -0,0 +1,31 @@
> +// PR middle-end/65003
> +// { dg-do compile }
> +// { dg-options "-O2" }
> +// { dg-additional-options "-fpic" { target fpic } }
> +
> +struct A
> +{
> +  void operator= (A &);
> +  A ();
> +};
> +struct B
> +{
> +  A b;
> +};
> +struct C
> +{
> +  virtual bool foo (int &, bool) const;
> +};
> +struct D : virtual C
> +{
> +  bool foo (int &, bool) const;
> +  B e;
> +};
> +struct F : D
> +{
> +  F (int &, const int &, const A &);
> +  bool foo (int &, bool) const;
> +};
> +bool D::foo (int &, bool) const {}
> +F::F (int &, const int &, const A &) {}
> +bool F::foo (int &, bool) const {}
> 
>   Jakub
> 
> 

-- 
Richard Biener 
SUSE LINUX GmbH, GF: Felix Imendoerffer, Jane Smithard, Jennifer Guild,
Dilip Upmanyu, Graham Norton HRB 21284 (AG Nuernberg)

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread Rainer Orth

"H.J. Lu"  writes:

> On Wed, Feb 11, 2015 at 6:10 AM, Rainer Orth
>  wrote:
>> "H.J. Lu"  writes:
>>
 The new proc is bogus, unfortunately: there's already an existing
 check_effective_target_pie that checks if a target can support PIE.  The
 new one just overrides the previous one.  On targets supporting PIE
 (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
 UNSUPPORTED.

 You should rename the new one to
 e.g. check_effective_target_pie_default, update the single user, and
 document it in sourcebuild.texi.
>>>
>>> I checked in this as an obvious fix.
>>
>> I think pie_enabled is not a very descriptive name:
>>
>> Index: doc/sourcebuild.texi
>> ===
>> --- doc/sourcebuild.texi (revision 220617)
>> +++ doc/sourcebuild.texi (working copy)
>> @@ -1884,6 +1884,9 @@
>>  @item nonpic
>>  Target does not generate PIC by default.
>>
>> +@item pie_enabled
>> +Target generates PIE by default.
>> +
>>  @item pcc_bitfield_type_matters
>>  Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.
>>
>> With -fpie, PIE is also enabled, just not the default without any
>
> I was testing
>
> # make RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'
>
> I don't consider PIE is default.  It is just enabled.
>
>> options.  Please either go with the pie_default I sugested or wait for
>> others to weigh in before rushing in another `obvious' fix.

Then the description (both sourcebuild.texi and target-supports.texi) is
confusing.

What are you trying to achieve here, actually?  Even on Solaris 11/x86
(which doesn't support PIE), -fpie lets the
check_effective_target_pie_enabled (or whatever it's called) proc pass.
Shouldn't it also check if the target can support PIE at all?

Please clarify your goals before going forward with this.

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University

Re: nvptx offloading patches [4/n]

2015-02-11 Thread Bernd Schmidt


On 02/11/2015 03:44 PM, Thomas Schwinge wrote:

Note the global cold/hot labels.  This confuses mkoffload, and it runs
into a busy loop due to what I understand to be a bug in skipping of
"strange" tokens, cited above, which such global labels would fall under.
Here is what might be a fix for this (but I didn't analyze the parsing
code in detail); OK for trunk?


I'd rather fail if anything unexpected is seen. Things like 
-freorder-blocks-and-partition should be forced off in 
nvptx_option_override.



Bernd

Re: patch to fix rtl documentation for new floating point comparisons

2015-02-11 Thread Kenneth Zadeck


I vote for quiet.For three reasons:
1) it matches the expectation of what programmers expect. I view 
people who do floating point as falling into two categories:
a) people who wish that there were no such thing as nans. These 
people are happy to write programs with just < > <= >= == != and then be 
unhappily surprised if their code blows up.
b) careful floating pt programmers.  These people never want a 
signal, they properly choose their operators to with knowledge of how 
each of those operations work with respect to nans.

These people will never use anything like a signaling <>
2) it matches iso n1778 which is primarily written to satisfy the needs 
to (b).
3) Whenever you leave something like this undefined, you are basically 
saying "do not optimize"


On 02/10/2015 04:46 PM, Joseph Myers wrote:

On Mon, 9 Feb 2015, Kenneth Zadeck wrote:


I don't think it's useful to have the trapping semantics unspecified for a
comparison operation like that.  So the question is what's most useful for
LTGT and LTGT_EXPR to do (presumably they should do the same thing).  Lots
of existing code in this area seems confused (for example, HONOR_SNANS
should be irrelevant to reversing an equality comparison, as reversing it
will change neither results nor exceptions raised, whether or not
signaling NaNs are involved).  But maybe more code treats LTGT as a
signaling operation than otherwise, in accordance with the original
intent, and so that's the most appropriate way to document it (with !UNEQ
being the representation of the corresponding quiet operation).

section 7.12.14.4 in C99 seems pretty much to say that this is a quiet
operation.

It says islessgreater is quiet.  It says nothing about the LTGT RTL
operation or the LTGT_EXPR GIMPLE/GENERIC operation.
__builtin_islessgreater is implemented using UNEQ_EXPR not LTGT_EXPR.

It may make sense to define LTGT as exactly !UNEQ, and so quiet, but the
choice of definition is a matter of what's convenient for the
implementation (and which choice you make determines which existing code
in GCC should be considered incorrect).

Where back ends implement ltgt patterns, I don't know if they are
consistently quiet or signaling.

Re: [PATCH][libstdc++][Testsuite] isctype test fails for newlib.

2015-02-11 Thread Jonathan Wakely


On 11/02/15 11:14 +, Matthew Wahab wrote:

Attached the fixed patch.
Tested by running check-target-libstdc++-v3, with the modified tests, 
for arm-none-eabi and aarch64-none-linux-gnu.


Ok to commit?


OK, thanks.

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread H.J. Lu

On Wed, Feb 11, 2015 at 7:19 AM, Rainer Orth
 wrote:
> "H.J. Lu"  writes:
>
>> On Wed, Feb 11, 2015 at 6:10 AM, Rainer Orth
>>  wrote:
>>> "H.J. Lu"  writes:
>>>
> The new proc is bogus, unfortunately: there's already an existing
> check_effective_target_pie that checks if a target can support PIE.  The
> new one just overrides the previous one.  On targets supporting PIE
> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
> UNSUPPORTED.
>
> You should rename the new one to
> e.g. check_effective_target_pie_default, update the single user, and
> document it in sourcebuild.texi.

 I checked in this as an obvious fix.
>>>
>>> I think pie_enabled is not a very descriptive name:
>>>
>>> Index: doc/sourcebuild.texi
>>> ===
>>> --- doc/sourcebuild.texi (revision 220617)
>>> +++ doc/sourcebuild.texi (working copy)
>>> @@ -1884,6 +1884,9 @@
>>>  @item nonpic
>>>  Target does not generate PIC by default.
>>>
>>> +@item pie_enabled
>>> +Target generates PIE by default.
>>> +
>>>  @item pcc_bitfield_type_matters
>>>  Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.
>>>
>>> With -fpie, PIE is also enabled, just not the default without any
>>
>> I was testing
>>
>> # make RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'
>>
>> I don't consider PIE is default.  It is just enabled.
>>
>>> options.  Please either go with the pie_default I sugested or wait for
>>> others to weigh in before rushing in another `obvious' fix.
>
> Then the description (both sourcebuild.texi and target-supports.texi) is
> confusing.

That is how other options are described.

> What are you trying to achieve here, actually?  Even on Solaris 11/x86
> (which doesn't support PIE), -fpie lets the
> check_effective_target_pie_enabled (or whatever it's called) proc pass.
> Shouldn't it also check if the target can support PIE at all?
>

Assembly outputs may be different, depending on if PIE is
enabled nor not. When we scan assembly outputs for test
results, we have different expected results when PIE is enabled.
That is how pie_enabled is used so far.

-- 
H.J.

Re: [RFC][PR target/39726 P4 regression] match.pd pattern to do type narrowing

2015-02-11 Thread Jeff Law


On 02/11/15 04:16, Richard Biener wrote:


Application of this pattern (and the one I posted for 47477) is a concern
for targets that don't do sub-word arithmetic/logicals.  But I just did a
sniff test of one such target (v850-elf because it was handy) and I couldn't
spot a change in the end code using both the 47477 patch and my WIP patch
for this BZ.


The c-family frontends perform this kind of narrowing already anyway
(via the shorten_* stuff which is misplaced there and should be done
elsewhere for all frontends - thus in match.pd, thanks for starting that).
True, but I wanted to see if there was any impact, but thankfully there 
isn't.


The fact that the C/C++ front-ends are doing most of the shortening now 
probably explains why the fix for 47477 only affected code generation 
for the Java front-end.


Jeff

Re: [RFC][PR target/39726 P4 regression] match.pd pattern to do type narrowing

2015-02-11 Thread Jakub Jelinek

On Wed, Feb 11, 2015 at 08:56:20AM -0700, Jeff Law wrote:
> On 02/11/15 04:16, Richard Biener wrote:
> >>
> >>Application of this pattern (and the one I posted for 47477) is a concern
> >>for targets that don't do sub-word arithmetic/logicals.  But I just did a
> >>sniff test of one such target (v850-elf because it was handy) and I couldn't
> >>spot a change in the end code using both the 47477 patch and my WIP patch
> >>for this BZ.
> >
> >The c-family frontends perform this kind of narrowing already anyway
> >(via the shorten_* stuff which is misplaced there and should be done
> >elsewhere for all frontends - thus in match.pd, thanks for starting that).
> True, but I wanted to see if there was any impact, but thankfully there
> isn't.
> 
> The fact that the C/C++ front-ends are doing most of the shortening now
> probably explains why the fix for 47477 only affected code generation for
> the Java front-end.

The C/C++ FEs are doing it only if it appears on a single statement though,
something fold can see at once.
If you rewrite it so that there are multiple vars holding the intermediate
values, they of course can't do that and it is a task for match.pd or
specialized pass to handle those.

Jakub

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread Rainer Orth

"H.J. Lu"  writes:

> On Wed, Feb 11, 2015 at 7:19 AM, Rainer Orth
>  wrote:
>> "H.J. Lu"  writes:
>>
>>> On Wed, Feb 11, 2015 at 6:10 AM, Rainer Orth
>>>  wrote:
 "H.J. Lu"  writes:

>> The new proc is bogus, unfortunately: there's already an existing
>> check_effective_target_pie that checks if a target can support PIE.  The
>> new one just overrides the previous one.  On targets supporting PIE
>> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
>> UNSUPPORTED.
>>
>> You should rename the new one to
>> e.g. check_effective_target_pie_default, update the single user, and
>> document it in sourcebuild.texi.
>
> I checked in this as an obvious fix.

 I think pie_enabled is not a very descriptive name:

 Index: doc/sourcebuild.texi
 ===
 --- doc/sourcebuild.texi (revision 220617)
 +++ doc/sourcebuild.texi (working copy)
 @@ -1884,6 +1884,9 @@
  @item nonpic
  Target does not generate PIC by default.

 +@item pie_enabled
 +Target generates PIE by default.
 +
  @item pcc_bitfield_type_matters
  Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.

 With -fpie, PIE is also enabled, just not the default without any
>>>
>>> I was testing
>>>
>>> # make RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'
>>>
>>> I don't consider PIE is default.  It is just enabled.
>>>
 options.  Please either go with the pie_default I sugested or wait for
 others to weigh in before rushing in another `obvious' fix.
>>
>> Then the description (both sourcebuild.texi and target-supports.texi) is
>> confusing.
>
> That is how other options are described.

You're not getting my point:

* target-supports.exp now has

# Return 1 if the current multilib generates PIE by default.

* sourcebuild.texi states

Target generates PIE by default.

Which one is it?

>> What are you trying to achieve here, actually?  Even on Solaris 11/x86
>> (which doesn't support PIE), -fpie lets the
>> check_effective_target_pie_enabled (or whatever it's called) proc pass.
>> Shouldn't it also check if the target can support PIE at all?
>
> Assembly outputs may be different, depending on if PIE is
> enabled nor not. When we scan assembly outputs for test
> results, we have different expected results when PIE is enabled.
> That is how pie_enabled is used so far.

But why would you need a new effective-target keyword for that?  Simply
test the existing { target pie }, add -fpie and scan the assembler
output.  Why would you need pie_enabled or whatever at all?  Again: what
are you trying to achieve that cannot be done with the current keyword?

Right now, in gcc.target/i386/pie.c, you're only testing the PIE case
when PIE has been enabled by some means external to testsuite.  The test
always comes out UNSUPPORTED if not, so it isn't even exercised in a
regular bootstrap (except on Darwin which defaults to PIE, I believe).

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University

Re: [Patch, fortran] PR64952 - Missing temporary in assignment from elemental function

2015-02-11 Thread Dominique d'Humières

Dear Paul,

The patch works as advertised! I have two remarks:

(1) AFAIU there is no need for a temporary for

PROGRAM Main 
INTEGER :: i, index(5) = (/ (i, i = 1,5) /) 
REAL :: tmp(5), array(5) = (/ (i+0.0, i = 1,5) /) 
tmp = Fred(index,array) 
array = tmp
PRINT *, array 
CONTAINS 
ELEMENTAL FUNCTION Fred (n, x) 
REAL :: Fred 
INTEGER, INTENT(IN) :: n 
REAL, INTENT(IN) :: x 
! In general, this would be in an external procedure 
Fred = x+SUM(array(:n-1))+SUM(array(n+1:)) 
 END FUNCTION Fred 
END PROGRAM Main 

However I get

[Book15] f90/bug% gfc elemental_weird_db_2.f90 -Warray-temporaries
elemental_weird_db_2.f90:4:10:

 tmp = Fred(index,array) 
  1
Warning: Creating array temporary at (1) [-Warray-temporaries]

(2) You wrote: « However, this works and has no perceivable effect on 
Polyhedron timings. ». This is hardly a surprise since the Polyhedron tests 
don’t use any elemental procedure.

Thanks,

Dominique

> Le 10 févr. 2015 à 23:35, Paul Richard Thomas  
> a écrit :
> 
> Dear Mikael, dear all,
> 
> Thank you for the previous review. I believe that the attached
> responds to all of your comments and correctly compiles the three
> testcases that you provided. Two of these have been included in the
> original testcase and the third appears separately.
> 
> Bootstrapped and reg
> tested on FC21/x86_64 - OK for trunk?
> 
> Cheers
> 
> Paul
> 
> 2015-02-10  Paul Thomas  
> 
>PR fortran/64952
>* gfortran.h : Add 'array_outer_dependency' to symbol_attr.
>* trans.h : Add 'array_outer_dependency' to gfc_ss_info.
>* module.c : Add AB_ARRAY_OUTER_DEPENDENCY to ab_attribute.
>Add same to attr_bits.
>(mio_symbol_attribute): Handle 'array_outer_dependency' attr
>in module read and write.
>* resolve.c (resolve_function): If an elemental function is
>referenced that is marked as having an external array reference
>and the current namespace is that of an elemental function,
>mark the containing function likewise.
>(resolve_variable): Mark elemental function symbol
>as 'array_outer_dependency' if it has an array reference from
>outside its own namespace.
>* trans-array.c (gfc_conv_resolve_dependencies): If any ss is
>marked as 'array_outer_dependency' generate a temporary.
>(gfc_walk_function_expr): If the function is marked as
>'array_outer_dependency', likewise mark the head gfc_ss.
> 
> 2015-02-10  Paul Thomas  
> 
>PR fortran/64952
>* gfortran.dg/elemental_dependency_4.f90: New test
>* gfortran.dg/elemental_dependency_5.f90: New test

Re: [testsuite] PATCH: Add check_effective_target_pie

2015-02-11 Thread H.J. Lu

On Wed, Feb 11, 2015 at 8:15 AM, Rainer Orth
 wrote:
> "H.J. Lu"  writes:
>
>> On Wed, Feb 11, 2015 at 7:19 AM, Rainer Orth
>>  wrote:
>>> "H.J. Lu"  writes:
>>>
 On Wed, Feb 11, 2015 at 6:10 AM, Rainer Orth
  wrote:
> "H.J. Lu"  writes:
>
>>> The new proc is bogus, unfortunately: there's already an existing
>>> check_effective_target_pie that checks if a target can support PIE.  The
>>> new one just overrides the previous one.  On targets supporting PIE
>>> (like Darwin), but not defaulting to it, the PIE tests suddenly turn out
>>> UNSUPPORTED.
>>>
>>> You should rename the new one to
>>> e.g. check_effective_target_pie_default, update the single user, and
>>> document it in sourcebuild.texi.
>>
>> I checked in this as an obvious fix.
>
> I think pie_enabled is not a very descriptive name:
>
> Index: doc/sourcebuild.texi
> ===
> --- doc/sourcebuild.texi (revision 220617)
> +++ doc/sourcebuild.texi (working copy)
> @@ -1884,6 +1884,9 @@
>  @item nonpic
>  Target does not generate PIC by default.
>
> +@item pie_enabled
> +Target generates PIE by default.
> +
>  @item pcc_bitfield_type_matters
>  Target defines @code{PCC_BITFIELD_TYPE_MATTERS}.
>
> With -fpie, PIE is also enabled, just not the default without any

 I was testing

 # make RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'

 I don't consider PIE is default.  It is just enabled.

> options.  Please either go with the pie_default I sugested or wait for
> others to weigh in before rushing in another `obvious' fix.
>>>
>>> Then the description (both sourcebuild.texi and target-supports.texi) is
>>> confusing.
>>
>> That is how other options are described.
>
> You're not getting my point:
>
> * target-supports.exp now has
>
> # Return 1 if the current multilib generates PIE by default.
>
> * sourcebuild.texi states
>
> Target generates PIE by default.
>
> Which one is it?

I copied them from nonpic

# Return 1 if the current multilib does not generate PIC by default.

@item nonpic
Target does not generate PIC by default.

>>> What are you trying to achieve here, actually?  Even on Solaris 11/x86
>>> (which doesn't support PIE), -fpie lets the
>>> check_effective_target_pie_enabled (or whatever it's called) proc pass.
>>> Shouldn't it also check if the target can support PIE at all?
>>
>> Assembly outputs may be different, depending on if PIE is
>> enabled nor not. When we scan assembly outputs for test
>> results, we have different expected results when PIE is enabled.
>> That is how pie_enabled is used so far.
>
> But why would you need a new effective-target keyword for that?  Simply
> test the existing { target pie }, add -fpie and scan the assembler
> output.  Why would you need pie_enabled or whatever at all?  Again: what
> are you trying to achieve that cannot be done with the current keyword?
>
> Right now, in gcc.target/i386/pie.c, you're only testing the PIE case
> when PIE has been enabled by some means external to testsuite.  The test
> always comes out UNSUPPORTED if not, so it isn't even exercised in a
> regular bootstrap (except on Darwin which defaults to PIE, I believe).

That is intentional.  gcc.target/i386/pie.c will be test by

1. RUNTESTFLAGS="--target_board='unix{-m32\ -fpie,-fpie}'" .  Or
2. PIE is enabled by default:

https://gcc.gnu.org/ml/gcc-patches/2015-01/msg00956.html


-- 
H.J.

PING^2: [PATCH]: New configure options that make the compiler use -fPIE and -pie as default option

2015-02-11 Thread H.J. Lu

PING.

On Wed, Jan 28, 2015 at 8:05 AM, H.J. Lu  wrote:
> PING.
>
> On Tue, Jan 13, 2015 at 3:25 PM, H.J. Lu  wrote:
>> On Tue, Jan 13, 2015 at 5:03 AM, H.J. Lu  wrote:
>>> On Mon, Jan 12, 2015 at 11:50:41PM +, Joseph Myers wrote:
 On Mon, 12 Jan 2015, H.J. Lu wrote:

 > +if test x$enable_default_pie = xyes; then
 > +  AC_MSG_CHECKING(if $target supports default PIE)
 > +  enable_default_pie=no
 > +  case $target in
 > +i?86*-*-linux* | x86_64*-*-linux*)
 > +  saved_LDFLAGS="$LDFLAGS"
 > +  saved_CFLAGS="$CFLAGS"
 > +  CFLAGS="$CFLAGS -fPIE"
 > +  LDFLAGS="$LDFLAGS -fPIE -pie"
 > +  AC_TRY_LINK(,,[enable_default_pie=yes],)
 > +  LDFLAGS="$saved_LDFLAGS"
 > +  CFLAGS="$saved_CFLAGS"
 > +  ;;
 > +*)
 > +  ;;
 > +esac

 There should not be any such hardcoding of targets here without concrete
 evidence that the targets for which this sets enable_default_pie=no really
 cannot support PIE.  In particular, there is no reason at all for this to
 be architecture-specific; all GNU/Linux architectures should support PIE.

 I believe AC_TRY_LINK here will test for the host, whereas what you want
 to know is what's supported for the target (but it's not possible to run
 link tests for the target at this point; the compiler for the target
 hasn't even been built).

 So: just presume that if the user passes --enable-default-pie then they
 know what they are doing, and don't try to override their choice.

 > diff --git a/gcc/doc/install.texi b/gcc/doc/install.texi
 > index c9e3bf1..89fc305 100644
 > --- a/gcc/doc/install.texi
 > +++ b/gcc/doc/install.texi
 > @@ -1583,6 +1583,10 @@ not be built.
 >  Specify that the run-time libraries for stack smashing protection
 >  should not be built.
 >
 > +@item --enable-default-pie
 > +Turn on @option{-fPIE} and @option{-pie} by default if supported.
 > +Currently supported targets are i?86-*-linux* and x86-64-*-linux*.

 The "if supported" and target list can then be removed here.

>>>
>>> Here is the updated patch.  To support --enable-default-pie, each target
>>> must update STARTFILE_SPEC to support PIE_SPEC and NO_PIE_SPEC.  I can
>>> provide STARTFILE_SPEC patch if needed.
>>>
>>> Thanks.
>>>
>>>
>>> H.J.
>>> ---
>>> gcc/
>>>
>>> 2015-01-12  Magnus Granberg  
>>> H.J. Lu  
>>>
>>> * Makefile.in (COMPILER): Add @NO_PIE_CFLAGS@.
>>> (LINKER): Add @NO_PIE_FLAG@.
>>> (libgcc.mvars): Set NO_PIE_CFLAGS to -fno-PIE for
>>> --enable-default-pie.
>>> * common.opt (fPIE): Initialize to -1.
>>> (fpie): Likewise.
>>> (static): Add "RejectNegative Negative(shared)".
>>> (no-pie): New option.
>>> (pie): Replace "Negative(shared)" with "Negative(no-pie)".
>>> * configure.ac: Add --enable-default-pie.
>>> (NO_PIE_CFLAGS): New.  Check if -fno-PIE works.  AC_SUBST.
>>> (NO_PIE_FLAG): New.  Check if -no-pie works.  AC_SUBST.
>>> * defaults.h (DEFAULT_FLAG_PIE): New.  Default PIE to -fPIE.
>>> * gcc.c (NO_PIE_SPEC): New.
>>> (PIE_SPEC): Likewise.
>>> (LD_PIE_SPEC): Likewise.
>>> (LINK_PIE_SPEC): Handle -no-pie.  Use PIE_SPEC and LD_PIE_SPEC.
>>> * opts.c (DEFAULT_FLAG_PIE): New.  Set to 0 if ENABLE_DEFAULT_PIE
>>> is undefined.
>>> (finish_options): Update opts->x_flag_pie if it is -1.
>>> * config/gnu-user.h (FVTABLE_VERIFY_SPEC): New.
>>> (GNU_USER_TARGET_STARTFILE_SPEC): Use FVTABLE_VERIFY_SPEC.  Use
>>> NO_PIE_SPEC and NO_PIE_SPEC if ENABLE_DEFAULT_PIE is defined.
>>> (GNU_USER_TARGET_STARTFILE_SPEC): Use FVTABLE_VERIFY_SPEC.
>>> * doc/install.texi: Document --enable-default-pie.
>>> * doc/invoke.texi: Document -no-pie.
>>> * config.in: Regenerated.
>>> * configure: Likewise.
>>>
>>> gcc/ada/
>>>
>>> 2015-01-12  H.J. Lu  
>>>
>>> * gcc-interface/Makefile.in (TOOLS_LIBS): Add @NO_PIE_FLAG@.
>>>
>>> libgcc/
>>>
>>> 2015-01-12  H.J. Lu  
>>>
>>> * Makefile.in (CRTSTUFF_CFLAGS): Add $(NO_PIE_CFLAGS).
>>>
>>
>> This is the updated patch.  I fixed the -r regression.   LTO tests
>> pass now.
>>
>> --
>> H.J.
>
>
>
> --
> H.J.



-- 
H.J.

Re: [Patch, fortran] PR64952 - Missing temporary in assignment from elemental function

2015-02-11 Thread Paul Richard Thomas

Dear Dominique,

> The patch works as advertised! I have two remarks:

Of course it does :-)

>
> (1) AFAIU there is no need for a temporary for

Indeed not. I cannot see how that can be avoided without a much more
elaborate patch. Frankly, I do not see that it would be warranted.
Much better some extra temporaries in corner cases, than wrong code.
>
> PROGRAM Main
> INTEGER :: i, index(5) = (/ (i, i = 1,5) /)
> REAL :: tmp(5), array(5) = (/ (i+0.0, i = 1,5) /)
> tmp = Fred(index,array)
> array = tmp
> PRINT *, array
> CONTAINS
> ELEMENTAL FUNCTION Fred (n, x)
> REAL :: Fred
> INTEGER, INTENT(IN) :: n
> REAL, INTENT(IN) :: x
> ! In general, this would be in an external procedure
> Fred = x+SUM(array(:n-1))+SUM(array(n+1:))
>  END FUNCTION Fred
> END PROGRAM Main
>
> However I get
>
> [Book15] f90/bug% gfc elemental_weird_db_2.f90 -Warray-temporaries
> elemental_weird_db_2.f90:4:10:
>
>  tmp = Fred(index,array)
>   1
> Warning: Creating array temporary at (1) [-Warray-temporaries]
>
> (2) You wrote: << However, this works and has no perceivable effect on 
> Polyhedron timings. >>. This is hardly a surprise since the Polyhedron tests 
> don't use any elemental procedure.

:-) You might have gathered that I didn't check! Within errors, it
didn't affect compile times either.

Thanks for the feedback

Paul

>
> Thanks,
>
> Dominique
>
>> Le 10 févr. 2015 à 23:35, Paul Richard Thomas 
>>  a écrit :
>>
>> Dear Mikael, dear all,
>>
>> Thank you for the previous review. I believe that the attached
>> responds to all of your comments and correctly compiles the three
>> testcases that you provided. Two of these have been included in the
>> original testcase and the third appears separately.
>>
>> Bootstrapped and reg
>> tested on FC21/x86_64 - OK for trunk?
>>
>> Cheers
>>
>> Paul
>>
>> 2015-02-10  Paul Thomas  
>>
>>PR fortran/64952
>>* gfortran.h : Add 'array_outer_dependency' to symbol_attr.
>>* trans.h : Add 'array_outer_dependency' to gfc_ss_info.
>>* module.c : Add AB_ARRAY_OUTER_DEPENDENCY to ab_attribute.
>>Add same to attr_bits.
>>(mio_symbol_attribute): Handle 'array_outer_dependency' attr
>>in module read and write.
>>* resolve.c (resolve_function): If an elemental function is
>>referenced that is marked as having an external array reference
>>and the current namespace is that of an elemental function,
>>mark the containing function likewise.
>>(resolve_variable): Mark elemental function symbol
>>as 'array_outer_dependency' if it has an array reference from
>>outside its own namespace.
>>* trans-array.c (gfc_conv_resolve_dependencies): If any ss is
>>marked as 'array_outer_dependency' generate a temporary.
>>(gfc_walk_function_expr): If the function is marked as
>>'array_outer_dependency', likewise mark the head gfc_ss.
>>
>> 2015-02-10  Paul Thomas  
>>
>>PR fortran/64952
>>* gfortran.dg/elemental_dependency_4.f90: New test
>>* gfortran.dg/elemental_dependency_5.f90: New test
>



-- 
Outside of a dog, a book is a man's best friend. Inside of a dog it's
too dark to read.

Groucho Marx

RE: [PATCH, FT32] initial support

2015-02-11 Thread Joseph Myers

On Wed, 11 Feb 2015, James Bowman wrote:

> +@table @gcctabopt
> +
> +@item -mspace
> +@opindex mspace
> +Enable code-size optimizations.
> +Some of these optimizations incur a minor performance penalty.

We already have -Os, so why is an architecture-specific option for this 
needed?

> +A 16-bit signed constant (-32768..32767)

Use @minus{} for a minus sign in Texinfo documentation, and @dots{} 
instead of literal ".." or "...".

-- 
Joseph S. Myers
jos...@codesourcery.com

Re: [RFA][PATCH][PR rtl-optimization/47477] Type narrowing in match.pd

2015-02-11 Thread Jeff Law


On 02/11/15 03:56, Richard Biener wrote:









diff --git a/gcc/ChangeLog b/gcc/ChangeLog
index 7f3816c..7a95029 100644
--- a/gcc/ChangeLog
+++ b/gcc/ChangeLog
@@ -1,3 +1,8 @@
+2015-02-10  Jeff Law  
+
+   * match.pd (convert (plus/minus (convert @0) (convert @1): New
+   simplifier to narrow arithmetic.
+


Please reference PR47477 from here

Doh.  Fixed.





  2015-02-10  Richard Biener  

 PR tree-optimization/64909
diff --git a/gcc/match.pd b/gcc/match.pd
index 81c4ee6..abc703e 100644
--- a/gcc/match.pd
+++ b/gcc/match.pd
@@ -1018,3 +1018,21 @@ along with GCC; see the file COPYING3.  If not see
 (logs (pows @0 @1))
 (mult @1 (logs @0)

+/* If we have a narrowing conversion of an arithmetic operation where
+   both operands are widening conversions from the same type as the outer
+   narrowing conversion.  Then convert the innermost operands to a suitable
+   unsigned type (to avoid introducing undefined behaviour), perform the
+   operation and convert the result to the desired type.
+
+   This narrows the arithmetic operation.  */


This is also a part of what shorten_binary_op does, so please refer to
that in the comment so we can eventually complete this from there
and remove shorten_binary_op.
Done.  I've created a block comment meant to cover this pattern and any 
additions we need along the way to moving shortening/narrowing out of 
the front-ends.






+(for op (plus minus)
+  (simplify
+(convert (op (convert@2 @0) (convert @1)))
+(if (TREE_TYPE (@0) == TREE_TYPE (@1)
+ && TREE_TYPE (@0) == type


Otherwhere in match.pd we use

(GIMPLE && types_compatible_p (TREE_TYPE (@0), TREE_TYPE (@1)))
  || (GENERIC && TREE_TYPE (@0) == TREE_TYPE (@1))

for type equality checks.  And I think even for GENERIC you want
to ignore qualifiers and thus use TYPE_MAIN_VARIANT (TREE_TYPE (@0))
== TYPE_MAIN_VARAINT (TREE_TYPE (@1)).
Seems reasonable.Makes for some ugly formatting though.  It might 
make sense to factor that test so we don't end up repeating it senselessly.





+ && INTEGRAL_TYPE_P (type)


So instead of testing TREE_TYPE (@0) == type we probably want
to just assert that TYPE_PRECISON (TREE_TYPE (@0)) == TYPE_PRECISION
(type) to allow sign-changes.  Say for

Yea, makes sense.



short i, j;
(unsigned short) (i + j)

we still want to narrow the widened i and j.

Right.



+ && TYPE_PRECISION (TREE_TYPE (@2)) > TYPE_PRECISION (TREE_TYPE
(@0))
+/* This prevents infinite recursion.  */
+&& unsigned_type_for (TREE_TYPE (@0)) != TREE_TYPE (@2))


How can it recurse with the type precision constraint right above?
It may no longer be needed.  The precision check used to be >=, but that 
regressed some java codes.  So it got tightened in the last iteration 
before posting.




Note that technically we don't need to perform the operation in an unsigned
type iff TYPE_OVERFLOW_WRAPS (TREE_TYPE (@0)).  Thus

(if (TYPE_OVERFLOW_WRAPS (TREE_TYPE (@0))
  (convert (op @0 @1)))
(with { tree utype = unsigned_type_for (TREE_TYPE (@0)); }
  (convert (op (convert:utype @0) (convert:utype @1))

You can see that GCC exploits that with -fwrapv.  Maybe this
simplification should be split out into a separate pattern though,
tacking sign-changes for binary ops only.

No strong opinion on separate pattern or integrating into existing pattern.

jeff

Re: [RFC][PR target/39726 P4 regression] match.pd pattern to do type narrowing

2015-02-11 Thread Joseph Myers

On Tue, 10 Feb 2015, Jeff Law wrote:

> I think the way to go is to always convert the inner operands to an unsigned
> type.  In fact, everything except the outer convert should be using an
> unsigned type of the same size/precision as @0's type.  The outer convert
> should, of course, be the final type.
> 
> That avoids all the concerns about sign bit propagation from the narrow type
> into the wider final type.

Subject of course to allowing for the cases where the original expression 
uses sign-extension (signed narrow type), the mask is consistent with this 
(sign bit of narrow type and all higher bits set, e.g. -128), and the 
operation is bitwise not arithmetic, if you want to optimize those as 
well.

-- 
Joseph S. Myers
jos...@codesourcery.com

Re: [PATCH] Fix ubsan and C++14 constexpr ICEs (PR sanitizer/63956)

2015-02-11 Thread Mike Stump

On Feb 11, 2015, at 4:24 AM, Marek Polacek  wrote:
> 
> The following patch splits the test into C and C++ test cases, so
> hopefully fixing the issue.  Ok for trunk?
> 
> 2015-02-11  Marek Polacek  
> 
>   * g++.dg/ubsan/shift-1.C: New test.
>   * gcc.dg/ubsan/c-shift-2.c: New test.
>   * c-c++-common/ubsan/shift-5.c: Remove file.
> 
> diff --git gcc/testsuite/g++.dg/ubsan/shift-1.C 
> gcc/testsuite/g++.dg/ubsan/shift-1.C
> index e69de29..8a71279 100644
> --- gcc/testsuite/g++.dg/ubsan/shift-1.C
> +++ gcc/testsuite/g++.dg/ubsan/shift-1.C
> @@ -0,0 +1,37 @@
> +/* { dg-do compile } */
> +/* { dg-options "-fsanitize=shift -w" } */
> +/* { dg-shouldfail "ubsan" } */
> +
> +int
> +foo (int x)
> +{
> +  /* None of the following should pass.  */
> +  switch (x)
> +{
> +case 1 >> -1:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 11 } */

Never include line numbers, unless there is no other way.  Here, I think you 
can drop it, and merely ensure this is on the right line?

An example from gcc.dg:

int g2(int a; __attribute__((unused))); /* { dg-error "just a forward 
declaration" "no parms" { xfail *-*-* } } */

I’m hoping that style will work.

> +case -1 >> -1:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 13 } */

Likewise.

> +case 1 << -1:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 15 } */

Likewise.

> +case -1 << -1:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 17 } */

Likewise.

> +  return 1;
> +}
> +  return 0;
> +}
> +
> +int
> +bar (int x)
> +{
> +  /* None of the following should pass.  */
> +  switch (x)
> +{
> +case -1 >> 200:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 30 } */

Likewise.

> +case 1 << 200:
> +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 32 } */

Likewise.

Ok with the above fixes, if applicable.

[gomp4] Merge trunk r220584 (2015-02-10) into gomp-4_0-branch

2015-02-11 Thread Thomas Schwinge

Hi!

In r220627, I have committed a merge from trunk r220584 (2015-02-10) into
gomp-4_0-branch.


Grüße,
 Thomas


pgpwCbyTbYKcB.pgp
Description: PGP signature

Re: [Patch, fortran] PR64952 - Missing temporary in assignment from elemental function

2015-02-11 Thread Dominique d'Humières

Dear Paul,

> Le 11 févr. 2015 à 17:57, Paul Richard Thomas  
> a écrit :
> 
> Dear Dominique,
> 
>> The patch works as advertised! I have two remarks:
> 
> Of course it does :-)

;-)

>> 
>> (1) AFAIU there is no need for a temporary for
> 
> Indeed not. I cannot see how that can be avoided without a much more
> elaborate patch. Frankly, I do not see that it would be warranted.
> Much better some extra temporaries in corner cases, than wrong code.

Agreed.

>> (2) You wrote: << However, this works and has no perceivable effect on 
>> Polyhedron timings. >>. This is hardly a surprise since the Polyhedron tests 
>> don't use any elemental procedure.
> 
> :-) You might have gathered that I didn't check! Within errors, it
> didn't affect compile times either.

AFAICT, I have only one test in my archives that time elemental functions: 
pr40581 for which I don’t see any difference with/without your patch.

Looking at bugzilla it seems that Fran Martinez Fadrique is a heavy consumer of 
elemental procs (see e.g. pr52332).
May be you can ask him to do some timing of his code with/without your patch.

> Thanks for the feedback

You’re welcome!

Dominique

Re: [PATCH] PR 62173, re-shuffle insns for RTL loop invariant hoisting

2015-02-11 Thread Jiong Wang


On 11/02/15 14:21, Kenneth Zadeck wrote:

On 02/11/2015 06:20 AM, Jiong Wang wrote:

2014-12-19 15:21 GMT+00:00 Kenneth Zadeck :

however, since i am a nice person 

loop-invariant solves the DF_UD_CHAIN which builds a data structure that
connects each use with all of the defs that reach it.   I believe that this
is the opposite of what you want here.

if you really need this, you need to also turn on the DF_DU_CHAIN which
builds the opposite structure.Both structures can be space hogs, but
they are both turned on in other places in the compiler so it is unlikely to
be an issue.

Exactly, Thanks, Kenneth.

This approach works from my experiment and look much better than
previous REG_NOTE approach.
while it do have one problem. We need the UD/DU chain built before we
do insn re-shuffling.
While after re-shuffling, UD chain needs update, otherwise, the later
"check_dependecies" in find_invariant_insn may fail.

although we have re-shuffle instruction 1 into 2, the later check
still using old UD info, thus
decide instruction 2 is not iv.

1: regA <- vfp + regB
2: regA <- vfp + const

my current fix is to insert those re-shuffled insn into a table named
"vfp_const_iv", then skip those
dependencies check  for them as they don't have any dependencies.

You now are beginning to understand why Mark Wegman and I invented SSA
form almost 30 years ago


Indeed, thanks.

attachment is the new draft patch, pass x86-84 bootstrap
(actually it will not affect x86-64, because of addressing mode),
 
while failed on aarch64 bootstrap during stage2/3 binary comparision,

there must be some glitches needs to be fixed.

Will clean up later after I finished my upcoming long flight and what I
am seeking now is whether the general thoughts, code logic & framework, in this
patch is acceptable to the community?

personally, I think this version is much better than previous version.
new added code integrated with existed code in a more natural way. no use
of unsafe REG_NOTE info which is not updated in this pass.

and from AArch64 gcc bootstrap, 239 new loop invariant found by this
re-shuffling. so, this small improvement on rtl loop invariant pass do have
some value.

please review and give comments.

Thanks.

2015-02-11  Jiong Wang  

gcc/
  * loop-invariant.c (find_defs): Enable DF_DU_CHAIN build.
  (vfp_const_iv): New hash table.
  (expensive_addr_check_p): New boolean.
  (init_inv_motion_data): Initialize new variables.
  (free_inv_motion_data): Release hash table.
  (create_new_invariant): Set cheap_address to false for iv in
  vfp_const_iv table.
  (find_invariant_insn): Skip dependencies check for iv in vfp_const_iv table.
  (use_for_single_du): New function.
  (reshuffle_insn_with_vfp): Likewise.
  (find_invariants_bb): Call reshuffle_insn_with_vfp.
 
gcc/testsuite/

  * gcc.dg/pr62173.c: New testcase.
diff --git a/gcc/loop-invariant.c b/gcc/loop-invariant.c
index f79b497..b1c4760 100644
--- a/gcc/loop-invariant.c
+++ b/gcc/loop-invariant.c
@@ -203,6 +203,8 @@ typedef struct invariant *invariant_p;
 /* The invariants.  */
 
 static vec invariants;
+static hash_table  > *vfp_const_iv;
+static bool need_expensive_addr_check_p;
 
 /* Check the size of the invariant table and realloc if necessary.  */
 
@@ -695,7 +697,7 @@ find_defs (struct loop *loop)
 
   df_remove_problem (df_chain);
   df_process_deferred_rescans ();
-  df_chain_add_problem (DF_UD_CHAIN);
+  df_chain_add_problem (DF_UD_CHAIN + DF_DU_CHAIN);
   df_set_flags (DF_RD_PRUNE_DEAD_DEFS);
   df_analyze_loop (loop);
   check_invariant_table_size ();
@@ -742,6 +744,9 @@ create_new_invariant (struct def *def, rtx_insn *insn, bitmap depends_on,
 	 See http://gcc.gnu.org/ml/gcc-patches/2009-10/msg01210.html .  */
   inv->cheap_address = address_cost (SET_SRC (set), word_mode,
 	 ADDR_SPACE_GENERIC, speed) < 3;
+
+  if (need_expensive_addr_check_p && vfp_const_iv->find (insn))
+	inv->cheap_address = false;
 }
   else
 {
@@ -952,7 +957,8 @@ find_invariant_insn (rtx_insn *insn, bool always_reached, bool always_executed)
 return;
 
   depends_on = BITMAP_ALLOC (NULL);
-  if (!check_dependencies (insn, depends_on))
+  if (!vfp_const_iv->find (insn)
+  && !check_dependencies (insn, depends_on))
 {
   BITMAP_FREE (depends_on);
   return;
@@ -1007,6 +1013,143 @@ find_invariants_insn (rtx_insn *insn, bool always_reached, bool always_executed)
   record_uses (insn);
 }

+/*  Find the single use of def of insn. At the same time,
+make sure def of insn is the single def which reach the use.  */
+
+static rtx_insn *
+use_for_single_du (rtx_insn *insn)
+{
+  struct df_insn_info *insn_info = DF_INSN_INFO_GET (insn);
+  df_ref def;
+
+  FOR_EACH_INSN_INFO_DEF (def, insn_info)
+{
+  struct df_link *chain = DF_REF_CHAIN (def);
+
+  if (!chain
+	  || chain->next)
+	return NULL;
+
+  df_ref use = chain->ref;
+  chain = DF_REF_CHAIN (use);
+
+  if (!chain
+	  || chain->next)
+	return NULL;
+
+  return DF

Re: [PATCH] Fix PR64764 && Fix scan-tree-dump for scop-19.c

2015-02-11 Thread Tom de Vries


On 11-02-15 10:22, Dominique d'Humières wrote:



Le 10 févr. 2015 à 12:42, Tom de Vries  a écrit :

I think we need to understand first what's going on.


Sure, my patch was mainly to silence the failures on my working tree.


In both test-cases, on Linux with -fpic the inlining of one funct ion into the 
other is not done, because we cannot be sure the function call in one function 
binds to the other function at runtime.


I don’t understand why -fpic should change the inlining decision (except 
register pressure and so on).



It's about symbol binding, not inlining heuristics.


So, is the inlining happening with -fpic for Darwin? If so, is that correct?


The inlining is happening for darwin. I don’t know if that is correct or not.


I found an explanation.

darwin.c:
...
/* Cross-module name binding.  Darwin does not support overriding
   functions at dynamic-link time, except for vtables in kexts.  */

bool
darwin_binds_local_p (const_tree decl)
{
  return default_binds_local_p_1 (decl,
  TARGET_KEXTABI && DARWIN_VTABLE_P (decl));
}
...

i386.c:
...
#if TARGET_MACHO
#undef TARGET_BINDS_LOCAL_P
#define TARGET_BINDS_LOCAL_P darwin_binds_local_p
#endif
...

This is probably the cause for the difference. I guess that means your patch is 
ok.

Thanks,
- Tom

Re: Fix i386 wrong code issues introduced by the local flag fix

2015-02-11 Thread Jan Hubicka

Hi,
this patch fixes ICE when building firefox reported by Markus:
> 
> lto1: internal compiler error: Segmentation fault
> 0x106e1feb crash_signal
> ../../gcc/gcc/toplev.c:383
> 0x10589cd4 lto_get_decl_name_mapping(lto_file_decl_data*, char const*)
> ../../gcc/gcc/lto-section-in.c:369
> 0x1057bb6b copy_function_or_variable
> ../../gcc/gcc/lto-streamer-out.c:2194
> 0x1057df9b lto_output()
> ../../gcc/gcc/lto-streamer-out.c:2289
> 0x105edd87 write_lto
> ../../gcc/gcc/passes.c:2405
> 0x105f3073 ipa_write_optimization_summaries(lto_symtab_encoder_d*)
> ../../gcc/gcc/passes.c:2607
> 0x1018f6ab do_stream_out
> ../../gcc/gcc/lto/lto.c:2480
> 0x10190a33 stream_out
> ../../gcc/gcc/lto/lto.c:2544
> 0x10190a33 lto_wpa_write_files
> ../../gcc/gcc/lto/lto.c:2661
> 0x1019c0a7 do_whole_program_analysis
> ../../gcc/gcc/lto/lto.c:3331
> 0x1019c0a7 lto_main()
> ../../gcc/gcc/lto/lto.c:3451
> 

I tested it on Firefox, will commit after bootstrap®testing finishes (x86_64)

Honza

* ipa.c (symbol_table::remove_unreachable_nodes): Avoid releasing
bodies of thunks; comment on why.
* symtab.c (symtab_node::get_partitioning_class): Aliases of extern
symbols are extern.
Index: ipa.c
===
--- ipa.c   (revision 220608)
+++ ipa.c   (working copy)
@@ -537,7 +537,13 @@ symbol_table::remove_unreachable_nodes (
   /* If node is unreachable, remove its body.  */
   else if (!reachable.contains (node))
 {
- if (!body_needed_for_clonning.contains (node->decl))
+ /* We keep definitions of thunks and aliases in the boundary so
+we can walk to the ultimate alias targets and function symbols
+reliably.  */
+ if (node->alias || node->thunk.thunk_p)
+   ;
+ else if (!body_needed_for_clonning.contains (node->decl)
+ && !node->alias && !node->thunk.thunk_p)
node->release_body ();
  else if (!node->clone_of)
gcc_assert (in_lto_p || DECL_RESULT (node->decl));
Index: symtab.c
===
--- symtab.c(revision 220608)
+++ symtab.c(working copy)
@@ -1779,6 +1779,8 @@ symtab_node::get_partitioning_class (voi
 
   if (varpool_node *vnode = dyn_cast  (this))
 {
+  if (alias && definition && !ultimate_alias_target ()->definition)
+   return SYMBOL_EXTERNAL;
   /* Constant pool references use local symbol names that can not
  be promoted global.  We should never put into a constant pool
  objects that can not be duplicated across partitions.  */
@@ -1790,7 +1792,7 @@ symtab_node::get_partitioning_class (voi
  Handle them as external; compute_ltrans_boundary take care to make
  proper things to happen (i.e. to make them appear in the boundary but
  with body streamed, so clone can me materialized).  */
-  else if (!dyn_cast  (this)->definition)
+  else if (!dyn_cast  (this)->function_symbol ()->definition)
 return SYMBOL_EXTERNAL;
 
   /* Linker discardable symbols are duplicated to every use unless they are

Re: C++ Patch for c++/60894

2015-02-11 Thread Paolo Carlini


Hi,

On 12/01/2014 09:59 PM, Jason Merrill wrote:

On 12/01/2014 07:01 AM, Fabien Chêne wrote:

2014-11-03 21:18 GMT+01:00 Fabien Chêne :

2014-10-09 15:34 GMT+02:00 Jason Merrill :
[...]

If the USING_DECL is returned, the code below will be rejected as
expected, but the error message will not mention the line where the
USING_DECL appears as the previous definition, but at the target
declaration of the USING_DECL instead.



I think that's what happens if you strip the USING_DECL and return 
what it
points to; if you return the USING_DECL itself that shouldn't 
happen (though

then the caller needs to be able to deal with getting a USING_DECL).


[Sorry for the delay] Humm, l_a_c_t returns a TYPE upon success, shall
I change it and return a DECL instead ?


Ping. Before I made the change, I'd like to be sure this is what you
have in mind.


I think that makes sense.


Today I was having a look to this pending issue and went astray due to 
the broken thread: I wondered if, basing on Fabien' first try and the 
comments accompanying tag_scope, it would make sense to use 
strip_using_decl only when the scope is ts_global (or maybe != 
ts_current)?!? The below certainly passes testing. Unless of course 
Fabien has the above implemented and ready (I don't, I miss the correct 
changes to the lookup_and_check_tag callers)


Thanks,
Paolo.

/
Index: decl.c
===
--- decl.c  (revision 220611)
+++ decl.c  (working copy)
@@ -12207,6 +12207,9 @@ lookup_and_check_tag (enum tag_types tag_code, tre
  && DECL_TEMPLATE_TEMPLATE_PARM_P (decl
 decl = DECL_TEMPLATE_RESULT (decl);
 
+  if (scope == ts_global)
+decl = strip_using_decl (decl);
+
   if (decl && TREE_CODE (decl) == TYPE_DECL)
 {
   /* Look for invalid nested type:

Re: [PATCH][AArch64] Testcase fix for __ATOMIC_CONSUME

2015-02-11 Thread Torvald Riegel

On Mon, 2015-02-09 at 09:10 -0800, Mike Stump wrote:
> On Feb 9, 2015, at 7:11 AM, Alex Velenko  wrote:
> > The following patch makes atomic-op-consume.c XFAIL
> > 
> > Is this patch ok?
> 
> Ok.
> 
> I’d shorten the comment above the xfail to be exceedingly short:
> 
>   /* PR59448 consume not implemented yet */
> 
> The reason is the brain can process this about 8x faster.  Also, one can cut 
> and paste the PR part into a web browser directly, or, if you have an 
> electric bugzilla mode for emacs, it will pop right up. */

Given the discussions we had in ISO C++ SG1, it seems the only way to
fix memory_order_consume is to deprecate it (or let it rot forever), and
add something else to the standard.  See
http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2014/n4321.pdf

IOW, I believe the promotion is here to stay.  I'm not aware of any
other implementation doing something else.

Thus, XFAIL doesn't seem right to me.

Re: [PATCH] Fix ubsan and C++14 constexpr ICEs (PR sanitizer/63956)

2015-02-11 Thread Marek Polacek

On Wed, Feb 11, 2015 at 09:44:24AM -0800, Mike Stump wrote:
> On Feb 11, 2015, at 4:24 AM, Marek Polacek  wrote:
> > 
> > The following patch splits the test into C and C++ test cases, so
> > hopefully fixing the issue.  Ok for trunk?
> > 
> > 2015-02-11  Marek Polacek  
> > 
> > * g++.dg/ubsan/shift-1.C: New test.
> > * gcc.dg/ubsan/c-shift-2.c: New test.
> > * c-c++-common/ubsan/shift-5.c: Remove file.
> > 
> > diff --git gcc/testsuite/g++.dg/ubsan/shift-1.C 
> > gcc/testsuite/g++.dg/ubsan/shift-1.C
> > index e69de29..8a71279 100644
> > --- gcc/testsuite/g++.dg/ubsan/shift-1.C
> > +++ gcc/testsuite/g++.dg/ubsan/shift-1.C
> > @@ -0,0 +1,37 @@
> > +/* { dg-do compile } */
> > +/* { dg-options "-fsanitize=shift -w" } */
> > +/* { dg-shouldfail "ubsan" } */
> > +
> > +int
> > +foo (int x)
> > +{
> > +  /* None of the following should pass.  */
> > +  switch (x)
> > +{
> > +case 1 >> -1:
> > +/* { dg-error "is not a constant expression" "" { xfail { *-*-* } } 11 } */
> 
> Never include line numbers, unless there is no other way.  Here, I think you 
> can drop it, and merely ensure this is on the right line?
> 
> An example from gcc.dg:
> 
> int g2(int a; __attribute__((unused))); /* { dg-error "just a forward 
> declaration" "no parms" { xfail *-*-* } } */
> 
> I’m hoping that style will work.

Yeah, it works, I don't know why I changed it for the C test case, but
not for the C++ one.

> Ok with the above fixes, if applicable.

Thanks.


2015-02-11  Marek Polacek  

* g++.dg/ubsan/shift-1.C: New test.
* gcc.dg/ubsan/c-shift-2.c: New test.
* c-c++-common/ubsan/shift-5.c: Remove file.

diff --git gcc/testsuite/g++.dg/ubsan/shift-1.C 
gcc/testsuite/g++.dg/ubsan/shift-1.C
index e69de29..05e049e 100644
--- gcc/testsuite/g++.dg/ubsan/shift-1.C
+++ gcc/testsuite/g++.dg/ubsan/shift-1.C
@@ -0,0 +1,31 @@
+/* { dg-do compile } */
+/* { dg-options "-fsanitize=shift -w" } */
+/* { dg-shouldfail "ubsan" } */
+
+int
+foo (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case 1 >> -1: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+case -1 >> -1: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+case 1 << -1: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+case -1 << -1: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+  return 1;
+}
+  return 0;
+}
+
+int
+bar (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case -1 >> 200: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+case 1 << 200: /* { dg-error "is not a constant expression" "" { xfail { 
*-*-* } } } */
+  return 1;
+}
+  return 0;
+}
diff --git gcc/testsuite/gcc.dg/ubsan/c-shift-2.c 
gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
index e69de29..beb0dbe 100644
--- gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
+++ gcc/testsuite/gcc.dg/ubsan/c-shift-2.c
@@ -0,0 +1,31 @@
+/* { dg-do compile } */
+/* { dg-options "-fsanitize=shift -w" } */
+/* { dg-shouldfail "ubsan" } */
+
+int
+foo (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case 1 >> -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case -1 >> -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case 1 << -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case -1 << -1: /* { dg-error "case label does not reduce to an integer 
constant" } */
+  return 1;
+}
+  return 0;
+}
+
+int
+bar (int x)
+{
+  /* None of the following should pass.  */
+  switch (x)
+{
+case -1 >> 200: /* { dg-error "case label does not reduce to an integer 
constant" } */
+case 1 << 200: /* { dg-error "case label does not reduce to an integer 
constant" } */
+  return 1;
+}
+  return 0;
+}

Marek

[PATCH][PR target/63347][P3 Regression] Fix SCHED_GROUP_P handling in haifa-sched.c

2015-02-11 Thread Jeff Law



PR 63347 is a case where we can end up scheduling an insn into the 
middle of a scheduling group.


Basically given two consecutive insns A & B, if B has SCHED_GROUP_P set, 
then it must always be immediately after A.


The problem we ran into in this PR was that there was a cost to issue B, 
so it got delayed several cycles.  While B was delayed an earlier, 
unrelated insn Z which had been delayed earlier became ready and issued. 
 The result being A Z B which ultimately led to incorrect code generation.


The fix is actually quite simple.  In prune_ready_list, we queue insns 
that have had their dependencies satisfied, but have a cost (say for 
example, they're waiting for a functional unit to become available).  We 
want to queue based on that cost, with one exception.  If the insn to be 
queued is part of a scheduling group, then we want to queue it with cost 1.


That ensures that on each cycle, any SCHED_GROUP_P insn that has its 
dependencies satisfied is in the ready list.  Once we do that, 
everything is in place to assure that nothing issues until after the 
SCHED_GROUP_P insn issues.


Bootstrapped and regression tested on x86_64-unknown-linux-gnu.  Also 
verified the testcase works correctly on m68k-unknown-linux.  Installed 
on the trunk.






commit 07e421b3fbc2d29c9636200906c3edf3cbd33499
Author: Jeff Law 
Date:   Wed Feb 11 16:19:05 2015 -0700

PR target/63347
* haifa-sched.c (prune_ready_list): If we have a SCHED_GROUP_P insn
that needs to be queued, just queue it for a single cycle.

PR target/63347
* gcc.target/m68k/pr63347.c: New test.

diff --git a/gcc/ChangeLog b/gcc/ChangeLog
index d895209..c9ac045 100644
--- a/gcc/ChangeLog
+++ b/gcc/ChangeLog
@@ -1,3 +1,9 @@
+2015-02-11  Jeff Law  
+
+   PR target/63347
+   * haifa-sched.c (prune_ready_list): If we have a SCHED_GROUP_P insn
+   that needs to be queued, just queue it for a single cycle.
+
 2015-02-11  Jan Hubicka  
 
* ipa.c (symbol_table::remove_unreachable_nodes): Avoid releasing
diff --git a/gcc/haifa-sched.c b/gcc/haifa-sched.c
index 75d2421..64c8c9c1 100644
--- a/gcc/haifa-sched.c
+++ b/gcc/haifa-sched.c
@@ -6291,7 +6291,15 @@ prune_ready_list (state_t temp_state, bool 
first_cycle_insn_p,
  if (SCHED_GROUP_P (insn) && cost > min_cost_group)
min_cost_group = cost;
  ready_remove (&ready, i);
- queue_insn (insn, cost, reason);
+ /* Normally we'd want to queue INSN for COST cycles.  However,
+if SCHED_GROUP_P is set, then we must ensure that nothing
+else comes between INSN and its predecessor.  If there is
+some other insn ready to fire on the next cycle, then that
+invariant would be broken.
+
+So when SCHED_GROUP_P is set, just queue this insn for a
+single cycle.  */
+ queue_insn (insn, SCHED_GROUP_P (insn) ? 1 : cost, reason);
  if (i + 1 < n)
break;
}
diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
index c62cc23..ee5be51 100644
--- a/gcc/testsuite/ChangeLog
+++ b/gcc/testsuite/ChangeLog
@@ -1,3 +1,8 @@
+2015-02-11  Jeff Law  
+
+   PR target/63347
+   * gcc.target/m68k/pr63347.c: New test.
+
 2015-02-11  Marek Polacek  
 
* g++.dg/ubsan/shift-1.C: New test.
diff --git a/gcc/testsuite/gcc.target/m68k/pr63347.c 
b/gcc/testsuite/gcc.target/m68k/pr63347.c
new file mode 100644
index 000..1d23e9a
--- /dev/null
+++ b/gcc/testsuite/gcc.target/m68k/pr63347.c
@@ -0,0 +1,46 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -mcpu=5208" } */
+
+#include 
+
+void __attribute__ ((noinline))
+oof()
+{
+  asm volatile ("" ::: "memory");
+}
+int print_info(unsigned int *ip_addr)
+{
+int invalid = 0;
+
+if (ip_addr) {
+unsigned int haddr = *ip_addr;
+oof("stuff");
+if (0x0 == haddr) {
+invalid = 1;
+}
+oof("stuff2");
+} else {
+invalid = 1;
+}
+
+return invalid;
+}
+
+int main(int argc, char *argv[])
+{
+unsigned int myaddr;
+int ret;
+
+myaddr = 0x0;
+ret = print_info(&myaddr);
+if (!ret)
+abort ();
+
+myaddr = 0x01020304;
+ret = print_info(&myaddr);
+if (ret)
+abort ();
+exit (0);
+}
+
+

RE: [PATCH, FT32] initial support

2015-02-11 Thread James Bowman

> > +@table @gcctabopt
> > +
> > +@item -mspace
> > +@opindex mspace
> > +Enable code-size optimizations.
> > +Some of these optimizations incur a minor performance penalty.
> 
> We already have -Os, so why is an architecture-specific option for this 
> needed?

Code compiled with -mspace is somewhat slower than code without.
So we typically build *all* code with -Os, with everything
non-critical also compiled -mspace.

Is this a legitimate option or should I just use -Os?

--
James Bowman
FTDI Open Source Liaison

Re: [RFC testsuite] Fix PR64850, tweak acc_on_device* tests

2015-02-11 Thread Kaz Kojima

Thomas Schwinge  wrote:
> To resolve the immediate problem: is my approval "enough" for Kaz to
> commit the patch, or does that need a "more authoritative approval"?

I'd like to commit my patch as a "quick fix" in a few days if
no one objects.

> I think I copied this approach from some other test case (but don't
> remember which).  In the current set of tests, we need to verify that the
> acc_on_device library function is called 0, 1, or 4 times (see below).
> 
> For example, for gcc.dg/goacc/acc_on_device-1.c we'Ve got:
> 
> $ build-gcc/gcc/xgcc -Bbuild-gcc/gcc/ 
> source-gcc/gcc/testsuite/gcc.dg/goacc/acc_on_device-1.c -fopenacc -O -std=c89 
> -Wno-implicit-function-declaration -S -fpic -mcmodel=large -o 
> acc_on_device-1.s
> $ grep acc_on_device < acc_on_device-1.s
> .file   "acc_on_device-1.c"
> movabsq $acc_on_device@PLTOFF, %rdx
> movabsq $acc_on_device@PLTOFF, %rdx
> movabsq $acc_on_device@PLTOFF, %rdx
> movabsq $acc_on_device@PLTOFF, %rdx
> 
> Isn't it even more fragile to scan here for acc_on_device being called
> four times compared to the -fdump-rtl-expand dump?  Or should I split up
> the four tests into four separate files?  (I guess I lack knowledge of
> the best approach for doing such a thing in the GCC testsuite.)

Another example with the asm output of m68k compiler:

lea acc_on_device,%a2
jsr (%a2)
...
jsr (%a2)
...
jsr (%a2)
...
jsr (%a2)

for -fopenacc -O -fno-openacc acc_on_device-1.c.  It seems that
getting the number of calls for the specific function isn't easy
with the asm output on some targets.

Regards,
kaz

Re: Add CFLAGS_FOR_TARGET to Ada OS Constant Extraction Process

2015-02-11 Thread Joel Sherrill


On 2/11/2015 1:22 AM, Arnaud Charlet wrote:
>> When building Ada for RTEMS, you need to pass in the CFLAGS_FOR_TARGET
>> to see OS specific .h files. This was already done for the RTS but needed to
>> be added to the process which extracts value settings from the .h files.
>>
>> This patch is against 4.9.  OK to apply to 4.8, 4.9, and head assuming it
>> applies cleanly?
> This patch won't apply cleanly for head I think, where I suspect at least
> part of your patch is OBE.
OK. Rebuilding on the head.
> It would be better to first have a patch for trunk (if still relevant)
> and then backport it to the other branches, since changes on trunk have
> already been done in a different way to address at least partly this issue
> (e.g. use of GNATLIBCFLAGS_FOR_C in OSCONS_CPP, different computation of
> OSCONS_CC).
I am happy to do and from your description it sounds like the it would
have built
if the code had my previous patch to add CFLAGS_FOR_TARGET to the
GNATLIBCFLAGS.

GNATLIBCFLAGS_FOR_C = -W -Wall $(GNATLIBCFLAGS) $(CFLAGS_FOR_TARGET) \
-fexceptions -DIN_RTS -DHAVE_GETIPINFO

But the build didn't get far enough to know. It fails before that with:

gcc -c -g -O2  -gnatpg -gnata -W -Wall -nostdinc -I- -I. -Iada/generated
-Iada -I/users/joel/test-gcc/gcc/gcc/ada
-I/users/joel/test-gcc/gcc/gcc/ada/gcc-interface
/users/joel/test-gcc/gcc/gcc/ada/a-ioexce.ads -o ada/a-ioexce.o
a-ioexce.ads:21:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:22:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:23:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:24:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:25:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:26:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:27:19: violation of restriction "No_Elaboration_Code" at
system.ads:53
a-ioexce.ads:28:19: violation of restriction "No_Elaboration_Code" at
system.ads:53

Any ideas?

My native compiler is from the head back in August. I will rebuild it
overnight
and try again in the morning in case that's a factor.

> Arno

-- 
Joel Sherrill, Ph.D. Director of Research & Development
joel.sherr...@oarcorp.comOn-Line Applications Research
Ask me about RTEMS: a free RTOS  Huntsville AL 35805
Support Available(256) 722-9985

Re: [PATCH] PR rtl-optimization/32219: optimizer causees wrong code in pic/hidden/weak symbol checking

2015-02-11 Thread Richard Henderson

On 02/10/2015 01:19 PM, Richard Henderson wrote:
> As an existing issue, I'm not sure why "specified" visibility is any different
> from unspecified visibility.  As far as I'm aware, the "specified" bit simply
> means that the decl doesn't inherit inherit visibility from the class, or from
> the command-line.  But once we're this far, the visibility actually applied to
> the symbol should be all that matters.

The test is there to differentiate explicit visibility from that implied from
the command-line.  Without it, we assume hidden visibility for external symbols
too early, making the command-line option useless.  This is visible even in
building libgcc.

I believe this set of patches does what we want, and cleans things up a bit in
the process.


r~
From bf5d4f20368d1173a8d586f6bdd258b00b807e33 Mon Sep 17 00:00:00 2001
From: Richard Henderson 
Date: Wed, 11 Feb 2015 17:37:48 -0800
Subject: [PATCH 1/5] Replace local_p with direct returns

---
 gcc/varasm.c | 66 +++-
 1 file changed, 34 insertions(+), 32 deletions(-)

diff --git a/gcc/varasm.c b/gcc/varasm.c
index 3f62fca..83d9de3 100644
--- a/gcc/varasm.c
+++ b/gcc/varasm.c
@@ -6814,7 +6814,6 @@ default_binds_local_p (const_tree exp)
 bool
 default_binds_local_p_1 (const_tree exp, int shlib)
 {
-  bool local_p;
   bool resolved_locally = false;
   bool resolved_to_local_def = false;
 
@@ -6845,54 +6844,57 @@ default_binds_local_p_1 (const_tree exp, int shlib)
 
   /* A non-decl is an entry in the constant pool.  */
   if (!DECL_P (exp))
-local_p = true;
+return true;
+
   /* Weakrefs may not bind locally, even though the weakref itself is always
  static and therefore local.  Similarly, the resolver for ifunc functions
  might resolve to a non-local function.
  FIXME: We can resolve the weakref case more curefuly by looking at the
  weakref alias.  */
-  else if (lookup_attribute ("weakref", DECL_ATTRIBUTES (exp))
+  if (lookup_attribute ("weakref", DECL_ATTRIBUTES (exp))
 	   || (TREE_CODE (exp) == FUNCTION_DECL
 	   && lookup_attribute ("ifunc", DECL_ATTRIBUTES (exp
-local_p = false;
+return false;
+
   /* Static variables are always local.  */
-  else if (! TREE_PUBLIC (exp))
-local_p = true;
-  /* A variable is local if the user has said explicitly that it will
- be.  */
-  else if ((DECL_VISIBILITY_SPECIFIED (exp)
-	|| resolved_to_local_def)
-	   && DECL_VISIBILITY (exp) != VISIBILITY_DEFAULT)
-local_p = true;
+  if (! TREE_PUBLIC (exp))
+return true;
+
+  /* A variable is local if the user has said explicitly that it will be.  */
+  if ((DECL_VISIBILITY_SPECIFIED (exp) || resolved_to_local_def)
+  && DECL_VISIBILITY (exp) != VISIBILITY_DEFAULT)
+return true;
+
   /* Variables defined outside this object might not be local.  */
-  else if (DECL_EXTERNAL (exp) && !resolved_locally)
-local_p = false;
-  /* If defined in this object and visibility is not default, must be
- local.  */
-  else if (DECL_VISIBILITY (exp) != VISIBILITY_DEFAULT)
-local_p = true;
+  if (DECL_EXTERNAL (exp) && !resolved_locally)
+return false;
+
+  /* If defined in this object and visibility is not default,
+ must be local.  */
+  if (DECL_VISIBILITY (exp) != VISIBILITY_DEFAULT)
+return true;
+
   /* Default visibility weak data can be overridden by a strong symbol
  in another module and so are not local.  */
-  else if (DECL_WEAK (exp)
-	   && !resolved_locally)
-local_p = false;
+  if (DECL_WEAK (exp) && !resolved_locally)
+return false;
+
   /* If PIC, then assume that any global name can be overridden by
  symbols resolved from other modules.  */
-  else if (shlib)
-local_p = false;
+  if (shlib)
+return false;
+
   /* Uninitialized COMMON variable may be unified with symbols
  resolved from other modules.  */
-  else if (DECL_COMMON (exp)
-	   && !resolved_locally
-	   && (DECL_INITIAL (exp) == NULL
-	   || (!in_lto_p && DECL_INITIAL (exp) == error_mark_node)))
-local_p = false;
+  if (DECL_COMMON (exp)
+  && !resolved_locally
+  && (DECL_INITIAL (exp) == NULL
+	  || (!in_lto_p && DECL_INITIAL (exp) == error_mark_node)))
+return false;
+
   /* Otherwise we're left with initialized (or non-common) global data
  which is of necessity defined locally.  */
-  else
-local_p = true;
-
-  return local_p;
+  return true;
 }
 
 /* Return true when references to DECL must bind to current definition in
-- 
2.1.0

From 0b1ac37c688093d720a99b5ae446602a0c1120b3 Mon Sep 17 00:00:00 2001
From: Richard Henderson 
Date: Wed, 11 Feb 2015 19:27:14 -0800
Subject: [PATCH 2/5] Use symtab_node to unify var and function handling

Delay that until after static symbols are eliminated.
---
 gcc/varasm.c | 59 +++
 1 file changed, 19 insertions(+), 40 deletions(-)

diff --git a/gcc/varasm.c b/gcc/varasm.c
index 83d9de3..dcc70e3 100644
--- a/gcc/

Re: [RFA][PATCH][PR rtl-optimization/47477] Type narrowing in match.pd

2015-02-11 Thread Bin.Cheng

On Wed, Feb 11, 2015 at 4:55 AM, Jeff Law  wrote:
>
> This PR was originally minor issue where we regressed on this kind of
> sequence:
>
> typedef struct toto_s *toto_t;
> toto_t add (toto_t a, toto_t b) {
>   int64_t tmp = (int64_t)(intptr_t)a + ((int64_t)(intptr_t)b&~1L);
>   return (toto_t)(intptr_t) tmp;
> }
>
>
> There was talk of trying to peephole this in the x86 backend.  But later
> Jakub speculated that if we had good type narrowing this could be done in
> the tree optimizers...
>
> S, here we go.  I didn't do anything with logicals are those are already
> handled elsewhere in match.pd.  I didn't try to handle MULT as in the early
> experiments I did, it was a lose because of the existing mechanisms for
> widening multiplications.
>
> Interestingly enough, this patch seems to help out libjava more than
> anything else in a GCC build and it really only helps a few routines. There
> weren't any routines I could see where the code regressed after this patch.
> This is probably an indicator that these things aren't *that* common, or the
> existing shortening code better than we thought, or some important
> shortening case is missing.

Cool that we are trying to simplify type conversion using generic
match facility.  I have thought about type promotion in match.pd too.
For example, (unsigned long long)(unsigned long)(int_expr), if we can
prove int_expr is always positive (in my case, this is from vrp
information), then the first conversion can be saved.  This is another
way for (and related? I didn't look at the code) the sign/zero
extension elimination work using VRP I suppose?

Thanks,
bin
>
>
> I think we should pull the other tests from 47477 which are not regressions
> out into their own bug for future work.  Or alternately, when this fix is
> checked in remove the regression marker in 47477.
>
>
> Bootstrapped and regression tested on x86_64-unknown-linux-gnu.  OK for the
> trunk?
>
>
>
>
>
>
>
>
> diff --git a/gcc/ChangeLog b/gcc/ChangeLog
> index 7f3816c..7a95029 100644
> --- a/gcc/ChangeLog
> +++ b/gcc/ChangeLog
> @@ -1,3 +1,8 @@
> +2015-02-10  Jeff Law  
> +
> +   * match.pd (convert (plus/minus (convert @0) (convert @1): New
> +   simplifier to narrow arithmetic.
> +
>  2015-02-10  Richard Biener  
>
> PR tree-optimization/64909
> diff --git a/gcc/match.pd b/gcc/match.pd
> index 81c4ee6..abc703e 100644
> --- a/gcc/match.pd
> +++ b/gcc/match.pd
> @@ -1018,3 +1018,21 @@ along with GCC; see the file COPYING3.  If not see
> (logs (pows @0 @1))
> (mult @1 (logs @0)
>
> +/* If we have a narrowing conversion of an arithmetic operation where
> +   both operands are widening conversions from the same type as the outer
> +   narrowing conversion.  Then convert the innermost operands to a suitable
> +   unsigned type (to avoid introducing undefined behaviour), perform the
> +   operation and convert the result to the desired type.
> +
> +   This narrows the arithmetic operation.  */
> +(for op (plus minus)
> +  (simplify
> +(convert (op (convert@2 @0) (convert @1)))
> +(if (TREE_TYPE (@0) == TREE_TYPE (@1)
> + && TREE_TYPE (@0) == type
> + && INTEGRAL_TYPE_P (type)
> + && TYPE_PRECISION (TREE_TYPE (@2)) > TYPE_PRECISION (TREE_TYPE
> (@0))
> +/* This prevents infinite recursion.  */
> +&& unsigned_type_for (TREE_TYPE (@0)) != TREE_TYPE (@2))
> +  (with { tree utype = unsigned_type_for (TREE_TYPE (@0)); }
> +(convert (op (convert:utype @0) (convert:utype @1)))
> diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
> index 15d5e2d..76e5254 100644
> --- a/gcc/testsuite/ChangeLog
> +++ b/gcc/testsuite/ChangeLog
> @@ -1,3 +1,8 @@
> +2015-02-10  Jeff Law  
> +
> +   PR rtl-optimization/47477
> +   * gcc.dg/tree-ssa/narrow-arith-1.c: New test.
> +
>  2015-02-10  Richard Biener  
>
> PR tree-optimization/64909
> diff --git a/gcc/testsuite/gcc.dg/tree-ssa/narrow-arith-1.c
> b/gcc/testsuite/gcc.dg/tree-ssa/narrow-arith-1.c
> new file mode 100644
> index 000..104cb6f5
> --- /dev/null
> +++ b/gcc/testsuite/gcc.dg/tree-ssa/narrow-arith-1.c
> @@ -0,0 +1,22 @@
> +/* PR tree-optimization/47477 */
> +/* { dg-do compile } */
> +/* { dg-options "-O2 -fdump-tree-optimized -w" } */
> +/* { dg-require-effective-target ilp32 } */
> +
> +typedef int int64_t __attribute__ ((__mode__ (__DI__)));
> +typedef int * intptr_t;
> +
> +typedef struct toto_s *toto_t;
> +toto_t add (toto_t a, toto_t b) {
> +  int64_t tmp = (int64_t)(intptr_t)a + ((int64_t)(intptr_t)b&~1L);
> +  return (toto_t)(intptr_t) tmp;
> +}
> +
> +/* For an ILP32 target there'll be 6 casts when we start, but just 4
> +   if the match.pd pattern is successfully matched.  */
> +/* { dg-final { scan-tree-dump-times "= \\(int\\)" 1 "optimized" } } */
> +/* { dg-final { scan-tree-dump-times "= \\(unsigned int\\)" 2 "optimized" }
> } */
> +/* { dg-final { scan-tree-dump-times "= \\(struct toto_s \\*\\)" 1
> "optimized" } } */
> +/* { dg-final {

Re: [PATCH PR64705]Don't aggressively expand induction variable's base

2015-02-11 Thread Bin.Cheng

On Wed, Feb 11, 2015 at 7:24 PM, Richard Biener
 wrote:
> On Wed, Feb 11, 2015 at 9:23 AM, Bin.Cheng  wrote:
>> On Tue, Feb 10, 2015 at 12:24 AM, Richard Biener
>>  wrote:
>>
>> Previously, the computation of _1174 can be replaced by _629 in bb8 in
>> DOM2 pass, while it can't after patching.  This is the only possible
>> regression that I can see on TREE level.  There is another difference
>> but not regression on TREE.  Seems real change happens on RTL pre with
>> different register uses in the input.  I failed to go further or
>> extract a test case, it's just very volatile.
>
> Well, I can see what is happening and indeed we shouldn't blame the
> patch for this.
>
>> With all of this, I guess this patch shouldn't be blamed for this.  I
>> also wonder if the PR should be fixed in this way since the patch
>> definitely is a corner case.
>
> It might not fix the real issue (whatever that is), but not making
> IVOPTs (or tree-affines) life harder is very good (I believe I have
> seen this kind of issue as well).
I guess IV's base is expanded because we want to explore more CSE opportunities?
I suspect this doesn't work very well because of two reasons: 1)
overload the tree-affine facility; 2) weak IV rewriting capacity in
GCC (for example, mess up loop variant/invariant part expressions).  I
will do experiments on this.

As for the patch itself, I collected instrumental data from GCC
bootstrap and Spec2k6 compilation.  Can confirm in most cases
(bootstrap 99.9%, spec2k6 99.1%), there is only one ssa name in IV's
step.

>
> So I do think that the patch is fine.  Just seen the known-to-work GCC 3.4
> version so it's even a regression 

Here is the refined patch according to your comments.  It passes
bootstrap and test on x86_64.

Thanks,
bin

2015-02-12  Bin Cheng  

PR tree-optimization/64705
* tree-ssa-loop-niter.h (expand_simple_operations): New parameter.
* tree-ssa-loop-niter.c (expand_simple_operations): New parameter.
* tree-ssa-loop-ivopts.c (extract_single_var_from_expr): New.
(find_bivs, find_givs_in_stmt_scev): Pass new argument to
expand_simple_operations.

gcc/testsuite/ChangeLog
2015-02-12  Bin Cheng  

PR tree-optimization/64705
* gcc.dg/tree-ssa/pr64705.c: New test.
Index: gcc/tree-ssa-loop-niter.c
===
--- gcc/tree-ssa-loop-niter.c   (revision 219574)
+++ gcc/tree-ssa-loop-niter.c   (working copy)
@@ -1552,10 +1552,11 @@ simplify_replace_tree (tree expr, tree old, tree n
 }
 
 /* Expand definitions of ssa names in EXPR as long as they are simple
-   enough, and return the new expression.  */
+   enough, and return the new expression.  If STOP is specified, stop
+   expanding if EXPR equals to it.  */
 
 tree
-expand_simple_operations (tree expr)
+expand_simple_operations (tree expr, tree stop)
 {
   unsigned i, n;
   tree ret = NULL_TREE, e, ee, e1;
@@ -1575,7 +1576,7 @@ tree
   for (i = 0; i < n; i++)
{
  e = TREE_OPERAND (expr, i);
- ee = expand_simple_operations (e);
+ ee = expand_simple_operations (e, stop);
  if (e == ee)
continue;
 
@@ -1594,7 +1595,8 @@ tree
   return ret;
 }
 
-  if (TREE_CODE (expr) != SSA_NAME)
+  /* Stop if it's not ssa name or the one we don't want to expand.  */
+  if (TREE_CODE (expr) != SSA_NAME || expr == stop)
 return expr;
 
   stmt = SSA_NAME_DEF_STMT (expr);
@@ -1614,7 +1616,7 @@ tree
  && src->loop_father != dest->loop_father)
return expr;
 
-  return expand_simple_operations (e);
+  return expand_simple_operations (e, stop);
 }
   if (gimple_code (stmt) != GIMPLE_ASSIGN)
 return expr;
@@ -1634,7 +1636,7 @@ tree
return e;
 
   if (code == SSA_NAME)
-   return expand_simple_operations (e);
+   return expand_simple_operations (e, stop);
 
   return expr;
 }
@@ -1643,7 +1645,7 @@ tree
 {
 CASE_CONVERT:
   /* Casts are simple.  */
-  ee = expand_simple_operations (e);
+  ee = expand_simple_operations (e, stop);
   return fold_build1 (code, TREE_TYPE (expr), ee);
 
 case PLUS_EXPR:
@@ -1658,7 +1660,7 @@ tree
   if (!is_gimple_min_invariant (e1))
return expr;
 
-  ee = expand_simple_operations (e);
+  ee = expand_simple_operations (e, stop);
   return fold_build2 (code, TREE_TYPE (expr), ee, e1);
 
 default:
Index: gcc/tree-ssa-loop-niter.h
===
--- gcc/tree-ssa-loop-niter.h   (revision 219574)
+++ gcc/tree-ssa-loop-niter.h   (working copy)
@@ -20,7 +20,7 @@ along with GCC; see the file COPYING3.  If not see
 #ifndef GCC_TREE_SSA_LOOP_NITER_H
 #define GCC_TREE_SSA_LOOP_NITER_H
 
-extern tree expand_simple_operations (tree);
+extern tree expand_simple_operations (tree, tree = NULL);
 extern bool loop_only_exit_p (const struct loop *, const_edge);
 extern bool number_of_iterations_exit (struct loop *, edge,

[PATCH] Fix rotate discovery in fold-const.c (PR tree-optimization/65014)

2015-02-11 Thread Jakub Jelinek

Hi!

We pass the values after STRIP_NOPS to *ROTATE_EXPR build, which is
undesirable, as the testcase shows, the operand could very well end up being
a pointer rather than integer.
This patch fixes it by passing the original trees instead.
Alternatively we could fold_convert the STRIP_NOPS result to the original
type, but I'd be afraid that in the common case that would just create more
GC garbage.

Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?

2015-02-12  Jakub Jelinek  

PR tree-optimization/65014
* fold-const.c (fold_binary_loc): When creating {L,R}ROTATE_EXPR,
use original second operand of arg0 or arg1 instead of
that adjusted by STRIP_NOPS.

* gcc.c-torture/compile/pr65014.c: New test.

--- gcc/fold-const.c.jj 2015-01-22 19:30:59.0 +0100
+++ gcc/fold-const.c2015-02-11 15:06:52.628078961 +0100
@@ -10261,7 +10261,9 @@ fold_binary_loc (location_t loc,
tem = build2_loc (loc, LROTATE_EXPR,
  TREE_TYPE (TREE_OPERAND (arg0, 0)),
  TREE_OPERAND (arg0, 0),
- code0 == LSHIFT_EXPR ? tree01 : tree11);
+ code0 == LSHIFT_EXPR
+ ? TREE_OPERAND (arg0, 1)
+ : TREE_OPERAND (arg1, 1));
return fold_convert_loc (loc, type, tem);
  }
else if (code11 == MINUS_EXPR)
@@ -10283,7 +10285,8 @@ fold_binary_loc (location_t loc,
   ? LROTATE_EXPR
   : RROTATE_EXPR),
  TREE_TYPE (TREE_OPERAND (arg0, 
0)),
- TREE_OPERAND (arg0, 0), tree01));
+ TREE_OPERAND (arg0, 0),
+ TREE_OPERAND (arg0, 1)));
  }
else if (code01 == MINUS_EXPR)
  {
@@ -10304,7 +10307,7 @@ fold_binary_loc (location_t loc,
? LROTATE_EXPR
: RROTATE_EXPR),
   TREE_TYPE (TREE_OPERAND (arg0, 0)),
-  TREE_OPERAND (arg0, 0), tree11));
+  TREE_OPERAND (arg0, 0), TREE_OPERAND (arg1, 1)));
  }
  }
   }
--- gcc/testsuite/gcc.c-torture/compile/pr65014.c.jj2015-02-11 
15:21:19.175681614 +0100
+++ gcc/testsuite/gcc.c-torture/compile/pr65014.c   2015-02-11 
15:22:18.040703653 +0100
@@ -0,0 +1,10 @@
+/* PR tree-optimization/65014 */
+/* { dg-do compile { target int32plus } } */
+
+extern int x;
+
+unsigned
+foo (unsigned int y)
+{
+  return (y << ((__INTPTR_TYPE__) &x)) | (y >> (32 - ((__INTPTR_TYPE__) &x)));
+}

Jakub

[PATCH] Fix UBSAN_OBJECT_SIZE lowering (PR sanitizer/65019)

2015-02-11 Thread Jakub Jelinek

Hi!

Similar problem to the recently fixed UBSAN_VPTR lowering,
ubsan_expand_objsize_ifn sets *gsi to the first stmt in a new bb
after splitting block after UBSAN_OBJECT_SIZE, which is the next stmt
that should be processed, so we should always return no_next = true
to avoid gsi_next on it before it will be processed.

Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?

2015-02-12  Jakub Jelinek  

PR sanitizer/65019
* ubsan.c (ubsan_expand_objsize_ifn): Always return true.

* g++.dg/ubsan/pr65019.C: New test.

--- gcc/ubsan.c.jj  2015-02-10 22:58:55.0 +0100
+++ gcc/ubsan.c 2015-02-11 17:20:36.313063536 +0100
@@ -987,7 +987,7 @@ ubsan_expand_objsize_ifn (gimple_stmt_it
   /* Get rid of the UBSAN_OBJECT_SIZE call from the IR.  */
   unlink_stmt_vdef (stmt);
   gsi_remove (&gsi_orig, true);
-  return gsi_end_p (*gsi);
+  return true;
 }
 
 /* Cached __ubsan_vptr_type_cache decl.  */
--- gcc/testsuite/g++.dg/ubsan/pr65019.C.jj 2015-02-11 17:26:44.832959016 
+0100
+++ gcc/testsuite/g++.dg/ubsan/pr65019.C2015-02-11 17:26:23.0 
+0100
@@ -0,0 +1,24 @@
+// PR sanitizer/65019
+// { dg-do compile }
+// { dg-options "-fsanitize=alignment,object-size,vptr -std=c++11 -O2 
-fcompare-debug" }
+
+struct A { };
+struct B { };
+struct C final {
+  C (const A &, int);
+  static B *foo (const A &, int = 1);
+  virtual ~C ();
+  void *c;
+};
+
+B *
+C::foo (const A &x, int y)
+{
+  C *d = new C (x, y);
+  if (d->c == nullptr)
+delete d;
+}
+
+C::~C ()
+{
+}

Jakub

71 matches

Mail list logo