--- Comment #19 from pault at gcc dot gnu dot org 2009-12-18 14:32 ---
Can this now be closed or has it transmogrified itself into something else?
Paul
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
--- Comment #18 from rguenth at gcc dot gnu dot org 2009-06-14 13:39
---
Subject: Bug 40168
Author: rguenth
Date: Sun Jun 14 13:39:37 2009
New Revision: 148469
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=148469
Log:
2009-06-14 Richard Guenther
Backport from mainl
--- Comment #17 from rguenther at suse dot de 2009-06-14 12:31 ---
Subject: Re: missing
unrolling/scalarization/reassoc/free
On Sat, 6 Jun 2009, jv244 at cam dot ac dot uk wrote:
> --- Comment #16 from jv244 at cam dot ac dot uk 2009-06-06 07:08 ---
> (In reply to comment #1
--- Comment #16 from jv244 at cam dot ac dot uk 2009-06-06 07:08 ---
(In reply to comment #13)
> Subject: Bug 40168
Richard, this empty constructor patch was also OKed for 4.4 and has been on
mainline for a while.
http://gcc.gnu.org/ml/fortran/2009-05/msg00288.html
Do you intend to c
--- Comment #15 from rguenth at gcc dot gnu dot org 2009-05-27 11:01
---
The issue is not the temporary array but the way how CSE works. In S2 there
are simply no CSE opportunities - for example consider
t1 = a * b;
t2 = t1 * c;
t3 = a * c;
t4 = t3 * b;
The current CSE implem
--- Comment #14 from jv244 at cam dot ac dot uk 2009-05-18 12:19 ---
Created an attachment (id=17886)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=17886&action=view)
simplified testcase for common subexpressions.
Richard,
thanks very much for the first patch. I tried to get a be
--- Comment #13 from rguenth at gcc dot gnu dot org 2009-05-18 10:24
---
Subject: Bug 40168
Author: rguenth
Date: Mon May 18 10:24:34 2009
New Revision: 147659
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=147659
Log:
2009-05-18 Richard Guenther
PR fortran/40168
--- Comment #12 from rguenth at gcc dot gnu dot org 2009-05-18 08:53
---
I'm now testing the one-liner.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
--- Comment #11 from pinskia at gcc dot gnu dot org 2009-05-16 14:48
---
Part of this optimization could (should) be done in the middle-end see PR
36602.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
--- Comment #10 from jvdelisle at gcc dot gnu dot org 2009-05-16 13:46
---
Nice for a one liner patch.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
--- Comment #9 from jv244 at cam dot ac dot uk 2009-05-16 12:39 ---
BTW, the patch also applies to 4.4_branch and has the same positive effect...
pretty please ?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
--- Comment #8 from jv244 at cam dot ac dot uk 2009-05-16 12:20 ---
(In reply to comment #7)
> Subject: Re: missing
> unrolling/scalarization/reassoc/free
so, double good news.
First, the unrelated other testcase that speeds up by 30% does this thanks to
this patch only. This is a re
--- Comment #7 from rguenther at suse dot de 2009-05-16 11:39 ---
Subject: Re: missing
unrolling/scalarization/reassoc/free
On Sat, 16 May 2009, jv244 at cam dot ac dot uk wrote:
> --- Comment #6 from jv244 at cam dot ac dot uk 2009-05-16 11:31 ---
> (In reply to comment #5)
--- Comment #6 from jv244 at cam dot ac dot uk 2009-05-16 11:31 ---
(In reply to comment #5)
This looks somewhat different from what I get here.
trunk without patch:
vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200
--- Comment #5 from rguenth at gcc dot gnu dot org 2009-05-16 11:20 ---
With that patch and
-O3 -ffast-math -funroll-loops -mfpmath=sse -msse2 --param
max-completely-peel-times=27 --param max-completely-peeled-insns=1500
--params for allow unrolling of all innermost loops I get
./test
--- Comment #4 from jv244 at cam dot ac dot uk 2009-05-16 11:19 ---
(In reply to comment #3)
> Like so:
>
> Index: trans-expr.c
> ===
> --- trans-expr.c(revision 147583)
> +++ trans-expr.c(working copy)
> @@
--- Comment #3 from rguenth at gcc dot gnu dot org 2009-05-16 10:04 ---
Like so:
Index: trans-expr.c
===
--- trans-expr.c(revision 147583)
+++ trans-expr.c(working copy)
@@ -4430,7 +4430,8 @@ gfc_trans_zero_
--- Comment #2 from rguenth at gcc dot gnu dot org 2009-05-16 09:54 ---
translating
buffer1 = 0.0_dp
to
(void) __builtin_memset ((void *) &buffer1, 0, 648);
pessimizes the middle-end analysis because buffer1 is now addressable and
escapes. The maybe valid (if n
18 matches
Mail list logo