[Bug target/71722] incorrect code for test pr64252.c for -mcpu=power9 -mpower9-vector -ftree-vectorize -O3

2016-07-02 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71722 --- Comment #4 from Bill Schmidt --- Another possibility is all the vperm instructions, as this is little endian and we might expect to see vpermr on occasion. That's hard to tell without digging deeper.

[Bug target/71297] ICE on invalid code in altivec_resolve_overloaded_builtin (rs6000-c.c:5106) on powerpc64le-linux

2016-07-06 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71297 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org --- Comment

[Bug target/71201] PowerPC XXPERM instruction fails on ISA 3.0 system.

2016-07-06 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71201 Bill Schmidt changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/71297] [7 regression] ICE on invalid code in altivec_resolve_overloaded_builtin (rs6000-c.c:5106) on powerpc64le-linux

2016-07-07 Thread wschmidt at gcc dot gnu.org
||2016-07-07 Assignee|unassigned at gcc dot gnu.org |wschmidt at gcc dot gnu.org Summary|ICE on invalid code in |[7 regression] ICE on |altivec_resolve_overloaded_ |invalid code in |builtin (rs6000-c.c:5106

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-07-08 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #1 from Bill Schmidt --- Interesting. How do we enable/disable code hoisting? I don't see a documented option for this.

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-07-08 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #3 from Bill Schmidt --- OK. I'm busy wrapping up some things before a vacation, but I'll plan to look into this when I get back.

[Bug target/71297] [7 regression] ICE on invalid code in altivec_resolve_overloaded_builtin (rs6000-c.c:5106) on powerpc64le-linux

2016-07-08 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71297 --- Comment #3 from Bill Schmidt --- Author: wschmidt Date: Fri Jul 8 15:42:47 2016 New Revision: 238168 URL: https://gcc.gnu.org/viewcvs?rev=238168&root=gcc&view=rev Log: [gcc] 2016-07-08 Bill Schmidt PR target/71297 * con

[Bug target/71297] [7 regression] ICE on invalid code in altivec_resolve_overloaded_builtin (rs6000-c.c:5106) on powerpc64le-linux

2016-07-08 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71297 Bill Schmidt changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/71805] incorrect code for test pr45752.c with -mcpu=power9

2016-07-13 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71805 --- Comment #7 from Bill Schmidt --- *** Bug 71731 has been marked as a duplicate of this bug. ***

[Bug target/71731] incorrect result for vectorized char rotate with -mcpu=power9

2016-07-13 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71731 Bill Schmidt changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug tree-optimization/71915] A missed opportunity for SLSR

2016-07-24 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71915 Bill Schmidt changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |wschmidt at gcc dot gnu.org

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-07-25 Thread wschmidt at gcc dot gnu.org
||2016-07-25 Assignee|unassigned at gcc dot gnu.org |wschmidt at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #4 from Bill Schmidt --- Confirmed. Will have a look soon.

[Bug target/72103] ICE with gcc 7 for povray benchmark

2016-07-28 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72103 --- Comment #8 from Bill Schmidt --- Per c#3, could we please have another bug opened to track the tragic code generation opportunity? ;)

[Bug target/72747] New: powerpc: wrong code generated for vec_splats in cascading assignment

2016-07-28 Thread wschmidt at gcc dot gnu.org
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- A statement such as "v = vec_splats (1);" correctly initializes a vector. However, a statement such as "v[1] = v[

[Bug target/72747] powerpc: wrong code generated for vec_splats in cascading assignment

2016-07-28 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72747 Bill Schmidt changed: What|Removed |Added Target||powerpc64le-unknown-linux-g

[Bug rtl-optimization/69847] Spec 2006 403.gcc slows down with -mlra vs. reload on PowerPC

2016-07-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69847 --- Comment #14 from Bill Schmidt --- Thanks, Vlad! I'll do some benchmarking with this patch in the next few days. Much obliged!

[Bug rtl-optimization/69847] Spec 2006 403.gcc slows down with -mlra vs. reload on PowerPC

2016-08-01 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69847 --- Comment #16 from Bill Schmidt --- Hi Vlad, I need to re-run my tests one more time because I goofed up the build on a few of them; however, I was able to verify that the degradation on 403.gcc has now gone away (I saw a slight improvement wi

[Bug rtl-optimization/69847] Spec 2006 403.gcc slows down with -mlra vs. reload on PowerPC

2016-08-02 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69847 --- Comment #17 from Bill Schmidt --- Vlad, the patch checks out very well on powerpc64le. 403.gcc no longer degrades. We are seeing some very nice improvements from LRA over reload on a few benchmarks (435.gromacs leads the way with +9.5%). E

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-08-02 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #5 from Bill Schmidt --- I'll note that in the case where the stride is known (slsr-35.c), SLSR is making at least a somewhat rational decision based on cost not to strength-reduce the phi candidate. In this case the stride is a powe

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-08-02 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #6 from Bill Schmidt --- Actually, it looks like a similar problem for the unknown stride case. Again there is logic that relies on single-reached-use for determining what expressions go dead. We need to factor in expressions that g

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-08-03 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #7 from Bill Schmidt --- I have a prototype that fixes this in the obvious way and it causes both slsr-35.c and slsr-36.c to pass again without turning off code hoisting. I'll do a regstrap and then work on some benchmark testing. I

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-08-08 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #8 from Bill Schmidt --- Created attachment 39085 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=39085&action=edit Patch under test Attaching a patch that passes regstrap. I want to do a little benchmarking before submitting i

[Bug rtl-optimization/72855] New: Long compile time due to integrity checking during dataflow analysis per loop

2016-08-09 Thread wschmidt at gcc dot gnu.org
: normal Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- Created attachment 39091 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=39091&action=edit C

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #1 from Bill Schmidt --- Egad. How appalling. I'll have a look soon.

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #5 from Bill Schmidt --- (In reply to Richard Biener from comment #2) > If we have release checking enabled then we shuould hit > > static void > df_analyze_1 (void) > { > ... > #ifndef ENABLE_DF_CHECKING > if (df->changeable_flags

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #6 from Bill Schmidt --- (In reply to amker from comment #4) > It reduces compile time for powerpc-elf on x86_64 machine from 54m to 5m. > The compiler is configured with checking. With "--enable-checking=release", > the current tru

[Bug tree-optimization/71815] SLSR misses several PHI candidate cases

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71815 --- Comment #9 from Bill Schmidt --- I'm not comfortable with the results of the patch. Overall I see a slight improvement for SPECint CPU2006 and a slightly larger degradation for SPECfp CPU2006. But there are some individual slowdowns that ar

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 Bill Schmidt changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #3 from Bill Schmidt --- This is a phase ordering issue involving the expanders for the built-ins. In vsx.md: ;; Explicit load/store expanders for the builtin functions (define_expand "vsx_load_" [(set (match_operand:VSX_M 0 "vsx

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #10 from Bill Schmidt --- Some experiments on trunk: - Using Bin's patch, I see compile time reduced to ~14 minutes. - Using Richi's patch, I see compile time reduced to ~9 minutes. So both are quite helpful compared to somewhere ar

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 Bill Schmidt changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |wschmidt at gcc dot gnu.org

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-10 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #5 from Bill Schmidt --- Regstrap passes. I'll prepare the formal patch tomorrow. Thanks for the report!

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #12 from Bill Schmidt --- Last night before I ran out of time, I built a debug compiler on gcc-6-branch, and flag_checking was always 0, and I didn't have the compile time issue. So it appears to be something that only occurs with a

[Bug tree-optimization/74585] New: [5/6/7] Tree-sra forces parameters to memory causing awful code generation

2016-08-11 Thread wschmidt at gcc dot gnu.org
: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- For the PowerPC64 ELF V2 ABI, homogeneous aggregates of vectors (any combination of structs and arrays containing

[Bug tree-optimization/74585] [5/6/7] Tree-sra forces parameters to memory causing awful code generation

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 Bill Schmidt changed: What|Removed |Added Target||powerpc64*-*-* CC|

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #6 from Bill Schmidt --- Author: wschmidt Date: Thu Aug 11 21:39:49 2016 New Revision: 239394 URL: https://gcc.gnu.org/viewcvs?rev=239394&root=gcc&view=rev Log: [gcc] 2016-08-11 Bill Schmidt PR target/72863 * vsx

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #14 from Bill Schmidt --- Oh, I should have mentioned, it passed bootstrap with no regressions, so the patch LGTM.

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #15 from Bill Schmidt --- For your patch submission, the testing was done on powerpc64le-unknown-linux-gnu.

[Bug rtl-optimization/72855] Long compile time due to integrity checking during dataflow analysis per loop

2016-08-11 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72855 --- Comment #16 from Bill Schmidt --- Author: wschmidt Date: Thu Aug 11 22:20:41 2016 New Revision: 239395 URL: https://gcc.gnu.org/viewcvs?rev=239395&root=gcc&view=rev Log: 2016-08-11 Richard Biener Bill Schmidt PR rtl

[Bug tree-optimization/74585] SRA forces parameters to memory causing awful code generation

2016-08-12 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #5 from Bill Schmidt --- (In reply to Richard Biener from comment #1) > The issue is not SRA pushing things to memory - it doesn't. The issue is > that in the GIMPLE IL the parameter appears as "memory" as it is an > aggregate type.

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-12 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 Bill Schmidt changed: What|Removed |Added Component|tree-optimization |rtl-optimization Summary|SRA f

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-12 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #8 from Bill Schmidt --- FYI, adding -mcpu=power9 to the options makes it much easier to read the RTL, as it gets rid of the extra vector swaps needed for POWER8.

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-12 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #9 from Bill Schmidt --- We do optimize things well for the following: typedef struct { __vector double vx0; __vector double vx1; __vector double vx2; __vector double vx3; } vdoublex8_t; vdoublex8_t test_vecd8_rotate_left (v

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-12 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #10 from Bill Schmidt --- The dse pass is responsible for removing all the unnecessary stack activity. I think that we are probably confusing it because the stores are full vector stores, but the loads are vector element loads of sma

[Bug rtl-optimization/63491] Ice in LRA with simple vector test case on power

2016-08-15 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63491 --- Comment #16 from Bill Schmidt --- Ping...

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-15 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #11 from Bill Schmidt --- With the original test case, -mcpu=power8 is problematic because of the use of the "swapping stores," whose RHS is a vec_select rather than a register or subreg. This prevents us from saving the RHS of the s

[Bug c++/77034] New: [6.2RC regression] g++.dg/init/elide5.C fails on powerpc64-unknown-linux-gnu with -m32

2016-08-15 Thread wschmidt at gcc dot gnu.org
Severity: normal Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- Testing the release candidate, I see a regression versus the 6.1 release when compiling the referenced test. From

[Bug c++/77034] [6.2RC regression] g++.dg/init/elide5.C fails on powerpc64-unknown-linux-gnu with -m32

2016-08-15 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77034 Bill Schmidt changed: What|Removed |Added Target||powerpc64-unknown-linux-gnu

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-15 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #12 from Bill Schmidt --- The rest of the ugly code (once you ignore the loads/stores) is horrible choices of register allocation. Need to understand why we're not making use of the high floating-point registers; too much copying bac

[Bug rtl-optimization/74585] powerpc64: Very poor code generation for homogeneous vector aggregates passed in registers

2016-08-16 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=74585 --- Comment #14 from Bill Schmidt --- (In reply to Richard Biener from comment #13) > > You mean stores like the following? > > (insn 13 12 14 2 (set (mem/c:V4SI (plus:DI (reg/f:DI 150 virtual-stack-vars) > (const_int 112 [0x70]

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-25 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #7 from Bill Schmidt --- Author: wschmidt Date: Thu Aug 25 14:24:17 2016 New Revision: 239761 URL: https://gcc.gnu.org/viewcvs?rev=239761&root=gcc&view=rev Log: [gcc] 2016-08-25 Bill Schmidt Backport from mainline

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-25 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 --- Comment #8 from Bill Schmidt --- Author: wschmidt Date: Thu Aug 25 16:12:23 2016 New Revision: 239762 URL: https://gcc.gnu.org/viewcvs?rev=239762&root=gcc&view=rev Log: [gcc] 2016-08-25 Bill Schmidt Backport from mainline (minus

[Bug target/72863] Powerpc64le: redundant swaps when using vec_vsx_ld/st

2016-08-25 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72863 Bill Schmidt changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug ada/77405] SIGBUS from gnatmake in stage 3 (gcc 7.0)

2016-08-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77405 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org --- Comment

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 Bill Schmidt changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #2 from Bill Schmidt --- See also https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77405 which appears to be very similar (slight difference in the error message).

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #6 from Bill Schmidt --- Backtrace info (svn r239837): Program received signal SIGSEGV, Segmentation fault. system.secondary_stack.ss_release (m=...) at ../rts/s-secsta.adb:479 479 To_Stack_Ptr (M.Sstk).Top := M.Sptr; (g

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-29 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #7 from Bill Schmidt --- It does look possible that this is an LRA issue. Here's the code right before failure: 100dae08: f8 95 22 39 addir9,r2,-27144 100dae0c: 01 00 40 39 li r10,1 100dae10: 00 00

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #11 from Bill Schmidt --- (In reply to Eric Botcazou from comment #8) > > I'm afraid I don't know anything about Ada and how its runtime works; it > > looks like system.secondary_stack.ss_release is called automatically somehow > > as

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #12 from Bill Schmidt --- (In reply to Eric Botcazou from comment #10) > > So the double-word load before the call to SS_Release should be from a > Mark_Id object obtained from a preceding call to SS_Mark. It indeed looks > like the

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #14 from Bill Schmidt --- Confirmed that the frame pointer dance is where the issue is. Prior to dse2: (insn 3854 144 4133 2 (set (reg:DI 9 9 [1674]) (const_int 1208 [0x4b8])) /home/wschmidt/gcc/gcc-mainline-base/gcc/ada/\ m

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #15 from Bill Schmidt --- Created attachment 39520 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=39520&action=edit Dumps before and after dse2 Here are the full dumps before and after dse2 in tarball form.

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #16 from Bill Schmidt --- Looks like the back end must be inserting the frame pointer adjustments, as they aren't there at expand time. From the 218.vregs dump: (call_insn 141 140 3128 2 (parallel [ (set (reg:TI 3 3)

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #18 from Bill Schmidt --- The frame pointer adjustments are introduced in 263r.split2. I haven't yet run down the offending split, but the pattern being split is a *vsx_movti_64bit. I know we've had changes in the back end fairly re

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #19 from Bill Schmidt --- I'm suspicious of rs6000_split_multireg_move in rs6000.c, which appears to be the code that gets called to split a TImode move involving a GPR pair. In particular, this code: else

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #21 from Bill Schmidt --- I think for the purposes of this bug we should be able to work around it by forcing the offset register to be modified instead of the base register. Going to try testing this: Index: gcc/config/rs6000/rs600

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #23 from Bill Schmidt --- Bleah, that doesn't work because offsetreg needs to contain something that's a valid address in order to use replace_equiv_address. So something more involved is needed.

[Bug target/72827] [7 Regression] gnat bootstrap broken on powerpc64le-linux-gnu

2016-08-30 Thread wschmidt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72827 --- Comment #24 from Bill Schmidt --- This seems to work as a short-term solution (c,c++,ada bootstrap succeeds). Need to do a full regstrap with all the languages. Index: gcc/config/rs6000/rs6000.c =

[Bug ipa/96825] [11 Regression] Commit r11-2645 degrades CPU2017 548.exchange2_r by 35%

2021-03-17 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96825 --- Comment #3 from Bill Schmidt --- Is this going to be addressed in GCC 11? Should this be only a P3?

[Bug target/57547] Missing vector intrinsics in PowerPC Altivec documentation

2021-04-16 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=57547 Bill Schmidt changed: What|Removed |Added Assignee|kelvin at gcc dot gnu.org |wschmidt at gcc dot gnu.org

[Bug target/88696] Power VSX builtins missing vmuluwm / vector int vec_mul (vector int, vector int);

2021-04-16 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
at gcc dot gnu.org |wschmidt at gcc dot gnu.org Resolution|--- |FIXED --- Comment #4 from Bill Schmidt --- Documentation fixed today to point to the master intrinsic reference document, which has the vec_mul omissions fixed.

[Bug libstdc++/94080] -mabi=ieeelongdouble and -mfloat128 cause libstc++ header breakage

2021-04-19 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94080 --- Comment #4 from Bill Schmidt --- Thanks, Jonathan!

[Bug target/98519] rs6000: @pcrel unsupported on this instruction error in pveclib

2021-01-05 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98519 --- Comment #10 from Bill Schmidt --- But it seems we would also need a new constraint that does permit PC-relative addresses, since new code will/may not have a TOC.

[Bug target/98519] rs6000: @pcrel unsupported on this instruction error in pveclib

2021-01-05 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98519 --- Comment #12 from Bill Schmidt --- Right...but if somebody specifies an instruction with a 'p' that is legitimately a pc-relative instruction, we don't want to say that the memory operand can't use PC-relative addressing, do we? I just want t

[Bug target/98519] rs6000: @pcrel unsupported on this instruction error in pveclib

2021-01-05 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98519 --- Comment #14 from Bill Schmidt --- I agree, Peter.

[Bug libstdc++/94080] -mabi=ieeelongdouble and -mfloat128 cause libstc++ header breakage

2021-01-21 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94080 --- Comment #2 from Bill Schmidt --- Let's see, with patches from late last year, can this be closed now?

[Bug testsuite/98325] [11 regression] gcc.dg/pr25376.c fails after r11-5027

2021-01-22 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98325 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org --- Comment

[Bug target/98959] ICE in extract_constrain_insn, at recog.c:2670

2021-02-07 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98959 --- Comment #14 from Bill Schmidt --- We should definitely not be allowing the AltiVec "& ~16" flavors into these patterns. I'm not certain whether your fix is the best way to achieve that, but it could well be; I'll defer to Segher on that.

[Bug target/98734] ABI diagnostics emitted despite always_inline attribute

2021-05-19 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98734 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org --- Comment

[Bug c++/100809] PPC: __int128 divide/modulo does not use P10 instructions vdivsq/vdivuq

2021-06-01 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100809 --- Comment #2 from Bill Schmidt --- I believe this work is pending, but the patches are still under review.

[Bug testsuite/100750] new test case gcc.target/powerpc/rop-5.c fails on BE

2021-06-01 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100750 --- Comment #3 from Bill Schmidt --- Fixed the BE problem. Will look into the GCC11 report.

[Bug testsuite/100750] new test case gcc.target/powerpc/rop-5.c fails on BE

2021-06-01 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100750 --- Comment #4 from Bill Schmidt --- I cannot reproduce failures for powerpc64le on P10 LE.

[Bug testsuite/100750] new test case gcc.target/powerpc/rop-5.c fails on BE

2021-06-02 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100750 Bill Schmidt changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/100706] Invalid instructions in plt calls on PPC

2021-06-02 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100706 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org

[Bug testsuite/100749] [12 regression] gcc.dg/pch/valid-1.c fails after r12-949

2021-06-02 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100749 Bill Schmidt changed: What|Removed |Added CC||wschmidt at gcc dot gnu.org --- Comment

[Bug target/100703] __vector_pair and __vector_quad cannot be passed by reference

2021-06-03 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100703 Bill Schmidt changed: What|Removed |Added Resolution|--- |INVALID Status|UNCONFIRMED

[Bug target/100930] PPC: Missing builtins for P9 vextsb2w, vextsb2w, vextsb2d, vextsh2d, vextsw2d

2021-06-06 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100930 Bill Schmidt changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug target/100930] PPC: Missing builtins for P9 vextsb2w, vextsb2w, vextsb2d, vextsh2d, vextsw2d

2021-06-10 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100930 --- Comment #2 from Bill Schmidt --- Carl Love implemented these on trunk yesterday. They will be backported to GCC 11 in a week or so, at which point we can close this.

[Bug target/101022] New: rs6000: __builtin_altivec_vcmpequt expands to wrong pattern

2021-06-10 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- This line appears in a recent patch committed this week: +BU_P10V_AV_2 (VCMPEQUT,"vcmpequt", CONST,

[Bug target/101022] rs6000: __builtin_altivec_vcmpequt expands to wrong pattern

2021-06-10 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101022 Bill Schmidt changed: What|Removed |Added CC||bergner at gcc dot gnu.org,

[Bug target/101022] rs6000: __builtin_altivec_vcmpequt expands to wrong pattern

2021-06-10 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101022 --- Comment #2 from Bill Schmidt --- Looks like the proper pattern should be altivec_eqv1ti.

[Bug target/101022] rs6000: __builtin_altivec_vcmpequt expands to wrong pattern

2021-06-11 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101022 --- Comment #4 from Bill Schmidt --- Hi Carl -- while you're in there, can you please remove these? +BU_P10_OVERLOAD_2 (VRLQ, "vrlq") +BU_P10_OVERLOAD_2 (VSLQ, "vslq")

[Bug target/100866] PPC: Inefficient code for vec_revb(vector unsigned short) < P9

2021-06-21 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100866 --- Comment #10 from Bill Schmidt --- Right, it would be a good optimization. We've stopped focusing much on P8 optimization work at this point simply because of lack of resources. The needed transform is to recognize load-xxlnor-vperm as a gr

[Bug target/100866] PPC: Inefficient code for vec_revb(vector unsigned short) < P9

2021-06-21 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100866 --- Comment #11 from Bill Schmidt --- Segher, does this fit naturally in combine?

[Bug target/95082] LE implementations of vec_cnttz_lsbb and vec_cntlz_lsbb are wrong

2022-02-23 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95082 Bill Schmidt changed: What|Removed |Added Known to fail|11.0| Summary|[11] LE implementatio

[Bug target/103622] [12 Regression] ICE: Segmentation fault (in altivec_resolve_new_overloaded_builtin)

2022-01-05 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103622 Bill Schmidt changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/103981] New: powerpc64le: Wrong code generated for vec_cntlz_lsbb, vec_cnttz_lsbb

2022-01-11 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wschmidt at gcc dot gnu.org Target Milestone: --- Per the PVIPR documentation, vec_cntlz_lsbb should generate vctzlsbb for little endian, and vclzlsbb for big endian

[Bug target/103981] powerpc64le: Wrong code generated for vec_cntlz_lsbb, vec_cnttz_lsbb

2022-01-11 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103981 Bill Schmidt changed: What|Removed |Added Ever confirmed|0 |1 Target|

[Bug tree-optimization/81953] Code sinking increases register pressure

2022-01-11 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81953 Bill Schmidt changed: What|Removed |Added Status|ASSIGNED|NEW --- Comment #6 from Bill Schmidt ---

[Bug rtl-optimization/68212] Loop unroller breaks basic block frequencies

2022-01-13 Thread wschmidt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68212 Bill Schmidt changed: What|Removed |Added Assignee|kelvin at gcc dot gnu.org |unassigned at gcc dot gnu.org

<    11   12   13   14   15   16   17   >